Hervé Bourlard, Apoorv Vyas
In this work, we investigate if the wav2vec 2.0 self-supervised pretraining helps mitigate the overfitting issues with connectionist temporal classification (CTC) training to reduce its performance gap with flat-start lattice-free MMI (E2E-LFMMI) for autom ...
ISCA-INT SPEECH COMMUNICATION ASSOC2021