Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
We propose a new auditory inspired feature extraction technique for automatic speech recognition (ASR). Features are extracted by filtering the temporal trajectory of spectral energies in each critical band of speech by a bank of finite impulse response (FIR) filters. Impulse responses of these filters are derived from a modified Gabor envelope in order to emulate asymmetries of the temporal receptive field (TRF) profiles observed in higher level auditory neurons. We obtain relative improvement in word error rate on OGI-Digits database and, relative improvement in phoneme error rate on TIMIT database over the MRASTA technique.
Mahsa Shoaran, Uisub Shin, Bingzhao Zhu
Subrahmanya Pavankumar Dubagunta