Jean-Marc Odobez, Petr Motlicek, Weipeng He
We propose a novel multi-task neural network-based approach for joint sound source localization and speech/non-speech classification in noisy environments. The network takes the raw short-time Fourier transform as input and outputs the likelihood values for th ...
ISCA-INT SPEECH COMMUNICATION ASSOC, 2018