Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares

The multi-channel Wiener filter (MWF) is a well-known multi-microphone speech enhancement technique, aiming at improving the quality of the recorded speech signals in noisy and reverberant environments. Assuming that reverberation and ambient noise can be modeled as a diffuse sound field and the spatial coherence of the residual noise is known, the MWF requires estimates of the relative early transfer function (RETF) vector of the target speaker as well as the power spectral densities (PSDs) of the target, diffuse and residual noise component. RETF vector and PSD estimation is often decoupled, where one quantity is estimated independently of the other quantity. In this paper, we propose to jointly estimate the RETF vector and all PSDs by minimizing the Frobenius norm of a model-based error matrix using an alternating least squares method. Experimental results using different dynamic acoustic scenarios with a moving speaker show that the proposed method leads to a larger MWF performance than a state-of-the-art method based on covariance whitening.

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares

Graph Chatbot

Chat with Graph Search

Sparsely Observed Functional Time Series: Theory and Applications

SiML: Sieved Maximum Likelihood for Array Signal Processing

Parameter Estimation of Three-Phase Untransposed Short Transmission Lines from Synchrophasor Measurements

Sparsely Observed Functional Time Series: Theory and Applications

SiML: Sieved Maximum Likelihood for Array Signal Processing

Parameter Estimation of Three-Phase Untransposed Short Transmission Lines from Synchrophasor Measurements