Implementation of VTLN for Statistical Speech Synthesis

Vocal tract length normalization (VTLN) is an important feature normalization technique that can be used to perform speaker adaptation when very little adaptation data is available. Earlier work showed that VTLN can be applied to statistical speech synthesis and that it gives additive improvements on top of CMLLR. This paper presents an EM optimization for estimating more accurate warping factors. The EM formulation embeds the feature normalization in the HMM training, which makes warping factor estimation more efficient and enables the use of multiple (appropriate) warping factors for different state clusters of the same speaker.
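To make the notion of a warping factor concrete, the following is a minimal sketch and not the paper's method. It assumes the commonly used all-pass (bilinear) frequency warping function, which the abstract does not name, and it replaces the EM optimization with a brute-force grid search over candidate factors. The function names, the toy score, and the grid are hypothetical choices made for illustration only.

```python
import math

def bilinear_warp(omega, alpha):
    """Warp a normalized angular frequency omega (in [0, pi]) by factor alpha.

    Assumption: all-pass (bilinear) warping, a common choice for VTLN;
    the abstract does not specify the warping function.
    """
    return omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                    1.0 - alpha * math.cos(omega))

def grid_search_alpha(score, alphas):
    """Return the candidate warping factor with the highest score.

    This is a plain grid search, not the EM procedure described above.
    In a real system `score` would be a model log-likelihood of the
    warped features; here it is any callable alpha -> float.
    """
    return max(alphas, key=score)

# Toy setup (hypothetical): a spectral peak observed at 1.0 rad should be
# mapped to the position produced by an unknown "true" factor of 0.05.
observed_peak = 1.0
target_peak = bilinear_warp(observed_peak, 0.05)

def toy_score(alpha):
    # Negative squared distance between warped peak and target position.
    return -(bilinear_warp(observed_peak, alpha) - target_peak) ** 2

candidates = [round(-0.10 + 0.01 * i, 2) for i in range(21)]
best_alpha = grid_search_alpha(toy_score, candidates)
```

In this sketch a single factor is chosen for the whole toy input. The multiple warping factors per state cluster mentioned in the abstract would correspond to running a separate estimation per cluster, which the sketch does not attempt.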
