SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis

In this paper, we investigate external phone duration models (PDMs) for improving the quality of synthetic speech in hidden Markov model (HMM)-based speech synthesis. Support Vector Regression (SVR) and Multilayer Perceptron (MLP) models were used for this task and compared with the explicit duration modelling of hidden semi-Markov models (HSMMs). Experiments on an American English database showed that the SVR outperformed both the MLP and the HSMM duration modelling in objective and subjective evaluations. In the objective test, the SVR achieved relative improvements of 15.3% and 25.09% in root mean square error (RMSE) over the MLP and HSMM models, respectively. In the subjective evaluation on synthesized speech, the SVR model was preferred over the MLP and HSMM models, achieving preference scores of 35.93% and 56.30%, respectively.
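
The objective metric mentioned above can be illustrated with a minimal sketch. It assumes the common definition of relative improvement, (baseline RMSE - candidate RMSE) / baseline RMSE; the abstract does not state the exact formula used. The function names and the toy durations below are hypothetical and are not taken from the paper's data.

```python
import math


def rmse(predicted, reference):
    """Root mean square error between predicted and reference durations."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - r) ** 2 for p, r in zip(predicted, reference)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def relative_improvement(baseline_rmse, candidate_rmse):
    """Relative RMSE reduction of the candidate over the baseline, in percent.

    Assumed definition: 100 * (baseline - candidate) / baseline.
    """
    return 100.0 * (baseline_rmse - candidate_rmse) / baseline_rmse


# Invented phone durations in milliseconds, for illustration only.
reference = [80.0, 120.0, 60.0, 100.0]
baseline_pred = [90.0, 100.0, 70.0, 120.0]    # stands in for a baseline model
candidate_pred = [85.0, 110.0, 65.0, 110.0]   # stands in for an improved model

baseline_rmse = rmse(baseline_pred, reference)
candidate_rmse = rmse(candidate_pred, reference)
improvement = relative_improvement(baseline_rmse, candidate_rmse)
print(f"baseline RMSE:  {baseline_rmse:.2f} ms")
print(f"candidate RMSE: {candidate_rmse:.2f} ms")
print(f"relative improvement: {improvement:.1f}%")
```

With these toy values every candidate error is half the corresponding baseline error, so the relative improvement comes out at 50%. The figures reported in the abstract would be obtained the same way from each model's RMSE on the test set, if the paper follows this definition.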

