Baseline System for Automatic Speech Recognition with French GlobalPhone Database

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

This report presents one month trainee work on development of French Automatic Speech Recognition ASR system using a french part of multilingual database GlobalPhone_FR. The purpose of this report is to explain and give results of the training and testing of the ASR with this specific database. Two different methods are presented, the Hidden Markov Model (HMM) with MFCC/PLP features and tandem features from Multilayer Perceptron (MLP) phone posteriors. The report presents data preparation for GlobalPhone_FR ASR training, and compares the two different approaches. Word recognition accuracy achieved with MFCC features is 71.46% and the tandem features with 3-layer MLP improved the accuracy to 72.15%. We interpret this result as a baseline for the GlobalPhone_FR database.

Baseline System for Automatic Speech Recognition with French GlobalPhone Database

Graph Chatbot

Chat with Graph Search

Benign Overfitting in Deep Neural Networks under Lazy Training

An exact mapping from ReLU networks to spiking neural networks

A unified framework for Hamiltonian deep neural networks

A unified framework for Hamiltonian deep neural networks

Benign Overfitting in Deep Neural Networks under Lazy Training

An exact mapping from ReLU networks to spiking neural networks