Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition ...
This work presents an Offline Cursive Word Recognition System dealing with single writer samples. The system is a continuous density hidden Markov model trained using either the raw data, or data transformed using Principal Component Analysis or Independe ...
Automatic speech recognition (ASR) performance falls dramatically with the level of mismatch between training and test data. The human ability to recognise speech when a large proportion of frequencies are dominated by noise has inspired the "missing data" ...
Optical mode crossing is not a plausible explanation for the new broad Brillouin doublet nor for the strong acoustic anomalies observed at low temperatures in SrTiO3. Data presented to support that explanation are also inconclusive. ...
This article reviews the available methods for automated identification of objects in digital images. The techniques are classified into groups according to the nature of the computational strategy used. Four classes are proposed: (1) the simplest strategie ...
We propose a vision-based real-time object recognition system that provides object identification and 3D position data for the automatic initialization of a 3D tracking system. A priori information is generated using the models of objects which may be pre ...