Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition ...
We propose a vision based real-time object recognition system, that provides object identification and 3D position data for the automatic initialization of a 3D tracking system. A-priori information is generated using the models of objects which may be pre ...
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition ...
This work presents an Offline Cursive Word Recognition System dealing with single writer samples. The system is a continuous density hiddden Markov model trained using either the raw data, or data transformed using Principal Component Analysis or Independe ...
Optical mode crossing is not a plausible explanation for the new broad Brillouin doublet nor for the strong acoustic anomalies observed at low temperatures in SrTiO3. Data presented to support that explanation are also inconclusive. ...
This work presents an Offline Cursive Word Recognition System dealing with single writer samples. The system is a continuous density hiddden Markov model trained using either the raw data, or data transformed using Principal Component Analysis or Independe ...
Automatic speech recognition (ASR) performance falls dramatically with the level of mismatch between training and test data. The human ability to recognise speech when a large proportion of frequencies are dominated by noise has inspired the "missing data" ...
Automatic speech recognition (ASR) performance falls dramatically with the level of mismatch between training and test data. The human ability to recognise speech when a large proportion of frequencies are dominated by noise has inspired the "missing data" ...
This article reviews the available methods forautomated identification of objects in digital images. The techniques are classified into groups according to the nature of the computational strategy used. Four classes are proposed: (1) the s~mplest strategie ...