Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR
Publications associées (57)
Graph Chatbot
Chattez avec Graph Search
Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.
AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.
TDOA- (time difference of arrival-) based algorithms are common methods for speech source localization. The generalized cross correlation (GCC) method is the most important approach for estimating TDOA between microphone pairs. The performance of this meth ...
By ignoring events originating in noisy areas of a position-sensitive single-photon avalanche diode (SPAD), reduction of noise from fixed-position defects is experimentally shown. Additional experimental results from a position-sensitive SPAD integrated in ...
It has been shown that the tensor calculation is very sensitive to the presence of noise in the acquired images, yielding to very low-quality Diffusion Tensor Images (DTI) data. Recent investigations have shown that the noise present in the Diffusion Weigh ...
This paper proposes a new method for bimodal information fusion in audio-visual speech recognition, where cross-modal association is considered in two levels. First, the acoustic and the visual data streams are combined at the feature level by using the ca ...
In the past decades, two recording tools have established themselves as the working horses in the field of electrophysiological cell research: the microelectrode array (MEA) and the optical fluorescence imaging. The former is a grid of miniature electrodes ...
This paper investigates detection of English keywords in a conversational scenario using a combination of acoustic and LVCSR based keyword spotting systems. Acoustic KWS systems search predefined words in parameterized spoken data. Corresponding confidence ...
Comprehensive analysis of noise sources in photocharge detectors leads to two novel, compact pixel circuits for ultra-low-noise light detection using optimum bandwidth engineering. A synchronous 4T CMOS image sensor pixel with in-pixel amplification reache ...
This paper investigates detection of English keywords in a conversational scenario using a combination of acoustic and LVCSR based keyword spotting systems. Acoustic KWS systems search predefined words in parameterized spoken data. Corresponding confidence ...
Uncertainty Feature Optimization is a framework to cope with optimization problems due to noisy data, using an implicit characterazation of the noise. The Aircraft Scheduling Problem (ASP) is a particular case of such problems, where disruptions randomly p ...
Merging decisions from different modalities is a crucial problem in Audio-Visual Speech Recognition. To solve this, state synchronous multi-stream HMMs have been proposed for their important advantage of incorporating stream reliability in their fusion sch ...