Publication

Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition

Related publications (36)

Two-level bimodal association for audio-visual speech recognition

Touradj Ebrahimi, Jong Seok Lee

This paper proposes a new method for bimodal information fusion in audio-visual speech recognition, where cross-modal association is considered in two levels. First, the acoustic and the visual data streams are combined at the feature level by using the ca ...

Springer-Verlag, 2009

Assessing airport noise, demand for quietness and land-structure substitution

Marco Salvi

This dissertation collects three essays on the hedonic modelling of housing prices, location attributes and environmental amenities – or lack thereof. The first essay applies spatial econometric techniques to measure the impact of airport noise on the pric ...

EPFL, 2009

Noise Modeling in Lateral Nonuniform MOSFET

Christian Enz, Jean-Michel Sallese, Ananda Sankar Roy

In this paper, we present an analytical noise modeling methodology for lateral nonuniform MOSFET. We demonstrate that the noise properties of lateral nonuniform MOSFETs are considerably different from the prediction obtained with the conventional Klaassen- ...

2007

Noise and small-signal modeling of nanoscale MOSFETs

Ananda Sankar Roy

After years of intensive research effort, the design of RF integrated circuits in CMOS has now reached a wide acceptance for industrial designs. This is due to the high unity gain frequency and low-noise performance of today's deep sub micrometer MOS trans ...

EPFL, 2007

A multimodal pattern recognition framework for speaker detection

Patricia Besson

Speaker detection is an important component of a speech-based user interface. Audiovisual speaker detection, speech and speaker recognition or speech synthesis for example find multiple applications in human-computer interaction, multimedia content indexin ...

EPFL, 2007

Development and implementation of the areawide Dynamic ROad traffic NoisE (DRONE) simulator

Edward Chung, Ashish Bhaskar

This paper discusses the areawide Dynamic ROad traffic NoisE (DRONE) simulator, and its implementation as a tool for noise abatement policy evaluation. DRONE involves integrating a road traffic noise estimation model with a traffic simulator to estimate ro ...

2007

Harmonic Plus Noise Model for Concatenative Speech Synthesis

This project develops a new Harmonic Plus Noise model for concatenative speech synthesis. The software is composed of an analysis part (off-line process) applied on the first initial database and a synthesis part (real time process) applied o ...

IDIAP, 2005

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication

Samy Bengio, Norman Hoon Thian Poh

Multi-stream approaches have proven to be very successful in speech recognition tasks and to a certain extent in speaker authentication tasks. In this study we propose a noise-robust multi-stream text-independent speaker authentication system. This system ...

IDIAP, 2004

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication

Samy Bengio, Norman Hoon Thian Poh

2004

Noise Reduction by Fuzzy Image Filtering

Dimitri Nestor Alice Van De Ville

A new fuzzy filter is presented for the noise reduction of images corrupted with additive noise. The filter consists of two stages. The first stage computes a fuzzy derivative for eight different directions. The second stage uses these fuzzy derivatives to ...

IEEE, 2003