Publication

Stochastic techniques in deriving perceptual knowledge

Related publications (94)

On the Combination of Auditory and Modulation Frequency Channels for ASR applications

Hynek Hermansky, Fabio Valente

This paper investigates the combination of evidence coming from different frequency channels obtained by filtering the speech signal at different auditory and modulation frequencies. In our previous work \cite{icassp2008}, we showed that combination of classi ...

IDIAP, 2008

Configural modulation of crowding

Michael Herzog, Bilge Sayim, Toni Saarela

The ability to make judgments about a peripheral target stimulus can be impaired when the target is surrounded by flanking stimuli. This effect is called crowding. Crowding is often related to relatively simple low-level mechanisms that pool visual informa ...

Association for Research in Vision and Ophthalmology, 2008

Fast Approximate Spoken Term Detection from Sequence of Phonemes

Hynek Hermansky, Joel Praveen Pinto

We investigate the detection of spoken terms in conversational speech using phoneme recognition with the objective of achieving smaller index size as well as faster search speed. Speech is processed and indexed as a sequence of one best phoneme sequence. W ...

2008

Fast Approximate Spoken Term Detection from Sequence of Phonemes

Hynek Hermansky, Joel Praveen Pinto

IDIAP, 2008

Perception Studies on the Attributes of Synthetic Clear Speech for the Hard of Hearing

Chandra Sekhar Seelamantula

We make a case for ‘synthetic clear speech’ in the context of persons with hearing impairment. We study the acoustic attributes of ‘clear speech’ that enable us to understand their importance in speech perception. Our perception experiments are motivat ...

IEEE, 2007

Novel speech processing techniques for robust automatic speech recognition

Vivek Tyagi

The goal of this thesis is to develop and design new feature representations that can improve the automatic speech recognition (ASR) performance in clean as well as noisy conditions. One of the main shortcomings of the fixed scale (typically 20-30 ms long ana ...

EPFL, 2006

Discriminant linear processing of time-frequency plane

Hynek Hermansky, Fabio Valente

Extending previous work done on considerably smaller data sets, the paper studies linear discriminant analysis of about 30 hours of phoneme-labeled speech data in the time-frequency domain. Analysis is carried out both independently in time and frequency and ...

2006

Discriminant linear processing of time-frequency plane

Hynek Hermansky, Fabio Valente

IDIAP, 2006

A Discriminative Decoder for the Recognition of Phoneme Sequences

Samy Bengio, David Grangier

In this report, we propose a discriminative decoder for phoneme recognition, i.e. the identification of the uttered phoneme sequence from a speech recording. This task is solved as a 3-step process: a phoneme classifier first classifies each acoustic fram ...

IDIAP, 2005

Bandpass filters for 8 GHz using solidly mounted bulk acoustic wave resonators

Paul Muralt, Roman Lanz

Frequency shift, design, and fabrication issues have been investigated for the realization of 8 GHz bandpass filters based on AlN thin film bulk acoustic wave resonators. Fabrication includes well-textured AlN thin films on Pt (111) electrodes and SiO2/AlN ...

2005