Publication

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing

Related publications (157)

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework

Hervé Bourlard, Jitendra Ajmera

In this paper, we present a new approach towards high performance speech/music discrimination on realistic tasks related to the automatic transcription of broadcast news. In the approach presented here, the (local) Probability Density Function (PDF) estima ...

2003

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents an overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

IDIAP, 2003

Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings

Darren Moore

This paper investigates the use of microphone arrays to acquire and recognise speech in meetings. Meetings pose several interesting problems for speech processing, as they consist of multiple competing speakers within a small space, typically around a tabl ...

2003

Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings

Darren Moore

IDIAP, 2002

Speech Processing & Text-Independent Automatic Person Verification

In this communication we first review the human speech production process and feature extraction approaches commonly used in a speaker verification system. Experiments on the telephone speech NTIMIT database suggest that the performance degradation of a ...

IDIAP, 2002

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework

Hervé Bourlard, Jitendra Ajmera

IDIAP, 2001

A Pragmatic View of the Application of HMM2 for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This report investigates the HMM2 approach recently introduced in the framework of automatic speech recognition. HMM2 can be seen as a mixture of HMMs, where a conventional primary HMM (processing a time series of speech data) is supported on a lower level ...

IDIAP, 2001

Some applications of a priori knowledge in multi-stream HMM and HMM/ANN based ASR

Multi-band ASR was largely inspired by the extremely high level of redundancy in the spectral signal representation which can be inferred from Fletcher's product-of-errors rule for human speech perception. Indeed, the main aim of the multi-band approach is ...

2000

Audio-Visual Speech Modelling for Continuous Speech Recognition

This paper describes a complete system for audio-visual recognition of continuous speech including robust lip tracking, visual feature extraction, noise-robust acoustic feature extraction, and sensor integration. An appearance based model of the articulato ...

2000

Relating LPC modeling to a factor-based articulatory model

Sacha Krstulovic

This paper proposes a method for recovering the articulatory parameters of a factor-based vocal tract shape model from the speech waveform. This is realized by analytically relating the shape model to a Linear Prediction lattice filter. Results pertaining ...

2000