Person

Hervé Bourlard

Related publications (367)

In Bourlard and Kamp (Biol Cybern 59(4):291-294, 1988), it was theoretically proven that autoencoders (AE) with a single hidden layer (previously called "auto-associative multilayer perceptrons") were, in the best case, implementing singular value decomposit ...

SPRINGER, 2022

From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition

Hervé Bourlard, Selen Hande Kabil

Starting from a strong Lattice-Free Maximum Mutual Information (LF-MMI) baseline system, we explore different autoencoder configurations to enhance Mel-Frequency Cepstral Coefficients (MFCC) features. Autoencoders are expected to generate new MFCC features ...

ISCA-INT SPEECH COMMUNICATION ASSOC, 2022

Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-Based Convolutional Neural Networks

Hervé Bourlard, Ina Kodrasi, Parvaneh Janbakhshi

Automatic dysarthric speech detection can provide reliable and cost-effective computer-aided tools to assist the clinical diagnosis and management of dysarthria. In this paper we propose a novel automatic dysarthric speech detection approach based on analy ...

IEEE, 2021

Subspace-Based Learning for Automatic Dysarthric Speech Detection

Hervé Bourlard, Ina Kodrasi, Parvaneh Janbakhshi

To assist the clinical diagnosis and treatment of speech dysarthria, automatic dysarthric speech detection techniques providing reliable and cost-effective assessment are indispensable. Based on clinical evidence on spectro-temporal distortions associated ...

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 2021

Lattice-Free MMI Adaptation of Self-Supervised Pretrained Acoustic Models

Hervé Bourlard, Apoorv Vyas

In this work, we propose lattice-free MMI (LFMMI) for supervised adaptation of a self-supervised pretrained acoustic model. We pretrain a Transformer model on a thousand hours of untranscribed Librispeech data followed by supervised adaptation with LFMMI on th ...

IEEE, 2021

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model

Hervé Bourlard, Apoorv Vyas

In this work, we investigate if the wav2vec 2.0 self-supervised pretraining helps mitigate the overfitting issues with connectionist temporal classification (CTC) training to reduce its performance gap with flat-start lattice-free MMI (E2E-LFMMI) for autom ...

ISCA-INT SPEECH COMMUNICATION ASSOC, 2021

In this paper, we develop Automatic Speech Recognition (ASR) systems for multi-genre speech recognition of low-resource languages where training data is predominantly conversational speech but test data can be in one of the following genres: news broadcast ...

ISCA-INT SPEECH COMMUNICATION ASSOC, 2021

Neural Network Based End-to-End Query by Example Spoken Term Detection

Hervé Bourlard, Dhananjay Ram

This article focuses on the problem of query by example spoken term detection (QbE-STD) in a zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or bottlene ...

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 2020

Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-Based Convolutional Neural Networks

Hervé Bourlard, Ina Kodrasi, Parvaneh Janbakhshi

Idiap, 2020

Automatic Pathological Speech Intelligibility Assessment Exploiting Subspace-Based Analyses

Hervé Bourlard, Ina Kodrasi, Parvaneh Janbakhshi

Competitive state-of-the-art automatic pathological speech intelligibility measures typically rely on regression training on a large number of features, require a large amount of healthy speech training data, or are applicable only to phonetically balanced ...

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 2020