Publications related to Phonemic orthography

Novel Methods For Detection And Analysis Of Atypical Aspects In Speech

Atypical aspects in speech concern speech that deviates from what is commonly considered normal or healthy. In this thesis, we propose novel methods for detection and analysis of these aspects, e.g. to monitor the temporary state of a speaker, diseases tha ...

EPFL2023

Phonetic aware techniques for Speaker Verification

Subhadeep Dey

The goal of this thesis is to improve current state-of-the-art techniques in speaker verification (SV), typically based on “identity-vectors” (i-vectors) and deep neural network (DNN), by exploiting diverse (phonetic) information extracted using variou ...

EPFL2018

On Modeling the Synergy Between Acoustic and Lexical Information for Pronunciation Lexicon Development

Marzieh Razavi

State-of-the-art automatic speech recognition (ASR) and text-to-speech systems require a pronunciation lexicon that maps each word to a sequence of phones. Manual development of lexicons is costly as it needs linguistic knowledge and human expertise. To fa ...

EPFL2017

Exploiting sequence information for text-dependent Speaker Verification

Petr Motlicek, Subhadeep Dey

Model-based approaches to Speaker Verification (SV), such as Joint Factor Analysis (JFA), i-vector and relevance Maximum-a-Posteriori (MAP), have been shown to provide state-of-the-art performance for text-dependent systems with fixed phrases. The performance o ...

IEEE2017

Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework

Mathew Magimai Doss, Ramya Rasipuram, Marzieh Razavi

One of the primary steps in building automatic speech recognition (ASR) and text-to-speech systems is the development of a phonemic lexicon that provides a mapping between each word and its pronunciation as a sequence of phonemes. Phoneme lexicons can be d ...

2016

On Compressibility of Neural Network phonological Features for Low Bit Rate Speech Coding

Hervé Bourlard, Afsaneh Asaei, Milos Cernak

Phonological features extracted by neural network have shown interesting potential for low bit rate speech vocoding. The span of phonological features is wider than the span of phonetic features, and thus fewer frames need to be transmitted. Moreover, the ...

2015

Graphene: from Growth to Devices

Laurent Syavoch Bernard

We report a full study of graphene synthesis by CVD on a Cu surface. Two CVD methods have been developed. The first is a static one, which yields a monolayer of graphene at a low pressure of methane in 3 minutes at 1000 °C. The second one is an equimolar method w ...

EPFL2015

On Learning Grapheme-to-Phoneme Relationships through the Acoustic Speech Signal

Mathew Magimai Doss, Ramya Rasipuram

Automatic speech recognition (ASR) systems, through use of the phoneme as an intermediary unit representation, split the problem of modeling the relationship between the written form, i.e., the text and the acoustic speech signal into two disjoint processe ...

2014

Grapheme-based Automatic Speech Recognition using Probabilistic Lexical Modeling

Ramya Rasipuram

Automatic speech recognition (ASR) systems incorporate expert knowledge of language or the linguistic expertise through the use of a phone pronunciation lexicon (or dictionary) where each word is associated with a sequence of phones. The creation of phone pr ...

EPFL2014

Overcoming Asynchrony in Audio-Visual Speech Recognition

Jean-Philippe Thiran, Virginia Estellers Casas

In this paper we propose two alternatives to overcome the natural asynchrony of modalities in Audio-Visual Speech Recognition. We first investigate the use of asynchronous statistical models based on Dynamic Bayesian Networks with different levels of async ...

2010

Enhanced Phone Posteriors for Improving Speech Recognition Systems

Hervé Bourlard, Hamed Ketabdar

Using phone posterior probabilities has been increasingly explored for improving automatic speech recognition (ASR) systems. In this paper, we propose two approaches for hierarchically enhancing these phone posteriors, by integrating long acoustic context, ...

2010