Related publications (18)

Probabilistic Symbol Sequence Matching and its Application to Pathological Speech Intelligibility Assessment

Julian David Fritsch

Matching of a test signal to a reference word hypothesis forms the core of many speech processing problems, including objective speech intelligibility assessment. This paper first shows that the comparison of two speech signals can be formulated as matchin ...
Idiap, 2021

Multilingual bottleneck features for subword modeling in zero-resource languages

Enno Hermann

How can we effectively develop speech technology for languages where no transcribed data is available? Many existing approaches use no annotated resources at all, yet it makes sense to leverage information from large annotated corpora in other languages, f ...
2018

Perceptual Information Loss due to Impaired Speech Production

Hervé Bourlard, Milos Cernak, Afsaneh Asaei

Phonological classes define articulatory-free and articulatory-bound phone attributes. Deep neural network is used to estimate the probability of phonological classes from the speech signal. In theory, a unique combination of phone attributes form a phonem ...
2017

Sound Pattern Matching for Automatic Prosodic Event Detection

Hervé Bourlard, Philip Neil Garner, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Jean Charles Honnet

Prosody in speech is manifested by variations of loudness, exaggeration of pitch, and specific phonetic variations of prosodic segments. For example, in the stressed and unstressed syllables, there are differences in place or manner of articulation, vowels ...
Idiap, 2016

Grain boundary relaxation in yellow gold bi-crystals

Daniele Mari, Robert Schaller, Iva Tkalcec Vâju

The mechanical loss spectrum of a yellow gold bi-crystal is presented and analyzed in detail. The relaxation strength is monitored as a function of several geometrical parameters such as sample width, length and thickness. It is found that the relaxation s ...
Elsevier, 2015

Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution

Jean-Marc Vesin, Olaf Schleusing

In this work, we present a joint source-filter optimization approach for separating voiced speech into vocal tract (VT) and voice source components. The presented method is pitch-synchronous and thereby exhibits a high robustness against vocal jitter, shim ...
IEEE-Inst Electrical Electronics Engineers Inc, 2013

Individual Differences in the Discrimination of Novel Speech Sounds: Effects of Sex, Temporal Processing, Musical and Cognitive Abilities

John Christian Thoresen

This study examined whether rapid temporal auditory processing, verbal working memory capacity, non-verbal intelligence, executive functioning, musical ability and prior foreign language experience predicted how well native English speakers (N = 120) discr ...
Public Library of Science, 2012

More Than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech

Gelareh Mohammadi, Alessandro Vinciarelli

This paper presents two examples of how nonverbal communication can be automatically detected and interpreted in terms of social phenomena. In particular, the presented approaches use simple prosodic features to distinguish between journalists and non-jo ...
Springer, 2011
