Publication

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition

Related concepts (33)
English-language learner
English-Language Learner (often abbreviated as ELL) is a term used in some English-speaking countries such as the United States and Canada to describe a person who is learning the English language and has a native language that is not English. Some educational advocates, especially in the United States, classify these students as non-native English speakers or emergent bilinguals.
Voice (phonetics)
Voice or voicing is a term used in phonetics and phonology to characterize speech sounds (usually consonants). Speech sounds can be described as either voiceless (otherwise known as unvoiced) or voiced. The term, however, is used to refer to two separate concepts: Voicing can refer to the articulatory process in which the vocal folds vibrate, its primary use in phonetics to describe phones, which are particular speech sounds. It can also refer to a classification of speech sounds that tend to be associated with vocal cord vibration but may not actually be voiced at the articulatory level.
Allophone
In phonology, an allophone (ˈæləfoʊn; from the Greek ἄλλος, , 'other' and φωνή, , 'voice, sound') is one of multiple possible spoken sounds - or phones - or signs used to pronounce a single phoneme in a particular language. For example, in English, the voiceless plosive t (as in stop [ˈstɒp]) and the aspirated form th (as in top [ˈthɒp]) are allophones for the phoneme /t/, while these two are considered to be different phonemes in some languages such as Thai.
English phonology
English phonology is the system of speech sounds used in spoken English. Like many other languages, English has wide variation in pronunciation, both historically and from dialect to dialect. In general, however, the regional dialects of English share a largely similar (but not identical) phonological system. Among other things, most dialects have vowel reduction in unstressed syllables and a complex set of phonological features that distinguish fortis and lenis consonants (stops, affricates, and fricatives).
Speech perception
Speech perception is the process by which the sounds of language are heard, interpreted, and understood. The study of speech perception is closely linked to the fields of phonology and phonetics in linguistics and cognitive psychology and perception in psychology. Research in speech perception seeks to understand how human listeners recognize speech sounds and use this information to understand spoken language.
Dual-route hypothesis to reading aloud
The dual-route theory of reading aloud was first described in the early 1970s. This theory suggests that two separate mental mechanisms, or cognitive routes, are involved in reading aloud, with output of both mechanisms contributing to the pronunciation of a written stimulus. The lexical route is the process whereby skilled readers can recognize known words by sight alone, through a "dictionary" lookup procedure. According to this model, every word a reader has learned is represented in a mental database of words and their pronunciations that resembles a dictionary, or internal lexicon.
Artificial intelligence
Artificial intelligence (AI) is the intelligence of machines or software, as opposed to the intelligence of human beings or animals. AI applications include advanced web search engines (e.g., Google Search), recommendation systems (used by YouTube, Amazon, and Netflix), understanding human speech (such as Siri and Alexa), self-driving cars (e.g., Waymo), generative or creative tools (ChatGPT and AI art), and competing at the highest level in strategic games (such as chess and Go).
English as a second or foreign language
English as a second or foreign language is the use of English by speakers with different native languages. Language education for people learning English may be known as English as a foreign language (EFL), English as a second language (ESL), English for speakers of other languages (ESOL), English as an additional language (EAL), or English as a New Language (ENL). The aspect in which EFL is taught is referred to as teaching English as a foreign language (TEFL), teaching English as a second language (TESL) or teaching English to speakers of other languages (TESOL).
Phonological awareness
Phonological awareness is an individual's awareness of the phonological structure, or sound structure, of words. Phonological awareness is an important and reliable predictor of later reading ability and has, therefore, been the focus of much research. Phonological awareness involves the detection and manipulation of sounds at three levels of sound structure: (1) syllables, (2) onsets and rimes, and (3) phonemes. Awareness of these sounds is demonstrated through a variety of tasks (see below).
Affective computing
Affective computing is the study and development of systems and devices that can recognize, interpret, process, and simulate human affects. It is an interdisciplinary field spanning computer science, psychology, and cognitive science. While some core ideas in the field may be traced as far back as to early philosophical inquiries into emotion, the more modern branch of computer science originated with Rosalind Picard's 1995 paper on affective computing and her book Affective Computing published by MIT Press.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.