Publication

Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR

Related concepts (38)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Sentence processing

Sentence processing takes place whenever a reader or listener processes a language utterance, either in isolation or in the context of a conversation or a text. Many studies of the human language comprehension process have focused on reading of single utterances (sentences) without context. Extensive research has shown that language comprehension is affected by context preceding a given utterance as well as many other factors. Sentence comprehension has to deal with ambiguity in spoken and written utterances, for example lexical, structural, and semantic ambiguities.

Anomaly detection

In data analysis, anomaly detection (also referred to as outlier detection and sometimes as novelty detection) is generally understood to be the identification of rare items, events or observations which deviate significantly from the majority of the data and do not conform to a well defined notion of normal behaviour. Such examples may arouse suspicions of being generated by a different mechanism, or appear inconsistent with the remainder of that set of data.

Japanese dictionary

have a history that began over 1300 years ago when Japanese Buddhist priests, who wanted to understand Chinese sutras, adapted Chinese character dictionaries. Present-day Japanese lexicographers are exploring computerized editing and electronic dictionaries. According to Nakao Keisuke (中尾啓介): It has often been said that dictionary publishing in Japan is active and prosperous, that Japanese people are well provided for with reference tools, and that lexicography here, in practice as well as in research, has produced a number of valuable reference books together with voluminous academic studies.

Speaker recognition

Speaker recognition is the identification of a person from characteristics of voices. It is used to answer the question "Who is speaking?" The term voice recognition can refer to speaker recognition or speech recognition. Speaker verification (also called speaker authentication) contrasts with identification, and speaker recognition differs from speaker diarisation (recognizing when the same speaker is speaking).

Machine learning

Machine learning (ML) is an umbrella term for solving problems for which development of algorithms by human programmers would be cost-prohibitive, and instead the problems are solved by helping machines 'discover' their 'own' algorithms, without needing to be explicitly told what to do by any human-developed algorithms. Recently, generative artificial neural networks have been able to surpass results of many previous approaches.

Word embedding

In natural language processing (NLP), a word embedding is a representation of a word. The embedding is used in text analysis. Typically, the representation is a real-valued vector that encodes the meaning of the word in such a way that words that are closer in the vector space are expected to be similar in meaning. Word embeddings can be obtained using language modeling and feature learning techniques, where words or phrases from the vocabulary are mapped to vectors of real numbers.

Rime dictionary

A rime dictionary, rhyme dictionary, or rime book () is an ancient type of Chinese dictionary that collates characters by tone and rhyme, instead of by radical. The most important rime dictionary tradition began with the Qieyun (601), which codified correct pronunciations for reading the classics and writing poetry by combining the reading traditions of north and south China. This work became very popular during the Tang dynasty, and went through a series of revisions and expansions, of which the most famous is the Guangyun (1007–1008).

Modern Greek

Modern Greek (Νέα Ελληνικά, Néa Elliniká, ˈne.a eliniˈka or Κοινή Νεοελληνική Γλώσσα, Kiní Neoellinikí Glóssa), generally referred to by speakers simply as Greek (Ελληνικά, Elliniká), refers collectively to the dialects of the Greek language spoken in the modern era, including the official standardized form of the languages sometimes referred to as Standard Modern Greek.

Optical character recognition

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of s of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo (for example the text on signs and billboards in a landscape photo) or from subtitle text superimposed on an image (for example: from a television broadcast).

Performance

A performance is an act of staging or presenting a play, concert, or other form of entertainment. It is also defined as the action or process of carrying out or accomplishing an action, task, or function. In the work place, job performance is the hypothesized conception or requirements of a role. There are two types of job performances: contextual and task. Task performance is dependent on cognitive ability, while contextual performance is dependent on personality.