Detection and Recognition of Number Sequences in Spoken Utterances

In this paper we investigate the detection and recognition of number sequences in spoken utterances. This is done in two steps. First, the entire utterance is decoded under the assumption that only numbers were spoken. Second, non-number segments (garbage) are detected based on word confidence measures. We compare this approach to conventional garbage models, and we also compare several phone-posterior-based confidence measures. The work is evaluated both as a detection task (hit rate and false alarms) and as a recognition task (word accuracy within the detected number sequences). The proposed method is tested on German continuous spoken utterances in which only 20% of the content is target material (numbers).
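The second step described above can be illustrated with a minimal sketch. It assumes the numbers-only decoding pass has already produced word hypotheses, each carrying the posterior probability of its aligned phone at every frame. The `DecodedWord` structure, the frame-averaged log posterior used as the confidence measure, and the threshold value are all hypothetical choices made for illustration. The abstract does not specify which confidence measures are compared, so this should not be read as the authors' implementation.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class DecodedWord:
    """One word hypothesis from the numbers-only decoding pass (hypothetical structure)."""
    label: str
    # Posterior probability of the aligned phone at each frame of the word.
    frame_posteriors: List[float]


def word_confidence(word: DecodedWord) -> float:
    """Frame-averaged log phone posterior (an assumed confidence measure)."""
    eps = 1e-10  # floor to avoid log(0)
    logs = [math.log(max(p, eps)) for p in word.frame_posteriors]
    return sum(logs) / len(logs)


def detect_number_sequences(words: List[DecodedWord], threshold: float) -> List[List[str]]:
    """Group consecutive high-confidence words into number sequences.

    Words whose confidence falls below the threshold (or that have no frames)
    are treated as garbage and split the surrounding sequences.
    """
    sequences: List[List[str]] = []
    current: List[str] = []
    for w in words:
        if w.frame_posteriors and word_confidence(w) >= threshold:
            current.append(w.label)
        else:
            if current:
                sequences.append(current)
            current = []
    if current:
        sequences.append(current)
    return sequences


# Toy hypothesis: the third word has low phone posteriors, as would be expected
# when non-number speech is forced through a numbers-only decoder.
hyp = [
    DecodedWord("drei", [0.9, 0.8, 0.95]),
    DecodedWord("vier", [0.85, 0.9]),
    DecodedWord("acht", [0.05, 0.1, 0.02]),
    DecodedWord("zwei", [0.7, 0.9, 0.8]),
]
print(detect_number_sequences(hyp, threshold=math.log(0.5)))
# prints [['drei', 'vier'], ['zwei']]
```

In a sketch like this, raising the threshold rejects more words as garbage, which would lower false alarms at the cost of hit rate. That is the trade-off the detection evaluation measures.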
