Publication

Towards using slide information to enhance speech transcription of meetings

Related publications (94)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Extracting Information from Multimedia Meeting Collections

Daniel Gatica-Perez, Samy Bengio, Dong Zhang

Multimedia meeting collections, composed of unedited audio and video streams, handwritten notes, slides, and electronic documents that jointly constitute a raw record of complex human interaction processes in the workplace, have attracted interest due to t ...

IDIAP2005

Extracting Information from Multimedia Meeting Collections

Daniel Gatica-Perez, Samy Bengio, Dong Zhang

2005

Noisy Text Categorization

Alessandro Vinciarelli

This work presents categorization experiments performed over noisy texts. By noisy it is meant any text obtained through an extraction process (affected by errors) from media other than digital texts (e.g. transcriptions of speech recordings extracted with ...

2005

Sociometry Based Multiparty Audio Recordings Segmentation

Alessandro Vinciarelli

This paper shows how Social Network Analysis, the sociological domain studying the interaction between people in specific social environments, can be used to assign roles to different speakers in multiparty recordings. The experiments presented in this wor ...

IDIAP2005

On the Use of Information Retrieval Measures for Speech Recognition Evaluation

Hervé Bourlard, Daniel Gatica-Perez, John David Scott Dines, Darren Moore

This paper discusses the evaluation of automatic speech recognition (ASR) systems developed for practical applications, suggesting a set of criteria for application-oriented performance measures. The commonly used word error rate (WER), which poses ASR eva ...

IDIAP2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud

Microphone arrays are useful in meeting rooms, where speech needs to be acquired and segmented. For example, automatic speech segmentation allows enhanced browsing experience, and facilitates automatic analysis of large amounts of data. Spontaneous multi-p ...

IDIAP2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud

2004

Noisy Text Clustering

David Grangier, Alessandro Vinciarelli

This work presents document clustering experiments performed over noisy texts (i.e. text that have been extracted through an automatic process like speech or character recognition). The effect of recognition errors on different clustering techniques is mea ...

IDIAP2004

Application of Information Retrieval Techniques to Single Writer Documents

Alessandro Vinciarelli

This work shows Information Retrieval experiments performed over handwritten documents produced by a single writer. The same retrieval task has been performed over both manual (no errors) and automatic (Word Error Rate around 45%) transcriptions of 200 han ...

IDIAP2004