Publication

Text detection and recognition in images and video sequences

Related publications (144)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Image-Based Mobile Service: Automatic Text Extraction and Translation

Jérôme Berclaz

We present a new mobile service for the translation of text from images taken by consumer-grade cell-phone cameras. Such capability represents a new paradigm for users where a simple image provides the basis for a service. The ubiquity and ease of use of c ...

Society of Photo-Optical Instrumentation Engineers2010

A Large Margin Algorithm for Forced Alignment

We describe and analyze a discriminative algorithm for learning to align a phoneme sequence of a speech utterance with its acoustical signal counterpart by predicting a timing sequence representing the phoneme start times. In contrast to common HMM-based a ...

John Wiley and Sons2009

Contextual classification of image patches with latent aspect models

Daniel Gatica-Perez, Jean-Marc Odobez, Florent Monay Michaud, Pedro Manuel Da Silva Quelhas

We present a novel approach for contextual classification of image patches in complex visual scenes, based on the use of histograms of quantized features and probabilistic aspect models. Our approach uses context in two ways: (1) by using the fact that spe ...

2009

An Ad Hoc Information Retrieval Perspective on PLSI through language model identification

Jean-Cédric Chappelier, Emmanuel Eckard

Ten years ago, PLSI opened the road to probabilistic latent semantic representations of documents. It led to a number of applications in different ﬁelds, including ad hoc Information Retrieval. However, inherent limitations hinder its use on documents not ...

Springer-Verlag New York, Ms Ingrid Cunningham, 175 Fifth Ave, New York, Ny 10010 Usa2009

Who is the expert? analyzing gaze data to predict expertise level in collaborative applications

Pierre Dillenbourg, Mirweis Sangin, Marc-Antoine Nüssli, Yuan Liu, Yan Liu

In this paper, we analyze complex gaze tracking data in a collaborative task and apply machine learning models to automatically predict skill-level differences between participants. Specifically, we present findings that address the two primary challenges ...

IEEE Press2009

Machine learning for information retrieval

David Grangier

In this thesis, we explore the use of machine learning techniques for information retrieval. More specifically, we focus on ad-hoc retrieval, which is concerned with searching large corpora to identify the documents relevant to user queries. This identific ...

EPFL2008

Machine Learning for Information Retrieval

David Grangier

École Polytechnique Fédérale de Lausanne2008

Machine Learning for Information Retrieval

David Grangier

IDIAP2008

A collaborative approach to image segmentation and behavior recognition from image sequences

Laura Ioana Gui

Visual behavior recognition is currently a highly active research area. This is due both to the scientific challenge posed by the complexity of the task, and to the growing interest in its applications, such as automated visual surveillance, human-computer ...

EPFL2008

Machine learning approaches to text representation using unlabeled data

Mikaela Keller

With the rapid expansion in the use of computers for producing digitalized textual documents, the need of automatic systems for organizing and retrieving the information contained in large databases has become essential. In general, information retrieval s ...

EPFL2008