Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Text characters embedded in images and video sequences represents a rich source of information for content-based indexing and retrieval applications. However, these text characters are difficult to be detected and recognized due to their various sizes, gra ...
Effectively managing a large collection of multimedia documents is a challenge, addressed by many disciplines from signal processing through database systems to artificial intelligence and interaction design. The problems to be solved have rarely been cons ...
Text characters embedded in images and video sequences represents a rich source of information for content-based indexing and retrieval applications. However, these text characters are difficult to be detected and recognized due to their various sizes, gra ...
Text characters embedded in images and video sequences represents a rich source of information for content-based indexing and retrieval applications. However, these text characters are difficult to be detected and recognized due to their various sizes, gra ...
The representation of video information in terms of its content is atthe foundation of many multimedia applications, such as broadcasting,content-based information retrieval, interactive video, remotesurveillance and entertainment. In particular, object-ba ...
Text embedded in images and videos represents a rich source of information for content-based indexing and retrieval applications. In this paper, we present a new method for localizing and recognizing text in complex images and videos. Text localization is ...
The following topics are dealt with: hyperdatabases; federated information systems; relevance feedback in CBIR; Oracle Chart Builder and MapViewer; SOM-based k-nearest neighbors search; partial image retrieval; high-dimensional image indexing; design and v ...
This paper presents a video OCR system that automatically extracts closed captions from video frames as keywords (or as we called "cues") for building annotations of sport videos. In this system, text regions that contain closed captions are first identifi ...
In this article we present a novel approach of integrating textual and visual descriptors of images in a unified retrieval structure. The methodology, inspired from text retrieval and information filtering is based on Latent Semantic Indexing (LS1). ...
Spoken Document Retrieval (SDR) consists in retrieving segments of a speech database that are relevant to a query. The state-of-the-art approach to the SDR problem consists in transcribing the speech data into digital text before applying common Informatio ...