Publication

Exploiting Hyperlinks to Learn a Retrieval Model

Related publications (36)

From scattered sources to comprehensive technology landscape : A recommendation-based retrieval approach

Karl Aberer, Chi Thang Duong

Mapping the technology landscape is crucial for market actors to take informed investment decisions. However, given the large amount of data on the Web and its subsequent information overload, manually retrieving information is a seemingly ineffective and ...

Elsevier, 2023

Studying Linguistic Changes over 200 Years of Newspapers through Resilient Words Analysis

Frédéric Kaplan, Vincent Christian Buntinx, Cyril Antoine Michel Bornet

This paper presents a methodology for analyzing linguistic changes in a given textual corpus that overcomes two common problems in corpus linguistics studies. One of these issues is the monotonic increase of the corpus size with time, and the ot ...

2017

Adaptive relevance feedback for large-scale image retrieval

François Fleuret, Nicolae Suditu

Content-based image retrieval aims at substituting traditional indexing based on manual annotation by using automatically-extracted visual indexing features. Novel techniques are needed however to efficiently deal with the semantic gap (i.e. the partial ma ...

2016

Selection and Aggregation of Ranking Criteria for Retrieval from Scientific Publication Databases

Martin Veselý

Selection and aggregation of ranking criteria became an important topic in information retrieval as search is getting more specialized and as volume of electronically available information grows. In this context, document ranking has undergone a shift from ...

EPFL, 2012

Extracting Informative Textual Parts from Web Pages Containing User-Generated Content

Nikolaos Pappas

The vast amount of user-generated content on the Web has increased the need for handling the problem of automatically processing content in web pages. The segmentation of web pages and noise (non-informative segment) removal are important pre-processing st ...

ACM, 2012

PLSI: the true Fisher Kernel and beyond

Jean-Cédric Chappelier, Emmanuel Eckard

The Probabilistic Latent Semantic Indexing model, introduced by T. Hofmann (1999), has engendered applications in numerous fields, notably document classification and information retrieval. In this context, the Fisher kernel was found to be an appropriate do ...

2009

Utilisation de PLSI en recherche d'information

Jean-Cédric Chappelier, Emmanuel Eckard

The PLSI model (“Probabilistic Latent Semantic Indexing”) offers a document indexing scheme based on probabilistic latent category models. It entailed applications in diverse fields, notably in information retrieval (IR). Nevertheless, PLSI cannot process d ...

2009

Machine learning for information retrieval

David Grangier

In this thesis, we explore the use of machine learning techniques for information retrieval. More specifically, we focus on ad-hoc retrieval, which is concerned with searching large corpora to identify the documents relevant to user queries. This identific ...

EPFL, 2008

Machine Learning for Information Retrieval

David Grangier

École Polytechnique Fédérale de Lausanne, 2008

Machine Learning for Information Retrieval

David Grangier

IDIAP, 2008