Publication

Building Word Embeddings for Solving Natural Language Processing

Related publications (151)

Joint Image and Word Sense Discrimination For Image Retrieval

Aurélien Lucchi

We study the task of learning to rank images given a text query, a problem that is complicated by the issue of multiple senses. That is, the senses of interest are typically the visually distinct concepts that a user wishes to retrieve. In this paper, we p ...

2012

Natural Language Processing (Almost) from Scratch

Ronan Collobert, Michael Karlen

We propose a unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including part-of-speech tagging, chunking, named entity recognition, and semantic role labeling. This versatility is a ...

2011

A novel multi-aspect consistency measurement for ontologies

Zoltán Miklós, Zhao Lu

Web developers increasingly integrate semantic information into their systems. The semantic metadata embedded with the resources is typically linked to ontologies or taxonomies. Meta information can bring a number of advantages for user ...

2011

Fisher Kernels and Probabilistic Latent Semantic Models

Emmanuel Eckard

Tasks that rely on semantic content of documents, notably Information Retrieval and Document Classification, can benefit from a good account of document context, i.e. the semantic association between documents. To this effect, the scheme of latent semantic ...

EPFL, 2010

Semantic Vector Machines

Vincent Etter

We first present our work in machine translation, during which we used aligned sentences to train a neural network to embed n-grams of different languages into a d-dimensional space, such that n-grams that are translations of each other are close with ...

2009

An Ad Hoc Information Retrieval Perspective on PLSI through language model identification

Jean-Cédric Chappelier, Emmanuel Eckard

Ten years ago, PLSI opened the road to probabilistic latent semantic representations of documents. It led to a number of applications in different fields, including ad hoc Information Retrieval. However, inherent limitations hinder its use on documents not ...

Springer-Verlag New York, 2009

PLSI: The True Fisher Kernel and beyond IID Processes, Information Matrix and Model Identification in PLSI

Jean-Cédric Chappelier, Emmanuel Eckard

The Probabilistic Latent Semantic Indexing model, introduced by T. Hofmann (1999), has engendered applications in numerous fields, notably document classification and information retrieval. In this context, the Fisher kernel was found to be an appropriate ...

Springer-Verlag New York, 2009

Rôle de la matrice d'information et pondération des composantes dans les noyaux de Fisher pour PLSI

Jean-Cédric Chappelier, Emmanuel Eckard

ABSTRACT. An information-geometric approach for document similarities in the framework of “Probabilistic Latent Semantic Indexing” was first proposed by T. Hofmann (2000) and later extended (“revisited”) by Nyffenegger et al. (2006). This paper presents an ...

2009

idMesh: graph-based disambiguation of linked data

Karl Aberer, Philippe Cudré-Mauroux, Parisa Haghani, Michael Jost

We tackle the problem of disambiguating entities on the Web. We propose a user-driven scheme where graphs of entities -- represented by globally identifiable declarative artifacts -- self-organize in a dynamic and probabilistic manner. Our solution has the ...

ACM, 2009

Machine learning for information retrieval

David Grangier

In this thesis, we explore the use of machine learning techniques for information retrieval. More specifically, we focus on ad-hoc retrieval, which is concerned with searching large corpora to identify the documents relevant to user queries. This identific ...

EPFL, 2008