Better Word Embeddings by Disentangling Contextual n-Gram Information

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Pre-trained word vectors are ubiquitous in Natural Language Processing applications. In this paper, we show how training word embeddings jointly with bigram and even trigram embeddings, results in improved unigram embeddings. We claim that training word embeddings along with higher n-gram embeddings helps in the removal of the contextual information from the unigrams, resulting in better stand-alone word embeddings. We empirically show the validity of our hypothesis by outperforming other competing word representation models by a significant margin on a wide variety of tasks. We make our models publicly available.

Better Word Embeddings by Disentangling Contextual n-Gram Information

Graph Chatbot

Chat with Graph Search

Infusing structured knowledge priors in neural models for sample-efficient symbolic reasoning

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

The multimodality cell segmentation challenge: toward universal solutions

Infusing structured knowledge priors in neural models for sample-efficient symbolic reasoning

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

The multimodality cell segmentation challenge: toward universal solutions