Learning Word Vectors for 157 Languages

Distributed word representations, or word vectors, have recently been applied to many tasks in natural language processing, leading to state-of-the-art performance. A key ingredient in the successful application of these representations is to train them on very large corpora and to use these pre-trained models in downstream tasks. In this paper, we describe how we trained such high-quality word representations for 157 languages. We used two sources of data to train these models: the free online encyclopedia Wikipedia and data from the Common Crawl project. We also introduce three new word analogy datasets to evaluate these word vectors, for French, Hindi and Polish. Finally, we evaluate our pre-trained word vectors on 10 languages for which evaluation datasets exist, showing very strong performance compared to previous models.
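To make the word analogy evaluation mentioned in the abstract concrete, the following is a minimal sketch of how such a benchmark is commonly scored. It assumes the widely used vector-offset rule (pick the word whose vector is closest in cosine similarity to b - a + c), which the abstract itself does not specify. The toy vectors, the function names `solve_analogy` and `analogy_accuracy`, and the example questions are all hypothetical and chosen only for illustration; they are not taken from the paper's datasets.

```python
import math

def cosine(u, v):
    """Cosine similarity between two non-zero vectors given as lists."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def solve_analogy(vectors, a, b, c):
    """Answer 'a is to b as c is to ?' with the vector-offset rule.

    Assumption: the answer is the vocabulary word (other than a, b, c)
    whose vector has the highest cosine similarity to b - a + c.
    """
    target = [vb - va + vc
              for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(vectors[w], target))

def analogy_accuracy(vectors, questions):
    """Fraction of (a, b, c, expected) questions answered correctly."""
    correct = sum(solve_analogy(vectors, a, b, c) == expected
                  for a, b, c, expected in questions)
    return correct / len(questions)

# Hypothetical toy vectors: three "country" dimensions plus one
# "is a capital" dimension. Real word vectors are learned, not hand-built.
TOY_VECTORS = {
    "france": [1.0, 0.0, 0.0, 0.0],
    "paris":  [1.0, 0.0, 0.0, 1.0],
    "poland": [0.0, 1.0, 0.0, 0.0],
    "warsaw": [0.0, 1.0, 0.0, 1.0],
    "india":  [0.0, 0.0, 1.0, 0.0],
    "delhi":  [0.0, 0.0, 1.0, 1.0],
}

# Hypothetical questions in the form (a, b, c, expected answer).
TOY_QUESTIONS = [
    ("france", "paris", "poland", "warsaw"),
    ("poland", "warsaw", "india", "delhi"),
]

if __name__ == "__main__":
    print(solve_analogy(TOY_VECTORS, "france", "paris", "poland"))
    print(analogy_accuracy(TOY_VECTORS, TOY_QUESTIONS))
```

With real pre-trained vectors, the same scoring loop would run over a full vocabulary and a full analogy file, and accuracy would be reported per language.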

Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

Modeling Structured Data in Attention-based Models

Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models
