Related publications (32)

Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications

Petr Motlicek, Amrutha Prasad

Voice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information suc ...
MDPI, 2021

Language Resources for Historical Newspapers: the Impresso Collection

Maud Ehrmann, Matteo Romanello, Raphaël Barman

Following decades of massive digitization, an unprecedented amount of historical document facsimiles can now be retrieved and accessed via cultural heritage online portals. If this represents a huge step forward in terms of preservation and accessibility, ...
European Language Resources Association, 2020

Idiap Submission to Swiss-German Language Detection Shared Task

Petr Motlicek

Language detection is a key part of the NLP pipeline for text processing. The task of automatically detecting languages belonging to disjoint groups is relatively easy. It is considerably challenging to detect languages that have similar origins or dialect ...
CEUR Workshop Proceedings, 2020

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model

Jan Frederik Jonas Florian Mai

Continuous Bag of Words (CBOW) is a powerful text embedding method. Due to its strong capabilities to encode word content, CBOW embeddings perform well on a wide range of downstream tasks while being efficient to compute. However, CBOW is not capable of ca ...
2019

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model

Jan Frederik Jonas Florian Mai

Continuous Bag of Words (CBOW) is a powerful text embedding method. Due to its strong capabilities to encode word content, CBOW embeddings perform well on a wide range of downstream tasks while being efficient to compute. However, CBOW is not capable of ca ...
Idiap, 2019

Modular and reconfigurable desktop microfactory for high precision manufacturing

Zhenishbek Zhakypov

Sub-millimeter scale devices are developing rapidly taking smaller, smarter, and more precise forms. This is achieved thanks to advancements in micro-manufacturing tools and techniques. For micro-production, a miniaturization of the machinery is a prominen ...
Springer London Ltd, 2017

Storage method and apparatus for random access memory using codeword storage

Harm Cronie

A memory circuit, such as an embedded DRAM array, stores information as groups of bits or data using information coding in storage and retrieval data, instead of each bit being stored separately. Write data words can be mapped to storage format words that ...
2016

Kamusi Pre:D – Lexicon-based source-side predisambiguation for MT and other text processing applications

Martin Benjamin

Kamusi has been developing a system to analyze texts on the source side and present users with sense-specified dictionary options. Similarly to spellcheck, the user selects the intended meaning. We then use a multilingual lexical database to bridge to matc ...
ENeL, 2016

Building Word Embeddings for Solving Natural Language Processing

Rémi Philippe Lebret

Word embedding is a feature learning technique which aims at mapping words from a vocabulary into vectors of real numbers in a low-dimensional space. By leveraging large corpora of unlabeled text, such continuous space representations can be computed for c ...
École Polytechnique Fédérale de Lausanne, 2016
