On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

Robert West, Maxime Jean Julien Peyrard, Yang Gao, Wei Zhao
2020
conference papers

Abstract

Evaluation of cross-lingual encoders is usually performed either via zero-shot cross-lingual transfer in supervised downstream tasks or via unsupervised cross-lingual textual similarity. In this paper, we concern ourselves with reference-free machine translation (MT) evaluation where we directly compare source texts to (sometimes low-quality) system translations, which represents a natural adversarial setup for multilingual encoders. Reference-free evaluation holds the promise of web-scale comparison of MT systems. We systematically investigate a range of metrics based on state-of-the-art cross-lingual semantic representations obtained with pretrained M-BERT and LASER. We find that they perform poorly as semantic encoders for reference-free MT evaluation and identify their two key limitations, namely, (a) a semantic mismatch between representations of mutual translations and, more prominently, (b) the inability to punish "translationese", i.e., low-quality literal translations. We propose two partial remedies: (1) post-hoc re-alignment of the vector spaces and (2) coupling of semantic-similarity based metrics with target-side language modeling. In segment-level MT evaluation, our best metric surpasses reference-based BLEU by 5.7 correlation points. We make our MT evaluation code available.

Official source

https://infoscience.epfl.ch/entities/publication/d87ab34c-198c-4bb3-88fc-b64044bef8a4

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

Graph Chatbot

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Dense Image-based Predictions for Comics Analysis

Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Dense Image-based Predictions for Comics Analysis

Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages