Cross-lingual Language Model Pretraining

Lample, Guillaume; Conneau, Alexis

doi:10.48550/arxiv.1901.07291

preprintarXiv (Cornell University)Jan 22, 2019GREEN OA

Cross-lingual Language Model Pretraining

GLGuillaume Lample ACAlexis Conneau

Indexed inarxivdatacite

Abstract

Recent studies have demonstrated the efficiency of generative pretraining for English natural language understanding. In this work, we extend this approach to multiple languages and show the effectiveness of cross-lingual pretraining. We propose two methods to learn cross-lingual language models (XLMs): one unsupervised that only relies on monolingual data, and one supervised that leverages parallel data with a new cross-lingual language model objective. We obtain state-of-the-art results on cross-lingual classification, unsupervised and supervised machine translation. On XNLI, our approach pushes the state of the art by an absolute gain of 4.9% accuracy. On unsupervised machine translation, we obtain 34.3…

Citation impact

1,620

total citations

FWCI: —
Percentile: —
References: 35

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

BLEU
Computer science
Machine translation
Artificial intelligence
Natural language processing
Translation (biology)
Generative grammar
Code (set theory)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.