Multilingual Denoising Pre-training for Neural Machine Translation

Liu, Yinhan; Gu, Jiatao; Goyal, Naman; Li, Xian; Edunov, Sergey; Ghazvininejad, Marjan; Lewis, Mike; Zettlemoyer, Luke

doi:10.1162/tacl_a_00343

articleTransactions of the Association for Computational LinguisticsNov 25, 2020DIAMOND OA

Multilingual Denoising Pre-training for Neural Machine Translation

YLYinhan Liu JGJiatao Gu NGNaman Goyal XLXian Li SESergey Edunov

Bircham International University · Bansal Institute Of Research Technology & Science · +2 more institutions

Indexed incrossrefdoaj

Abstract

This paper demonstrates that multilingual denoising pre-training produces significant performance gains across a wide variety of machine translation (MT) tasks. We present mBART—a sequence-to-sequence denoising auto-encoder pre-trained on large-scale monolingual corpora in many languages using the BART objective (Lewis et al., 2019 ). mBART is the first method for pre-training a complete sequence-to-sequence model by denoising full texts in multiple languages, whereas previous approaches have focused only on the encoder, decoder, or reconstructing parts of the text. Pre-training a complete model allows it to be directly fine-tuned for supervised (both sentence-level and document-level) and unsupervised machine…

Citation impact

1,013

total citations

FWCI: 77.80
Percentile: 100%
References: 57

Citations per year

Authors

8

Topics & keywords

Topics

Keywords

Computer science
Machine translation
Initialization
Artificial intelligence
Encoder
Sentence
Natural language processing
BLEU

UN Sustainable Development Goals

Quality Education

No related works found for this paper.