preprintJan 1, 2019GOLD OA
How Multilingual is Multilingual BERT?
Indexed incrossref
Abstract
In this paper, we show that Multilingual BERT (M-BERT), released by To understand why, we present a large number of probing experiments, showing that transfer is possible even to languages in different scripts, that transfer works best between typologically similar languages, that monolingual corpora can train models for code-switching, and that the model can find translation pairs. From these results, we can conclude that M-BERT does create multilingual representations, but that these representations exhibit systematic deficiencies affecting certain language pairs.
Citation impact
1,143
total citations
- FWCI
- 85.60
- Percentile
- 100%
- References
- 19
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Computer science
- Scripting language
- Natural language processing
- Artificial intelligence
- Task (project management)
- Language model
- Transfer (computing)
- Zero (linguistics)
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.