preprintJan 1, 2019GOLD OA

How Multilingual is Multilingual BERT?

Google (United States)

Indexed incrossref

Abstract

In this paper, we show that Multilingual BERT (M-BERT), released by To understand why, we present a large number of probing experiments, showing that transfer is possible even to languages in different scripts, that transfer works best between typologically similar languages, that monolingual corpora can train models for code-switching, and that the model can find translation pairs. From these results, we can conclude that M-BERT does create multilingual representations, but that these representations exhibit systematic deficiencies affecting certain language pairs.

Citation impact

1,143
total citations
FWCI
85.60
Percentile
100%
References
19
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Scripting language
  • Natural language processing
  • Artificial intelligence
  • Task (project management)
  • Language model
  • Transfer (computing)
  • Zero (linguistics)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.