ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding

Baidu (China)

Indexed incrossref

Abstract

Recently pre-trained models have achieved state-of-the-art results in various language understanding tasks. Current pre-training procedures usually focus on training the model with several simple tasks to grasp the co-occurrence of words or sentences. However, besides co-occurring information, there exists other valuable lexical, syntactic and semantic information in training corpora, such as named entities, semantic closeness and discourse relations. In order to extract the lexical, syntactic and semantic information from training corpora, we propose a continual pre-training framework named ERNIE 2.0 which incrementally builds pre-training tasks and then learn pre-trained models on these constructed tasks via…

Citation impact

736
total citations
FWCI
58.13
Percentile
100%
References
27
Citations per year

Authors

7

Topics & keywords

Keywords
  • Computer science
  • Natural language processing
  • Artificial intelligence
  • Task (project management)
  • GRASP
  • Construct (python library)
  • Focus (optics)
  • Language model
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.