preprintarXiv (Cornell University)May 8, 2019GREEN OA

Unified Language Model Pre-training for Natural Language Understanding\n and Generation

Indexed inarxiv

Abstract

This paper presents a new Unified pre-trained Language Model (UniLM) that can\nbe fine-tuned for both natural language understanding and generation tasks. The\nmodel is pre-trained using three types of language modeling tasks:\nunidirectional, bidirectional, and sequence-to-sequence prediction. The unified\nmodeling is achieved by employing a shared Transformer network and utilizing\nspecific self-attention masks to control what context the prediction conditions\non. UniLM compares favorably with BERT on the GLUE benchmark, and the SQuAD 2.0\nand CoQA question answering tasks. Moreover, UniLM achieves new\nstate-of-the-art results on five natural language generation datasets,\nincluding improving the…

Citation impact

539
total citations
FWCI
Percentile
References
0
Citations per year

Authors

9

Topics & keywords

Keywords
  • Computer science
  • Automatic summarization
  • Natural language generation
  • Question answering
  • Language model
  • Natural language processing
  • Artificial intelligence
  • Transformer
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.