Unified Language Model Pre-training for Natural Language Understanding\n and Generation

Dong, Li; Yang, Nan; Wang, Wenhui; Wei, Furu; Liu, Xiaodong; Wang, Yu; Gao, Jianfeng; Zhou, Ming; Hon, Hsiao-Wuen

doi:10.48550/arxiv.1905.03197

preprintarXiv (Cornell University)May 8, 2019GREEN OA

Unified Language Model Pre-training for Natural Language Understanding\n and Generation

LDLi Dong NYNan Yang WWWenhui Wang FWFuru Wei XLXiaodong Liu

Indexed inarxiv

Abstract

This paper presents a new Unified pre-trained Language Model (UniLM) that can\nbe fine-tuned for both natural language understanding and generation tasks. The\nmodel is pre-trained using three types of language modeling tasks:\nunidirectional, bidirectional, and sequence-to-sequence prediction. The unified\nmodeling is achieved by employing a shared Transformer network and utilizing\nspecific self-attention masks to control what context the prediction conditions\non. UniLM compares favorably with BERT on the GLUE benchmark, and the SQuAD 2.0\nand CoQA question answering tasks. Moreover, UniLM achieves new\nstate-of-the-art results on five natural language generation datasets,\nincluding improving the…

Citation impact

539

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

9

Topics & keywords

Topics

Keywords

Computer science
Automatic summarization
Natural language generation
Question answering
Language model
Natural language processing
Artificial intelligence
Transformer

UN Sustainable Development Goals

Quality Education

No related works found for this paper.