Unified Language Model Pre-training for Natural Language Understanding\n and Generation
Indexed inarxiv
Abstract
This paper presents a new Unified pre-trained Language Model (UniLM) that can\nbe fine-tuned for both natural language understanding and generation tasks. The\nmodel is pre-trained using three types of language modeling tasks:\nunidirectional, bidirectional, and sequence-to-sequence prediction. The unified\nmodeling is achieved by employing a shared Transformer network and utilizing\nspecific self-attention masks to control what context the prediction conditions\non. UniLM compares favorably with BERT on the GLUE benchmark, and the SQuAD 2.0\nand CoQA question answering tasks. Moreover, UniLM achieves new\nstate-of-the-art results on five natural language generation datasets,\nincluding improving the…
Citation impact
539
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
9Topics & keywords
Topics
Keywords
- Computer science
- Automatic summarization
- Natural language generation
- Question answering
- Language model
- Natural language processing
- Artificial intelligence
- Transformer
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.