Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey

Min, Bonan; Ross, Hayley; Sulem, Elior; Veyseh, Amir Pouran Ben; Nguyen, Thien Huu; Sainz, Oscar; Agirre, Eneko; Heintz, Ilana; Roth, Dan

doi:10.1145/3605943

Review · ACM Computing Surveys · Jun 27, 2023 · Closed access

Amazon (United States) · Harvard University Press · +4 more institutions

Indexed in Crossref

Abstract

Large, pre-trained language models (PLMs) such as BERT and GPT have drastically changed the Natural Language Processing (NLP) field. For numerous NLP tasks, approaches leveraging PLMs have achieved state-of-the-art performance. The key idea is to learn a generic, latent representation of language from a generic task once, then share it across disparate NLP tasks. Language modeling serves as the generic task, one with abundant self-supervised text available for extensive training. This article presents the key fundamental concepts of PLM architectures and a comprehensive view of the shift to PLM-driven NLP techniques. It surveys work applying the pre-training then fine-tuning, prompting, and text generation…

Citation impact

Total citations: 1,139

FWCI: 185.61
Percentile: 100%
References: 169

Authors: 9

Topics & keywords

Keywords

Computer science
Task (project management)
Artificial intelligence
Natural language processing
Language model
Key (lock)
Representation (politics)
Language understanding

UN Sustainable Development Goals

Quality Education
