Emergent Abilities of Large Language Models

Jason, Wei,; Tay, Yi; Bommasani, Rishi; Raffel, Colin; Zoph, Barret; Borgeaud, Sebastian; Yogatama, Dani; Bosma, Maarten; Zhou, Denny; Metzler, Donald; H., Ed; Hashimoto, Tatsunori; Vinyals, Oriol; Liang, Percy; Dean, Jeff; Fedus, William

doi:10.48550/arxiv.2206.07682

preprintarXiv (Cornell University)Jun 15, 2022GREEN OA

Emergent Abilities of Large Language Models

WJWei, JasonYTYi Tay RBRishi Bommasani CRColin Raffel BZBarret Zoph

Indexed inarxivdatacite

Abstract

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.

Citation impact

1,029

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

16

Topics & keywords

Topics

Keywords

Scaling
Computer science
Language model
Range (aeronautics)
Phenomenon
Sample (material)
Cognitive psychology
Artificial intelligence

UN Sustainable Development Goals

Quality Education

No related works found for this paper.