The Curious Case of Neural Text Degeneration

Holtzman, Ari; Buys, Jan; Du, Li; Forbes, Maxwell; Choi, Yejin

doi:10.48550/arxiv.1904.09751

preprintarXiv (Cornell University)Apr 22, 2019GREEN OA

The Curious Case of Neural Text Degeneration

AHAri Holtzman JBJan Buys LDLi Du MFMaxwell Forbes YCYejin Choi

University of Washington · University of Cape Town

Indexed inarxivdatacite

Abstract

Despite considerable advancements with deep neural language models, the enigma of neural text degeneration persists when these models are tested as text generators. The counter-intuitive empirical observation is that even though the use of likelihood as training objective leads to high quality models for a broad range of language understanding tasks, using likelihood as a decoding objective leads to text that is bland and strangely repetitive. In this paper, we reveal surprising distributional differences between human text and machine text. In addition, we find that decoding strategies alone can dramatically effect the quality of machine text, even when generated from exactly the same neural language model.…

Citation impact

1,110

total citations

FWCI: —
Percentile: —
References: 40

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Degeneration (medical)
Neuroscience
Computer science
Psychology
Medicine
Pathology

UN Sustainable Development Goals

Quality Education

No related works found for this paper.