The Curious Case of Neural Text Degeneration
University of Washington · University of Cape Town
Abstract
Despite considerable advancements with deep neural language models, the enigma of neural text degeneration persists when these models are tested as text generators. The counter-intuitive empirical observation is that even though the use of likelihood as training objective leads to high quality models for a broad range of language understanding tasks, using likelihood as a decoding objective leads to text that is bland and strangely repetitive. In this paper, we reveal surprising distributional differences between human text and machine text. In addition, we find that decoding strategies alone can dramatically effect the quality of machine text, even when generated from exactly the same neural language model.…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 40
Authors
5Topics & keywords
- Degeneration (medical)
- Neuroscience
- Computer science
- Psychology
- Medicine
- Pathology
- Quality Education