AI models collapse when trained on recursively generated data
University of Oxford · University of Cambridge · +5 more institutions
Abstract
) demonstrated high performance across a variety of language tasks. ChatGPT introduced such language models to the public. It is now clear that generative artificial intelligence (AI) such as large language models (LLMs) is here to stay and will substantially change the ecosystem of online text and images. Here we consider what may happen to GPT-{n} once LLMs contribute much of the text found online. We find that indiscriminate use of model-generated content in training causes irreversible defects in the resulting models, in which tails of the original content distribution disappear. We refer to this effect as 'model collapse' and show that it can occur in LLMs as well as in variational autoencoders (VAEs) and…
Citation impact
- FWCI
- 171.42
- Percentile
- 100%
- References
- 9
Authors
6Topics & keywords
- Generative grammar
- Generative model
- Intuition
- Computer science
- Variety (cybernetics)
- The Internet
- Artificial intelligence
- Data science
- Reduced inequalities