Lost in the Middle: How Language Models Use Long Contexts

Liu, Nelson F.; Lin, Kevin; Hewitt, John; Paranjape, Ashwin; Bevilacqua, Michele; Petroni, Fabio; Liang, Percy

doi:10.1162/tacl_a_00638

Article · Transactions of the Association for Computational Linguistics · Jan 1, 2024 · Diamond OA

Stanford University · University of California, Berkeley

Indexed in: Crossref, DOAJ

Abstract

While recent language models have the ability to take long contexts as input, relatively little is known about how well they use longer context. We analyze the performance of language models on two tasks that require identifying relevant information in their input contexts: multi-document question answering and key-value retrieval. We find that performance can degrade significantly when changing the position of relevant information, indicating that current language models do not robustly make use of information in long input contexts. In particular, we observe that performance is often highest when relevant information occurs at the beginning or end of the input context, and significantly degrades…
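The position-sensitivity probe described in the abstract can be illustrated with a synthetic key-value retrieval prompt. The following is a minimal sketch, not the authors' code: the function names `make_kv_prompt` and `positions_to_probe`, the UUID-style keys and values, the JSON layout, and the instruction wording are all assumptions made for illustration. The idea is to hold the context fixed, move only the queried pair, and compare model accuracy across positions.

```python
import json
import random
import uuid


def make_kv_prompt(num_pairs: int, target_index: int, seed: int = 0):
    """Build a synthetic key-value retrieval prompt (hypothetical helper).

    Returns (prompt, expected_value). The queried pair sits at position
    `target_index` among `num_pairs` randomly generated pairs.
    """
    if not 0 <= target_index < num_pairs:
        raise ValueError("target_index must be in [0, num_pairs)")
    rng = random.Random(seed)
    # Seeded 128-bit UUID strings as keys and values (an assumption of this sketch).
    pairs = [
        (
            str(uuid.UUID(int=rng.getrandbits(128))),
            str(uuid.UUID(int=rng.getrandbits(128))),
        )
        for _ in range(num_pairs)
    ]
    query_key, expected_value = pairs[target_index]
    # dicts keep insertion order, so the serialized position follows list order.
    context = json.dumps(dict(pairs), indent=1)
    prompt = (
        "Extract the value corresponding to the specified key "
        "in the JSON object below.\n\n"
        f"{context}\n\n"
        f'Key: "{query_key}"\nCorresponding value:'
    )
    return prompt, expected_value


def positions_to_probe(num_pairs: int, steps: int = 5):
    """Evenly spaced target positions from the start to the end of the context."""
    if steps < 2 or num_pairs < 1:
        raise ValueError("need steps >= 2 and num_pairs >= 1")
    return sorted({round(i * (num_pairs - 1) / (steps - 1)) for i in range(steps)})


if __name__ == "__main__":
    for pos in positions_to_probe(num_pairs=75):
        prompt, expected = make_kv_prompt(num_pairs=75, target_index=pos)
        # A real experiment would send `prompt` to a model and check whether
        # `expected` appears in its output; here we only report prompt size.
        print(pos, len(prompt))
```

Because the seed fixes the generated pairs, changing `target_index` changes only which pair is queried, so any accuracy difference across positions can be attributed to where the relevant pair sits in the context.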

Citation impact

Total citations: 857

FWCI: 261.68
Percentile: 100%
References: 55

Authors: 7

Topics & keywords

Keywords

Computer science
Language model
Natural language processing
Artificial intelligence
Linguistics
Data science

UN Sustainable Development Goals

Quality Education
