Shared computational principles for language processing in humans and deep language models
Google (United States) · Princeton University · +5 more institutions
Abstract
Departing from traditional linguistic models, advances in deep learning have resulted in a new type of predictive (autoregressive) deep language models (DLMs). Using a self-supervised next-word prediction task, these models generate appropriate linguistic responses in a given context. In the current study, nine participants listened to a 30-min podcast while their brain responses were recorded using electrocorticography (ECoG). We provide empirical evidence that the human brain and autoregressive DLMs share three fundamental computational principles as they process the same natural narrative: (1) both are engaged in continuous next-word prediction before word onset; (2) both match their pre-onset predictions…
Citation impact
- FWCI
- 47.09
- Percentile
- 100%
- References
- 66
Authors
32Topics & keywords
- Surprise
- Computer science
- Autoregressive model
- Artificial intelligence
- Context (archaeology)
- Computational model
- Natural language processing
- Word (group theory)
- Quality Education