Shared computational principles for language processing in humans and deep language models

Goldstein, Ariel; Zada, Zaid; Buchnik, Eliav; Schain, Mariano; Price, Amy; Aubrey, Bobbi; Nastase, Samuel A.; Feder, Amir; Emanuel, Dotan; Cohen, Alon; Jansen, Aren; Gazula, Harshvardhan; Choe, Gina; Rao, Aditi; Kim, Catherine; Casto, Colton; Fanda, Lora; Doyle, Werner; Friedman, Daniel; Dugan, Patricia; Melloni, Lucía; Reichart, Roi; Devore, Sasha; Flinker, Adeen; Hasenfratz, Liat; Levy, Omer; Hassidim, Avinatan; Brenner, Michael P.; Matias, Yossi; Norman, Kenneth A.; Devinsky, Orrin; Hasson, Uri

doi:10.1038/s41593-022-01026-4

Article · Nature Neuroscience · Mar 1, 2022 · Hybrid OA

Google (United States) · Princeton University · +5 more institutions

Indexed in: Crossref, PubMed

Abstract

Departing from traditional linguistic models, advances in deep learning have resulted in a new type of predictive (autoregressive) deep language models (DLMs). Using a self-supervised next-word prediction task, these models generate appropriate linguistic responses in a given context. In the current study, nine participants listened to a 30-min podcast while their brain responses were recorded using electrocorticography (ECoG). We provide empirical evidence that the human brain and autoregressive DLMs share three fundamental computational principles as they process the same natural narrative: (1) both are engaged in continuous next-word prediction before word onset; (2) both match their pre-onset predictions…

Citation impact

Total citations: 443
FWCI: 47.09
Percentile: 100%
References: 66

Authors: 32

Topics & keywords

Keywords

Surprise
Computer science
Autoregressive model
Artificial intelligence
Context (archaeology)
Computational model
Natural language processing
Word (group theory)

UN Sustainable Development Goals

Quality Education
