preprintAnnual Review of EconomicsApr 6, 2026GREEN OA

Large Language Models: An Applied Econometric Framework

National Bureau of Economic Research · Chicago Department of Public Health · +1 more institution

Indexed inarxivcrossrefdatacite

Abstract

Large language models (LLMs) enable researchers to analyze text at unprecedented scale and minimal cost. Researchers can now revisit old questions and tackle novel ones with rich data. We provide an econometric framework for realizing this potential in two empirical uses. For prediction problems—forecasting outcomes from text—valid conclusions require “no training leakage” between the LLM's training data and the researcher's sample, which can be enforced through careful model choice and research design. For estimation problems—automating the measurement of economic concepts for downstream analysis—valid downstream inference requires combining LLM outputs with a small validation sample to deliver consistent and…

Citation impact

4
total citations
FWCI
0.00
Percentile
98%
References
0
Too recent for citation history.

Authors

3

Topics & keywords

Keywords
  • Econometric model
  • Econometrics
  • Computer science
  • Economics
No related works found for this paper.

Funding