articleNatureSep 25, 2024HYBRID OA

Larger and more instructable language models become less reliable

University of Cambridge · Artificial Intelligence Research Institute · +3 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Abstract The prevailing methods to make large language models more powerful and amenable have been based on continuous scaling up (that is, increasing their size, data volume and computational resources 1 ) and bespoke shaping up (including post-filtering 2,3 , fine tuning or use of human feedback 4,5 ). However, larger and more instructable large language models may have become less reliable. By studying the relationship between difficulty concordance, task avoidance and prompting stability of several language model families, here we show that easy instances for human participants are also easy for the models, but scaled-up, shaped-up models do not secure areas of low difficulty in which either the model does…

No related works found for this paper.

Funding