Larger and more instructable language models become less reliable
University of Cambridge · Artificial Intelligence Research Institute · +3 more institutions
Abstract
Abstract The prevailing methods to make large language models more powerful and amenable have been based on continuous scaling up (that is, increasing their size, data volume and computational resources 1 ) and bespoke shaping up (including post-filtering 2,3 , fine tuning or use of human feedback 4,5 ). However, larger and more instructable large language models may have become less reliable. By studying the relationship between difficulty concordance, task avoidance and prompting stability of several language model families, here we show that easy instances for human participants are also easy for the models, but scaled-up, shaped-up models do not secure areas of low difficulty in which either the model does…
Citation impact
- FWCI
- 47.01
- Percentile
- 100%
- References
- 51
Authors
6- LZLexin ZhouCorresponding
University of Cambridge, Artificial Intelligence Research Institute, Universitat Politècnica de València
- WSWout Schellaert
University of Cambridge, Leverhulme Trust, Artificial Intelligence Research Institute, Universitat Politècnica de València
- FMFernando Martínez‐Plumed
Artificial Intelligence Research Institute, Universitat Politècnica de València
- YMYael Moros-Daval
Artificial Intelligence Research Institute, Universitat Politècnica de València
- CFCèsar Ferri
Artificial Intelligence Research Institute, Universitat Politècnica de València
Topics & keywords
- Bespoke
- Computer science
- Task (project management)
- Language model
- Stability (learning theory)
- Scaling
- Cognitive psychology
- Data science
- Quality Education
Funding
- OPOpen Philanthropy Project
- GVGeneralitat ValencianaAward: CIPROM/2022/6
- DADefense Advanced Research Projects Agency
- H2Horizon 2020 Framework ProgrammeAwards: EC H2020, 952215
- EREuropean Regional Development FundAwards: MCIN/AEI/10, H2020, PID2021-122830OB-C42, MICIU/AEI/10
- AEAgencia Estatal de InvestigaciónAwards: 13039, PID2021-122830OB-C42, 10.13039, H2020, AEI/10, AEI/10.