Mathematical discoveries from program search with large language models
Google DeepMind (United Kingdom) · Google (United Kingdom) · +5 more institutions
Abstract
Abstract Large language models (LLMs) have demonstrated tremendous capabilities in solving complex tasks, from quantitative reasoning to understanding natural language. However, LLMs sometimes suffer from confabulations (or hallucinations), which can result in them making plausible but incorrect statements 1,2 . This hinders the use of current large models in scientific discovery. Here we introduce FunSearch (short for searching in the function space), an evolutionary procedure based on pairing a pretrained LLM with a systematic evaluator. We demonstrate the effectiveness of this approach to surpass the best-known results in important problems, pushing the boundary of existing LLM-based approaches 3 . Applying…
Citation impact
- FWCI
- 62.85
- Percentile
- 100%
- References
- 70
Authors
12- BRBernardino Romera‐ParedesCorresponding
Google DeepMind (United Kingdom), Google (United Kingdom)
- MBMohammadamin Barekatain
Google DeepMind (United Kingdom), Google (United Kingdom)
- ANAlexander Novikov
Google DeepMind (United Kingdom), Google (United Kingdom)
- MBMatej Balog
Google DeepMind (United Kingdom), Google (United Kingdom)
- MKManish Kumar
Google DeepMind (United Kingdom), Google (United Kingdom)
Topics & keywords
- Computer science
- Programming language
- Data science