Mathematical discoveries from program search with large language models

Romera‐Paredes, Bernardino; Barekatain, Mohammadamin; Novikov, Alexander; Balog, Matej; Kumar, Manish; Dupont, Emilien; Ruiz, Francisco J. R.; Ellenberg, Jordan S.; Wang, Pengming; Fawzi, Omar; Kohli, Pushmeet; Fawzi, Alhussein

doi:10.1038/s41586-023-06924-6

Article · Nature · Dec 14, 2023 · Hybrid OA

Google DeepMind (United Kingdom) · Google (United Kingdom) · +5 more institutions

Indexed in: Crossref, PubMed

Abstract

Large language models (LLMs) have demonstrated tremendous capabilities in solving complex tasks, from quantitative reasoning to understanding natural language. However, LLMs sometimes suffer from confabulations (or hallucinations), which can result in them making plausible but incorrect statements [1,2]. This hinders the use of current large models in scientific discovery. Here we introduce FunSearch (short for searching in the function space), an evolutionary procedure based on pairing a pretrained LLM with a systematic evaluator. We demonstrate the effectiveness of this approach to surpass the best-known results in important problems, pushing the boundary of existing LLM-based approaches [3]. Applying…
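The abstract describes pairing a proposer with a systematic evaluator inside an evolutionary loop. The toy sketch below illustrates only that general pattern and is not the FunSearch implementation: the LLM is replaced by a hypothetical random-perturbation function `propose`, the "programs" are plain pairs of numbers, and `evaluate`, `evolve`, the tournament selection and the toy objective are all assumptions made for illustration.

```python
import random
from typing import List, Tuple

# Hypothetical stand-in for a "program": a pair of numeric parameters.
Candidate = Tuple[float, float]


def evaluate(candidate: Candidate) -> float:
    """Systematic evaluator: deterministic score, higher is better (toy objective)."""
    a, b = candidate
    return -((a - 3.0) ** 2 + (b + 1.0) ** 2)


def propose(parent: Candidate, rng: random.Random) -> Candidate:
    """Placeholder for the LLM: randomly perturbs a parent candidate."""
    a, b = parent
    return (a + rng.gauss(0.0, 0.5), b + rng.gauss(0.0, 0.5))


def evolve(
    seed_candidate: Candidate,
    iterations: int = 500,
    pool_size: int = 10,
    seed: int = 0,
) -> Tuple[float, Candidate]:
    """Keep a small pool of scored candidates and repeatedly refine the good ones."""
    rng = random.Random(seed)
    pool: List[Tuple[float, Candidate]] = [(evaluate(seed_candidate), seed_candidate)]
    for _ in range(iterations):
        # Tournament selection: pick the best of three random pool members.
        _, parent = max(rng.choices(pool, k=3))
        child = propose(parent, rng)
        # Every proposal is scored by the evaluator before it can enter the pool.
        pool.append((evaluate(child), child))
        pool.sort(reverse=True)
        del pool[pool_size:]  # keep only the top-scoring candidates
    return pool[0]


if __name__ == "__main__":
    best_score, best = evolve((0.0, 0.0))
    print(best_score, best)
```

The point of the pattern is that no candidate is kept on the proposer's say-so: each one is scored by the evaluator, so an implausible or incorrect proposal simply fails to enter the pool.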

Citation impact

Total citations: 333
FWCI: 62.85
Percentile: 100%
References: 70

Number of authors: 12

Topics & keywords

Keywords

Computer science
Programming language
Data science
