articleNatureJan 8, 2025HYBRID OA

Accurate predictions on small data with a tabular foundation model

University of Freiburg · The Priory Hospital · +4 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Abstract Tabular data, spreadsheets organized in rows and columns, are ubiquitous across scientific fields, from biomedicine to particle physics to economics and climate science 1,2 . The fundamental prediction task of filling in missing values of a label column based on the rest of the columns is essential for various applications as diverse as biomedical risk models, drug discovery and materials science. Although deep learning has revolutionized learning from raw data and led to numerous high-profile success stories 3–5 , gradient-boosted decision trees 6–9 have dominated tabular data for the past 20 years. Here we present the Tabular Prior-data Fitted Network (TabPFN), a tabular foundation model that…

Citation impact

564
total citations
FWCI
1056.27
Percentile
100%
References
43
Citations per year

Authors

8

Topics & keywords

Keywords
  • Computer science
  • Raw data
  • Artificial intelligence
  • Deep learning
  • Machine learning
  • Ensemble learning
  • Margin (machine learning)
  • Data mining
UN Sustainable Development Goals
  • Climate action
No related works found for this paper.

Funding