Representation Learning for Tabular Data: A Comprehensive Survey

Nanjing University

PubMed
Indexed incrossrefpubmed

Abstract

Tabular data, structured as rows and columns, is among the most prevalent data types in machine learning classification and regression applications. Models for learning from tabular data have continuously evolved, with Deep Neural Networks (DNNs) recently demonstrating promising results through their capability of representation learning. In this survey, we systematically introduce the field of tabular representation learning, covering the background, challenges, and benchmarks, along with the pros and cons of using DNNs. We organize existing methods into three main categories according to their generalization capabilities: specialized, transferable, and general models. Specialized models focus on tasks where…

Citation impact

5
total citations
FWCI
65.34
Percentile
100%
References
106
Citations per year

Authors

5

Topics & keywords

Keywords
  • Generalization
  • Field (mathematics)
  • Representation (politics)
  • Artificial neural network
  • Deep learning
  • Feature learning
  • External Data Representation
  • Key (lock)
No related works found for this paper.

Funding