preprintACM Computing SurveysApr 23, 2026HYBRID OA

Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AI

Zhejiang University

Indexed inarxivcrossrefdatacite

Abstract

Machine learning (ML) on tabular data is ubiquitous, yet obtaining abundant high-quality tabular data for model training remains a significant obstacle. Numerous works have focused on tabular data augmentation (TDA) to enhance the original table with additional data, thereby improving downstream ML tasks. Recently, there has been a growing interest in leveraging the capabilities of generative AI for TDA. Therefore, we believe it is time to provide a comprehensive review of the progress and future prospects of TDA, with a particular emphasis on the trending generative AI. Specifically, we present an architectural view of the TDA pipeline, comprising three main procedures: pre-augmentation, augmentation, and…

Citation impact

10
total citations
FWCI
0.00
Percentile
99%
References
0
Citations per year

Authors

5

Topics & keywords

Keywords
  • Generative grammar
  • Artificial intelligence
  • Computer science
  • Machine learning
No related works found for this paper.