Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AI
Abstract
Machine learning (ML) on tabular data is ubiquitous, yet obtaining abundant high-quality tabular data for model training remains a significant obstacle. Numerous works have focused on tabular data augmentation (TDA) to enhance the original table with additional data, thereby improving downstream ML tasks. Recently, there has been a growing interest in leveraging the capabilities of generative AI for TDA. Therefore, we believe it is time to provide a comprehensive review of the progress and future prospects of TDA, with a particular emphasis on the trending generative AI. Specifically, we present an architectural view of the TDA pipeline, comprising three main procedures: pre-augmentation, augmentation, and…
Citation impact
- FWCI
- 0.00
- Percentile
- 99%
- References
- 0
Authors
5- LCLingxi CuiCorresponding
Zhejiang University
- HLHuan Li
Zhejiang University
- KCKe Chen
Zhejiang University
- LSLidan Shou
Zhejiang University
- GCGang Chen
Zhejiang University
Topics & keywords
- Generative grammar
- Artificial intelligence
- Computer science
- Machine learning