A Systematic Review of Synthetic Data Generation Techniques Using Generative AI
Indexed incrossref
Abstract
Synthetic data are increasingly being recognized for their potential to address serious real-world challenges in various domains. They provide innovative solutions to combat the data scarcity, privacy concerns, and algorithmic biases commonly used in machine learning applications. Synthetic data preserve all underlying patterns and behaviors of the original dataset while altering the actual content. The methods proposed in the literature to generate synthetic data vary from large language models (LLMs), which are pre-trained on gigantic datasets, to generative adversarial networks (GANs) and variational autoencoders (VAEs). This study provides a systematic review of the various techniques proposed in the…
Citation impact
176
total citations
- FWCI
- 42.41
- Percentile
- 100%
- References
- 41
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Generative grammar
- Computer science
- Synthetic data
- Artificial intelligence
- Data science
- Data mining
No related works found for this paper.