reviewElectronicsSep 4, 2024GOLD OA

A Systematic Review of Synthetic Data Generation Techniques Using Generative AI

Ontario Tech University

Indexed incrossref

Abstract

Synthetic data are increasingly being recognized for their potential to address serious real-world challenges in various domains. They provide innovative solutions to combat the data scarcity, privacy concerns, and algorithmic biases commonly used in machine learning applications. Synthetic data preserve all underlying patterns and behaviors of the original dataset while altering the actual content. The methods proposed in the literature to generate synthetic data vary from large language models (LLMs), which are pre-trained on gigantic datasets, to generative adversarial networks (GANs) and variational autoencoders (VAEs). This study provides a systematic review of the various techniques proposed in the…

Citation impact

176
total citations
FWCI
42.41
Percentile
100%
References
41
Citations per year

Authors

2

Topics & keywords

Keywords
  • Generative grammar
  • Computer science
  • Synthetic data
  • Artificial intelligence
  • Data science
  • Data mining
No related works found for this paper.