articleGlobal Transitions ProceedingsApr 3, 2022DIAMOND OA

A review: Data pre-processing and data augmentation techniques

University of Mumbai

Indexed incrossref

Abstract

This review paper provides an overview of data pre-processing in Machine learning, focusing on all types of problems while building the machine learning problems. It deals with two significant issues in the pre-processing process (i). issues with data and (ii). Steps to follow to do data analysis with its best approach. As raw data are vulnerable to noise, corruption, missing, and inconsistent data, it is necessary to perform pre-processing steps, which is done using classification, clustering, and association and many other pre-processing techniques available. Poor data can primarily affect the accuracy and lead to false prediction, so it is necessary to improve the dataset's quality. So, data pre-processing…

Citation impact

1,192
total citations
FWCI
96.84
Percentile
100%
References
24
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Raw data
  • Data processing
  • Machine learning
  • Cluster analysis
  • Data quality
  • Data transformation
  • Data cleansing
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.