articleBig Data AnalyticsSep 27, 2016GOLD OA

Big data preprocessing: methods and prospects

Universidad de Granada

Indexed incrossrefdoaj

Abstract

The massive growth in the scale of data has been observed in recent years being a key factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety of data that require a new high-performance processing. Addressing big data is a challenging and time-demanding task that requires a large computational infrastructure to ensure successful data processing and analysis. The presence of data preprocessing methods for data mining in big data is reviewed in this paper. The definition, characteristics, and categorization of data preprocessing approaches in big data are introduced. The connection between big data and data preprocessing throughout all families of methods and big data…

Citation impact

691
total citations
FWCI
64.19
Percentile
100%
References
129
Citations per year

Authors

5

Topics & keywords

Keywords
  • Big data
  • SPARK (programming language)
  • Computer science
  • Data pre-processing
  • Data science
  • Preprocessor
  • Variety (cybernetics)
  • Categorization
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.

Funding