Data Preprocessing For Supervised Leaning
Indexed indatacite
Abstract
Many factors affect the success of Machine Learning (ML) on a given task. The representation and quality of the instance data is first and foremost. If there is much irrelevant and redundant information present or noisy and unreliable data, then knowledge discovery during the training phase is more difficult. It is well known that data preparation and filtering steps take considerable amount of processing time in ML problems. Data pre-processing includes data cleaning, normalization, transformation, feature extraction and selection, etc. The product of data pre-processing is the final training set. It would be nice if a single sequence of data pre-processing algorithms had the best performance for each data…
Citation impact
644
total citations
- FWCI
- 3.96
- Percentile
- 100%
- References
- 37
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Preprocessor
- Computer science
- Data pre-processing
- Artificial intelligence
- Data mining
- Pattern recognition (psychology)
No related works found for this paper.