An outliers detection and elimination framework in classification task of data mining
Fakir Mohan University · Indian Statistical Institute
Abstract
An outlier is a datum that is far from other data points in which it occurs. It can have a considerable impact on the output. Therefore, removing or resolving it before the analysis is essential to prevent skewing. Outliers in a survey sampling can have a significant outcome on statistical results. The goal of discovering outliers in data mining is to find a pattern in data that does not conform to expected behavior. In this paper, we have proposed a framework in which a popular statistical approach termed Inter-Quartile Range (IQR) is used to detect outliers in data and deal with them by Winsorizing method. A radial basis function network trained by teaching a learning-based optimization model is developed…
Citation impact
- FWCI
- 42.34
- Percentile
- 100%
- References
- 60
Authors
4Topics & keywords
- Outlier
- Computer science
- Data mining
- Preprocessor
- Data pre-processing
- Task (project management)
- Range (aeronautics)
- Anomaly detection