A survey on unsupervised outlier detection in high‐dimensional numerical data
University of Alberta · Ludwig-Maximilians-Universität München
Abstract
Abstract High‐dimensional data in Euclidean space pose special challenges to data mining algorithms. These challenges are often indiscriminately subsumed under the term ‘curse of dimensionality’, more concrete aspects being the so‐called ‘distance concentration effect’, the presence of irrelevant attributes concealing relevant information, or simply efficiency issues. In about just the last few years, the task of unsupervised outlier detection has found new specialized solutions for tackling high‐dimensional data in Euclidean space. These approaches fall under mainly two categories, namely considering or not considering subspaces (subsets of attributes) for the definition of outliers. The former are…
Citation impact
- FWCI
- 33.59
- Percentile
- 100%
- References
- 166
Authors
3Topics & keywords
- Outlier
- Curse of dimensionality
- Anomaly detection
- Computer science
- Linear subspace
- Data mining
- Euclidean distance
- Clustering high-dimensional data
- Reduced inequalities