Angle-based outlier detection in high-dimensional data
Ludwig-Maximilians-Universität München
Abstract
Detecting outliers in a large set of data objects is a major data mining task aiming at finding different mechanisms responsible for different groups of objects in a data set. All existing approaches, however, are based on an assessment of distances (sometimes indirectly by assuming certain distributions) in the full-dimensional Euclidean data space. In high-dimensional data, these approaches are bound to deteriorate due to the notorious "curse of dimensionality". In this paper, we propose a novel approach named ABOD (Angle-Based Outlier Detection) and some variants assessing the variance in the angles between the difference vectors of a point to the other points. This way, the effects of the "curse of…
Citation impact
- FWCI
- 16.45
- Percentile
- 100%
- References
- 44
Authors
3Topics & keywords
- Outlier
- Curse of dimensionality
- Computer science
- Ranking (information retrieval)
- Anomaly detection
- Data set
- Set (abstract data type)
- Euclidean distance