A survey on unsupervised outlier detection in high‐dimensional numerical data

University of Alberta · Ludwig-Maximilians-Universität München

Indexed incrossref

Abstract

Abstract High‐dimensional data in Euclidean space pose special challenges to data mining algorithms. These challenges are often indiscriminately subsumed under the term ‘curse of dimensionality’, more concrete aspects being the so‐called ‘distance concentration effect’, the presence of irrelevant attributes concealing relevant information, or simply efficiency issues. In about just the last few years, the task of unsupervised outlier detection has found new specialized solutions for tackling high‐dimensional data in Euclidean space. These approaches fall under mainly two categories, namely considering or not considering subspaces (subsets of attributes) for the definition of outliers. The former are…

Citation impact

850
total citations
FWCI
33.59
Percentile
100%
References
166
Citations per year

Authors

3

Topics & keywords

Keywords
  • Outlier
  • Curse of dimensionality
  • Anomaly detection
  • Computer science
  • Linear subspace
  • Data mining
  • Euclidean distance
  • Clustering high-dimensional data
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.