Unsupervised feature selection using feature similarity

Indian Statistical Institute

Indexed incrossref

Abstract

In this article, we describe an unsupervised feature selection algorithm suitable for data sets, large in both dimension and size. The method is based on measuring similarity between features whereby redundancy therein is removed. This does not need any search and, therefore, is fast. A new feature similarity measure, called maximum information compression index, is introduced. The algorithm is generic in nature and has the capability of multiscale representation of data sets. The superiority of the algorithm, in terms of speed and performance, is established extensively over various real-life data sets of different sizes and dimensions. It is also demonstrated how redundancy and information loss in feature…

Citation impact

1,458
total citations
FWCI
11.87
Percentile
100%
References
31
Citations per year

Authors

3

Topics & keywords

Keywords
  • Pattern recognition (psychology)
  • Feature selection
  • Artificial intelligence
  • Computer science
  • Minimum redundancy feature selection
  • Entropy (arrow of time)
  • Redundancy (engineering)
  • Feature (linguistics)
No related works found for this paper.