Article | IEEE Transactions on Knowledge and Data Engineering | Aug 25, 2011 | Closed access

A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data

Xi'an Jiaotong University

Indexed in Crossref

Abstract

Feature selection involves identifying a subset of the most useful features that produces results compatible with those of the original, entire set of features. A feature selection algorithm may be evaluated from both the efficiency and the effectiveness points of view. Efficiency concerns the time required to find a subset of features, while effectiveness relates to the quality of that subset. Based on these criteria, a fast clustering-based feature selection algorithm (FAST) is proposed and experimentally evaluated in this paper. The FAST algorithm works in two steps. In the first step, features are divided into clusters by using graph-theoretic clustering methods. In the second step, the most…
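The first step described in the abstract, grouping features with a graph-theoretic method, can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the paper's stated procedure: the abstract excerpt does not name the similarity measure or the exact graph construction, so this sketch uses absolute Pearson correlation as edge similarity and a minimum spanning tree whose weak edges are cut at a hypothetical `threshold`. The function names `abs_pearson` and `cluster_features` are illustrative only. The second step is truncated in the abstract and is not sketched.

```python
from itertools import combinations
from math import sqrt


def abs_pearson(x, y):
    """Absolute Pearson correlation of two equal-length numeric columns.

    Assumption: used here as a stand-in similarity measure between features.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no correlation signal
    return abs(cov / sqrt(vx * vy))


def cluster_features(columns, threshold=0.5):
    """Group feature indices by cutting weak edges of a minimum spanning tree.

    columns: list of feature columns, each a list of numbers.
    threshold: hypothetical similarity cutoff; edges with |r| below it are cut.
    """
    n = len(columns)
    # Complete graph over features; edge weight is dissimilarity 1 - |r|.
    edges = sorted(
        (1.0 - abs_pearson(columns[i], columns[j]), i, j)
        for i, j in combinations(range(n), 2)
    )

    parent = list(range(n))  # union-find forest over feature indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Kruskal's algorithm, stopped once edges become too dissimilar. The
    # components obtained match those of the full MST with heavy edges removed.
    for weight, i, j in edges:
        if weight > 1.0 - threshold:
            break
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[rj] = ri

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


if __name__ == "__main__":
    features = [
        [1, 2, 3, 4, 5],
        [2, 4, 6, 8, 10],   # scaled copy of feature 0
        [5, 1, 4, 2, 3],
        [10, 2, 8, 4, 6],   # scaled copy of feature 2
    ]
    print(cluster_features(features, threshold=0.5))  # [[0, 1], [2, 3]]
```

In this toy input, features 0 and 1 are perfectly correlated, as are features 2 and 3, while the cross-pair correlation is 0.3 in absolute value, so a cutoff of 0.5 separates the two groups. Lowering the cutoff below 0.3 would merge them into one cluster.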

Citation impact

  • Total citations: 666
  • FWCI: 9.84
  • Percentile: 100%
  • References: 102

Authors

3

Topics & keywords

Keywords
  • Cluster analysis
  • Computer science
  • Feature selection
  • Pattern recognition (psychology)
  • Feature (linguistics)
  • Data mining
  • Artificial intelligence
  • Naive Bayes classifier
