A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data

Song, Qinbao; Ni, Jingjie; Wang, Guangtao

doi:10.1109/tkde.2011.181

articleIEEE Transactions on Knowledge and Data EngineeringAug 25, 2011Closed access

A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data

QSQinbao Song JNJingjie Ni GWGuangtao Wang

Xi'an Jiaotong University

Indexed incrossref

Abstract

Feature selection involves identifying a subset of the most useful features that produces compatible results as the original entire set of features. A feature selection algorithm may be evaluated from both the efficiency and effectiveness points of view. While the efficiency concerns the time required to find a subset of features, the effectiveness is related to the quality of the subset of features. Based on these criteria, a fast clustering-based feature selection algorithm (FAST) is proposed and experimentally evaluated in this paper. The FAST algorithm works in two steps. In the first step, features are divided into clusters by using graph-theoretic clustering methods. In the second step, the most…

Citation impact

666

total citations

FWCI: 9.84
Percentile: 100%
References: 102

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Cluster analysis
Computer science
Feature selection
Pattern recognition (psychology)
Feature (linguistics)
Data mining
Artificial intelligence
Naive Bayes classifier

No related works found for this paper.

Funding

NN
National Natural Science Foundation of China