Feature Selection for Unsupervised Learning

Dy, Jennifer; Brodley, Carla E.

doi:10.5555/1005332.1016787

Article | Dec 1, 2004 | Closed access

Abstract

In this paper, we identify two issues involved in developing an automated feature subset selection algorithm for unlabeled data: the need for finding the number of clusters in conjunction with feature selection, and the need for normalizing the bias of feature selection criteria with respect to dimension. We explore the feature selection problem and these issues through FSSEM (Feature Subset Selection using Expectation-Maximization (EM) clustering) and through two different performance criteria for evaluating candidate feature subsets: scatter separability and maximum likelihood. We present proofs on the dimensionality biases of these feature criteria, and present a cross-projection normalization scheme that…
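As a rough illustration of the scatter separability criterion named in the abstract, the sketch below computes trace(S_w^{-1} S_b) for a candidate feature subset. It is not the authors' FSSEM implementation: it assumes hard cluster labels rather than EM soft assignments, uses NumPy, and the function name `scatter_separability` and the toy data are hypothetical.

```python
import numpy as np

def scatter_separability(X, labels):
    """Return trace(S_w^{-1} S_b) for the given hard cluster labels.

    Assumption: hard labels stand in for the soft assignments that an
    EM clustering would produce.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n, d = X.shape
    overall_mean = X.mean(axis=0)
    S_w = np.zeros((d, d))  # within-cluster scatter
    S_b = np.zeros((d, d))  # between-cluster scatter
    for k in np.unique(labels):
        Xk = X[labels == k]
        pi_k = len(Xk) / n                      # cluster proportion
        mu_k = Xk.mean(axis=0)                  # cluster mean
        centered = Xk - mu_k
        S_w += pi_k * (centered.T @ centered) / len(Xk)
        diff = (mu_k - overall_mean)[:, None]
        S_b += pi_k * (diff @ diff.T)
    # The pseudo-inverse guards against a singular S_w.
    return float(np.trace(np.linalg.pinv(S_w) @ S_b))

# Toy data (hypothetical): feature 0 separates two clusters, feature 1 is noise.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 100)
informative = np.where(labels == 0, -3.0, 3.0) + rng.normal(size=200)
noise = rng.normal(size=200)
X = np.column_stack([informative, noise])

score_informative = scatter_separability(X[:, [0]], labels)
score_noise = scatter_separability(X[:, [1]], labels)
score_both = scatter_separability(X, labels)
print(score_informative, score_noise, score_both)
```

With the cluster assignments held fixed, the score for both features is never lower than the score for the informative feature alone, even though the second feature is pure noise. This is the kind of dimensionality bias that, according to the abstract, motivates normalizing the criterion before comparing subsets of different sizes.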

Citation impact

Total citations: 892

FWCI: 21.25
Percentile: 100%
References: 65

Authors: 2

Topics & keywords

Feature selection
Artificial intelligence
Cluster analysis
Normalization (sociology)
Minimum redundancy feature selection
Dimensionality reduction
Pattern recognition (psychology)
Feature (linguistics)
