Simultaneous feature selection and clustering using mixture models

Michigan State University · Instituto de Telecomunicações · +1 more institution

PubMed
Indexed incrossrefpubmed

Abstract

Clustering is a common unsupervised learning technique used to discover group structure in a set of data. While there exist many algorithms for clustering, the important issue of feature selection, that is, what attributes of the data should be used by the clustering algorithms, is rarely touched upon. Feature selection for clustering is difficult because, unlike in supervised learning, there are no class labels for the data and, thus, no obvious criteria to guide the search. Another important problem in clustering is the determination of the number of clusters, which clearly impacts and is influenced by the feature selection issue. In this paper, we propose the concept of feature saliency and introduce an…

Citation impact

668
total citations
FWCI
38.85
Percentile
100%
References
82
Citations per year

Authors

3

Topics & keywords

Keywords
  • Cluster analysis
  • Artificial intelligence
  • Feature selection
  • Pattern recognition (psychology)
  • Computer science
  • Correlation clustering
  • Feature (linguistics)
  • Single-linkage clustering
No related works found for this paper.