articleApr 11, 2002Closed access

CHARM: An Efficient Algorithm for Closed Itemset Mining

Rensselaer Polytechnic Institute

Indexed incrossref

Abstract

The set of frequent closed itemsets uniquely determines the exact frequency of all itemsets, yet it can be orders of magnitude smaller than the set of all frequent itemsets. In this paper we present CHARM, an efficient algorithm for mining all frequent closed itemsets. It enumerates closed sets using a dual itemset-tidset search tree, using an efficient hybrid search that skips many levels. It also uses a technique called diffsets to reduce the memory footprint of intermediate computations. Finally it uses a fast hash-based approach to remove any “non-closed” sets found during computation. An extensive experimental evaluation on a number of real and synthetic databases shows that CHARM significantly…

Citation impact

901
total citations
FWCI
102.78
Percentile
100%
References
20
Citations per year

Authors

2

Topics & keywords

Keywords
  • Scalability
  • Computer science
  • Hash function
  • Charm (quantum number)
  • Computation
  • Set (abstract data type)
  • Trie
  • Memory footprint
No related works found for this paper.