articleIEEE Transactions on Knowledge and Data EngineeringOct 4, 2004Closed access

Mining sequential patterns by pattern-growth: the PrefixSpan approach

Simon Fraser University · University of Illinois Urbana-Champaign · +2 more institutions

Indexed incrossref

Abstract

Sequential pattern mining is an important data mining problem with broad applications. However, it is also a difficult problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Most of the previously developed sequential pattern mining methods, such as GSP, explore a candidate generation-and-test approach [R. Agrawal et al. (1994)] to reduce the number of candidates to be examined. However, this approach may not be efficient in mining large sequence databases having numerous patterns and/or long patterns. In this paper, we propose a projection-based, sequential pattern-growth approach for efficient mining of sequential patterns. In this approach,…

Citation impact

1,319
total citations
FWCI
94.45
Percentile
100%
References
42
Citations per year

Authors

8

Topics & keywords

Keywords
  • Computer science
  • Sequential Pattern Mining
  • Data mining
  • Sequence database
  • Sequence (biology)
  • A priori and a posteriori
  • GSP Algorithm
  • Set (abstract data type)
No related works found for this paper.

Funding