Mining sequential patterns by pattern-growth: the PrefixSpan approach
Simon Fraser University · University of Illinois Urbana-Champaign · +2 more institutions
Abstract
Sequential pattern mining is an important data mining problem with broad applications. However, it is also a difficult problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Most of the previously developed sequential pattern mining methods, such as GSP, explore a candidate generation-and-test approach [R. Agrawal et al. (1994)] to reduce the number of candidates to be examined. However, this approach may not be efficient in mining large sequence databases having numerous patterns and/or long patterns. In this paper, we propose a projection-based, sequential pattern-growth approach for efficient mining of sequential patterns. In this approach,…
Citation impact
- FWCI
- 94.45
- Percentile
- 100%
- References
- 42
Authors
8Topics & keywords
- Computer science
- Sequential Pattern Mining
- Data mining
- Sequence database
- Sequence (biology)
- A priori and a posteriori
- GSP Algorithm
- Set (abstract data type)