articleJun 26, 2003Closed access
gSpan: graph-based substructure pattern mining
University of Illinois Urbana-Champaign
Indexed incrossref
Abstract
We investigate new approaches for frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan (graph-based substructure pattern mining), which discovers frequent substructures without candidate generation. gSpan builds a new lexicographic order among graphs, and maps each graph to a unique minimum DFS code as its canonical label. Based on this lexicographic order gSpan adopts the depth-first search strategy to mine frequent connected subgraphs efficiently. Our performance study shows that gSpan substantially outperforms previous algorithms, sometimes by an order of magnitude.
Citation impact
2,030
total citations
- FWCI
- 180.12
- Percentile
- 100%
- References
- 15
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Lexicographical order
- Substructure
- Computer science
- Graph
- Theoretical computer science
- Data mining
- Mathematics
- Combinatorics
No related works found for this paper.