articleJun 26, 2003Closed access

gSpan: graph-based substructure pattern mining

University of Illinois Urbana-Champaign

Indexed incrossref

Abstract

We investigate new approaches for frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan (graph-based substructure pattern mining), which discovers frequent substructures without candidate generation. gSpan builds a new lexicographic order among graphs, and maps each graph to a unique minimum DFS code as its canonical label. Based on this lexicographic order gSpan adopts the depth-first search strategy to mine frequent connected subgraphs efficiently. Our performance study shows that gSpan substantially outperforms previous algorithms, sometimes by an order of magnitude.

Citation impact

2,030
total citations
FWCI
180.12
Percentile
100%
References
15
Citations per year

Authors

2

Topics & keywords

Keywords
  • Lexicographical order
  • Substructure
  • Computer science
  • Graph
  • Theoretical computer science
  • Data mining
  • Mathematics
  • Combinatorics
No related works found for this paper.