Sampling from large graphs

Leskovec, Jure; Faloutsos, Christos

doi:10.1145/1150402.1150479

articleAug 20, 2006Closed access

Sampling from large graphs

JLJure Leskovec CFChristos Faloutsos

Carnegie Mellon University

Indexed incrossref

Abstract

Given a huge real graph, how can we derive a representative sample? There are many known algorithms to compute interesting measures (shortest paths, centrality, betweenness, etc.), but several of them become impractical for large graphs. Thus graph sampling is essential.The natural questions to ask are (a) which sampling method to use, (b) how small can the sample size be, and (c) how to scale up the measurements of the sample (e.g., the diameter), to get estimates for the large graph. The deeper, underlying question is subtle: how do we measure success?.We answer the above questions, and test our answers by thorough experiments on several, diverse datasets, spanning thousands nodes and edges. We consider…

Citation impact

1,191

total citations

FWCI: 10.26
Percentile: 100%
References: 14

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Sampling (signal processing)
Betweenness centrality
Graph
Simple random sample
Theoretical computer science
Random graph
Sample (material)

UN Sustainable Development Goals

Life in Land

No related works found for this paper.