articleAug 20, 2006Closed access

Sampling from large graphs

Carnegie Mellon University

Indexed incrossref

Abstract

Given a huge real graph, how can we derive a representative sample? There are many known algorithms to compute interesting measures (shortest paths, centrality, betweenness, etc.), but several of them become impractical for large graphs. Thus graph sampling is essential.The natural questions to ask are (a) which sampling method to use, (b) how small can the sample size be, and (c) how to scale up the measurements of the sample (e.g., the diameter), to get estimates for the large graph. The deeper, underlying question is subtle: how do we measure success?.We answer the above questions, and test our answers by thorough experiments on several, diverse datasets, spanning thousands nodes and edges. We consider…

Citation impact

1,191
total citations
FWCI
10.26
Percentile
100%
References
14
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Sampling (signal processing)
  • Betweenness centrality
  • Graph
  • Simple random sample
  • Theoretical computer science
  • Random graph
  • Sample (material)
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.

Funding