Article · May 17, 2004 · Closed access

The WebGraph framework I

University of Milan

Indexed in Crossref

Abstract

Studying web graphs is often difficult due to their large size. Recently, several proposals have been published about various techniques that make it possible to store a web graph in memory in limited space by exploiting the inner redundancies of the web. The WebGraph framework is a suite of codes, algorithms and tools that aims at making it easy to manipulate large web graphs. This paper presents the compression techniques used in WebGraph, which are centred around referentiation and intervalisation (which in turn are dual to each other). WebGraph can compress the WebBase graph (118 Mnodes, 1 Glinks) in as little as 3.08 bits per link, and its transposed version in as little as 2.89 bits per link.
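The abstract names the two techniques without describing them, so the following is only a rough sketch of the general ideas, assuming sorted successor lists of integer node identifiers. Referentiation is illustrated as describing one successor list relative to another (a copy mask plus the leftover successors), and intervalisation as replacing runs of consecutive identifiers with (start, length) pairs. The function names, the `min_len` threshold and the sample lists are hypothetical and are not taken from WebGraph itself, whose actual on-disk format is not reproduced here.

```python
from typing import List, Tuple


def intervalise(successors: List[int], min_len: int = 3) -> Tuple[List[Tuple[int, int]], List[int]]:
    """Split a sorted successor list into (start, length) intervals and residuals.

    Runs of consecutive integers of at least min_len (an assumed threshold)
    become intervals; everything else is kept as a residual.
    """
    intervals: List[Tuple[int, int]] = []
    residuals: List[int] = []
    i = 0
    while i < len(successors):
        j = i
        # Extend the run while the next successor is exactly one larger.
        while j + 1 < len(successors) and successors[j + 1] == successors[j] + 1:
            j += 1
        run = j - i + 1
        if run >= min_len:
            intervals.append((successors[i], run))
        else:
            residuals.extend(successors[i:j + 1])
        i = j + 1
    return intervals, residuals


def referentiate(successors: List[int], reference: List[int]) -> Tuple[List[bool], List[int]]:
    """Describe successors relative to a reference list.

    Returns a copy mask over the reference list and the extra successors
    that do not appear in the reference.
    """
    present = set(successors)
    copy_mask = [r in present for r in reference]
    copied = {r for r, keep in zip(reference, copy_mask) if keep}
    extras = [s for s in successors if s not in copied]
    return copy_mask, extras


def reconstruct(copy_mask: List[bool], extras: List[int], reference: List[int]) -> List[int]:
    """Rebuild the original sorted successor list from its relative description."""
    copied = [r for r, keep in zip(reference, copy_mask) if keep]
    return sorted(copied + extras)


if __name__ == "__main__":
    reference = [1, 2, 3, 5, 8, 13]
    successors = [1, 2, 3, 4, 8, 20]
    mask, extras = referentiate(successors, reference)
    print(mask, extras)
    print(reconstruct(mask, extras, reference) == successors)
    print(intervalise([13, 15, 16, 17, 18, 19, 23, 24, 203]))
```

In this toy setting, similar lists yield short extras and lists with long consecutive runs yield few residuals, which is the kind of redundancy the abstract refers to. A real encoder would also choose which list to use as the reference and would write the resulting integers with compact codes; neither step is shown here.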

Citation impact

Total citations: 1,213
FWCI: 44.40
Percentile: 100%
References: 11

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Theoretical computer science
  • Suite
  • Graph