articleProceedings of the VLDB EndowmentSep 1, 2010Closed access

HaLoop

University of Washington

Indexed incrossref

Abstract

The growing demand for large-scale data mining and data analysis applications has led both industry and academia to design new types of highly scalable data-intensive computing platforms. MapReduce and Dryad are two popular platforms in which the dataflow takes the form of a directed acyclic graph of operators. These platforms lack built-in support for iterative programs, which arise naturally in many applications including data mining, web ranking, graph analysis, model fitting, and so on. This paper presents HaLoop, a modified version of the Hadoop MapReduce framework that is designed to serve these applications. HaLoop not only extends MapReduce with programming support for iterative applications, it also…

Citation impact

688
total citations
FWCI
157.69
Percentile
100%
References
13
Citations per year

Authors

4

Topics & keywords

Keywords
  • Dataflow
  • Computer science
  • Scalability
  • Directed acyclic graph
  • Graph
  • Task (project management)
  • Big data
  • Programming paradigm
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.

Funding