articleOct 11, 2009Closed access

Quincy

Microsoft (United States)

Indexed incrossref

Abstract

This paper addresses the problem of scheduling concurrent jobs on clusters where application data is stored on the computing nodes. This setting, in which scheduling computations close to their data is crucial for performance, is increasingly common and arises in systems such as MapReduce, Hadoop, and Dryad as well as many grid-computing environments. We argue that data-intensive computation benefits from a fine-grain resource sharing model that differs from the coarser semi-static resource allocations implemented by most existing cluster computing architectures. The problem of scheduling with locality and fairness constraints has not previously been extensively studied under this resource-sharing model.

Citation impact

852
total citations
FWCI
114.34
Percentile
100%
References
34
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Distributed computing
  • Locality
  • Scheduling (production processes)
  • Grid computing
  • Computation
  • Shared resource
  • Grid
No related works found for this paper.