articleOct 11, 2009Closed access
Quincy
Indexed incrossref
Abstract
This paper addresses the problem of scheduling concurrent jobs on clusters where application data is stored on the computing nodes. This setting, in which scheduling computations close to their data is crucial for performance, is increasingly common and arises in systems such as MapReduce, Hadoop, and Dryad as well as many grid-computing environments. We argue that data-intensive computation benefits from a fine-grain resource sharing model that differs from the coarser semi-static resource allocations implemented by most existing cluster computing architectures. The problem of scheduling with locality and fairness constraints has not previously been extensively studied under this resource-sharing model.
Citation impact
852
total citations
- FWCI
- 114.34
- Percentile
- 100%
- References
- 34
Citations per year
Authors
6Topics & keywords
Topics
Keywords
- Computer science
- Distributed computing
- Locality
- Scheduling (production processes)
- Grid computing
- Computation
- Shared resource
- Grid
No related works found for this paper.