preprintAug 14, 2019GOLD OA

Learning scheduling algorithms for data processing clusters

Tsinghua University

Indexed incrossref

Abstract

Efficiently scheduling data processing jobs on distributed compute clusters requires complex algorithms. Current systems use simple, generalized heuristics and ignore workload characteristics, since developing and tuning a scheduling policy for each workload is infeasible. In this paper, we show that modern machine learning techniques can generate highly-efficient policies automatically.

Citation impact

656
total citations
FWCI
62.06
Percentile
100%
References
76
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Heuristics
  • Workload
  • Scheduling (production processes)
  • Distributed computing
  • Fair-share scheduling
  • Dynamic priority scheduling
  • Rate-monotonic scheduling
No related works found for this paper.

Funding