articleNeural Information Processing SystemsDec 4, 2017Closed access

Machine learning with adversaries: byzantine tolerant gradient descent

École Polytechnique Fédérale de Lausanne

Abstract

We study the resilience to Byzantine failures of distributed implementations of Stochastic Gradient Descent (SGD). So far, distributed machine learning frameworks have largely ignored the possibility of failures, especially arbitrary (i.e., Byzantine) ones. Causes of failures include software bugs, network asynchrony, biases in local datasets, as well as attackers trying to compromise the entire system. Assuming a set of n workers, up to f being Byzantine, we ask how resilient can SGD be, without limiting the dimension, nor the size of the parameter space. We first show that no gradient aggregation rule based on a linear combination of the vectors proposed by the workers (i.e, current approaches) tolerates a…

Citation impact

985
total citations
FWCI
23.12
Percentile
100%
References
25
Citations per year

Authors

4

Topics & keywords

Keywords
  • Byzantine fault tolerance
  • Computer science
  • Resilience (materials science)
  • Quantum Byzantine agreement
  • Stochastic gradient descent
  • Asynchrony (computer programming)
  • Convergence (economics)
  • Byzantine architecture
No related works found for this paper.