preprintUvA-DARE (University of Amsterdam)Dec 22, 2014GREEN OA

Adam: A Method for Stochastic Optimization

University of Amsterdam · University of Toronto

Indexed inarxivdatacite

Abstract

We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the…

Citation impact

84,665
total citations
FWCI
Percentile
References
20
Citations per year

Authors

2

Topics & keywords

Keywords
  • Regret
  • Mathematical optimization
  • Computer science
  • Diagonal
  • Convergence (economics)
  • Stochastic optimization
  • Rate of convergence
  • Optimization problem
No related works found for this paper.