articleNeural Information Processing SystemsDec 5, 2016Closed access

Weight normalization: a simple reparameterization to accelerate training of deep neural networks

OpenAI (United States)

Abstract

We present weight normalization: a reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction. By reparameterizing the weights in this way we improve the conditioning of the optimization problem and we speed up convergence of stochastic gradient descent. Our reparameterization is inspired by batch normalization but does not introduce any dependencies between the examples in a minibatch. This means that our method can also be applied successfully to recurrent models such as LSTMs and to noise-sensitive applications such as deep reinforcement learning or generative models, for which batch normalization is less well suited. Although our…

Citation impact

715
total citations
FWCI
45.58
Percentile
100%
References
26
Citations per year

Authors

2

Topics & keywords

Keywords
  • Normalization (sociology)
  • Computer science
  • Artificial intelligence
  • Reinforcement learning
  • Artificial neural network
  • Stochastic gradient descent
  • Generative grammar
  • Gradient descent
No related works found for this paper.