articleJun 21, 2010Closed access

Deep learning via Hessian-free optimization

University of Toronto

Abstract

We develop a 2 nd-order optimization method based on the “Hessian-free ” approach, and apply it to training deep auto-encoders. Without using pre-training, we obtain results superior to those reported by Hinton & Salakhutdinov (2006) on the same tasks they considered. Our method is practical, easy to use, scales nicely to very large datasets, and isn’t limited in applicability to autoencoders, or any specific model class. We also discuss the issue of “pathological curvature ” as a possible explanation for the difficulty of deeplearning and how 2 nd-order optimization, and our method in particular, effectively deals with it. 1.

Citation impact

715
total citations
FWCI
25.54
Percentile
100%
References
10
Citations per year

Authors

1

Topics & keywords

Keywords
  • Hessian matrix
  • Artificial intelligence
  • Computer science
  • Deep learning
  • Autoencoder
  • Class (philosophy)
  • Encoder
  • Curvature
No related works found for this paper.