articleJun 21, 2010Closed access
Deep learning via Hessian-free optimization
Abstract
We develop a 2 nd-order optimization method based on the “Hessian-free ” approach, and apply it to training deep auto-encoders. Without using pre-training, we obtain results superior to those reported by Hinton & Salakhutdinov (2006) on the same tasks they considered. Our method is practical, easy to use, scales nicely to very large datasets, and isn’t limited in applicability to autoencoders, or any specific model class. We also discuss the issue of “pathological curvature ” as a possible explanation for the difficulty of deeplearning and how 2 nd-order optimization, and our method in particular, effectively deals with it. 1.
Citation impact
715
total citations
- FWCI
- 25.54
- Percentile
- 100%
- References
- 10
Citations per year
Authors
1Topics & keywords
Topics
Keywords
- Hessian matrix
- Artificial intelligence
- Computer science
- Deep learning
- Autoencoder
- Class (philosophy)
- Encoder
- Curvature
No related works found for this paper.