Deep Sparse Rectifier Neural Networks
Département d'Informatique · Heuristics and Diagnostics for Complex Systems
Abstract
While logistic sigmoid neurons are more biologically plausible than hyperbolic tangent neurons, the latter work better for training multi-layer neural networks. This paper shows that rectifying neurons are an even better model of biological neurons and yield equal or better performance than hyperbolic tangent networks in spite of the hard non-linearity and non-differentiability at zero, creating sparse representations with true zeros, which seem remarkably suitable for naturally sparse data. Even though they can take advantage of semi-supervised setups with extra-unlabeled data, deep rectifier networks can reach their best performance without requiring any unsupervised pre-training on purely supervised tasks…
Citation impact
- FWCI
- 87.99
- Percentile
- 100%
- References
- 36
Authors
3Topics & keywords
- Sigmoid function
- Rectifier (neural networks)
- Tangent
- Artificial neural network
- Hyperbolic function
- Computer science
- Yield (engineering)
- Layer (electronics)