Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
Johannes Kepler University of Linz
Indexed inarxivdatacite
Abstract
We introduce the "exponential linear unit" (ELU) which speeds up learning in deep neural networks and leads to higher classification accuracies. Like rectified linear units (ReLUs), leaky ReLUs (LReLUs) and parametrized ReLUs (PReLUs), ELUs alleviate the vanishing gradient problem via the identity for positive values. However, ELUs have improved learning characteristics compared to the units with other activation functions. In contrast to ReLUs, ELUs have negative values which allows them to push mean unit activations closer to zero like batch normalization but with lower computational complexity. Mean shifts toward zero speed up learning by bringing the normal gradient closer to the unit natural gradient…
Citation impact
2,316
total citations
- FWCI
- —
- Percentile
- —
- References
- 41
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Normalization (sociology)
- Computer science
- Exponential function
- Artificial intelligence
- Generalization
- Algorithm
- Mathematics
No related works found for this paper.