Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)

Clevert, Djork-Arné; Unterthiner, Thomas; Hochreiter, Sepp

doi:10.48550/arxiv.1511.07289

Preprint, arXiv (Cornell University), Nov 23, 2015, Green OA

Djork-Arné Clevert, Thomas Unterthiner, Sepp Hochreiter

Johannes Kepler University of Linz

Indexed in: arXiv, DataCite

Abstract

We introduce the "exponential linear unit" (ELU) which speeds up learning in deep neural networks and leads to higher classification accuracies. Like rectified linear units (ReLUs), leaky ReLUs (LReLUs) and parametrized ReLUs (PReLUs), ELUs alleviate the vanishing gradient problem via the identity for positive values. However, ELUs have improved learning characteristics compared to the units with other activation functions. In contrast to ReLUs, ELUs have negative values which allows them to push mean unit activations closer to zero like batch normalization but with lower computational complexity. Mean shifts toward zero speed up learning by bringing the normal gradient closer to the unit natural gradient…
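
The abstract excerpt describes the ELU in words but does not give its formula. As a minimal sketch, the block below assumes the commonly used definition, f(x) = x for x > 0 and f(x) = α(exp(x) − 1) for x ≤ 0. The default α = 1.0, the helper names (`elu`, `relu`, `mean`), and the sample inputs are assumptions made for illustration, not values taken from this record. It compares the mean activation of ReLU and ELU on the same inputs to illustrate the claim that negative outputs push the mean closer to zero.

```python
import math


def elu(x: float, alpha: float = 1.0) -> float:
    """ELU under the assumed definition: identity for x > 0,
    alpha * (exp(x) - 1) for x <= 0 (saturates toward -alpha)."""
    # math.expm1(x) computes exp(x) - 1 accurately for small |x|.
    return x if x > 0 else alpha * math.expm1(x)


def relu(x: float) -> float:
    """ReLU: identity for positive inputs, zero otherwise."""
    return max(0.0, x)


def mean(values):
    """Arithmetic mean of a non-empty list of floats."""
    return sum(values) / len(values)


# Hypothetical pre-activations, symmetric around zero (illustration only).
inputs = [-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0]

relu_mean = mean([relu(v) for v in inputs])
elu_mean = mean([elu(v) for v in inputs])

print(f"mean ReLU activation: {relu_mean:.3f}")
print(f"mean ELU activation:  {elu_mean:.3f}")
```

On these symmetric inputs the ELU mean is smaller than the ReLU mean, because ReLU maps every negative input to zero while ELU keeps a bounded negative output. This toy comparison only illustrates the mean-shift argument; it does not reproduce the paper's experiments.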

Citation impact

Total citations: 2,316

FWCI: —
Percentile: —
References: 41

Authors: 3

Topics & keywords

Keywords

Normalization (sociology)
Computer science
Exponential function
Artificial intelligence
Generalization
Algorithm
Mathematics
