Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Microsoft Research (United Kingdom) · Microsoft (United States) · +1 more institution
Abstract
Rectified activation units (rectifiers) are essential for state-of-the-art neural networks. In this work, we study rectifier neural networks for image classification from two aspects. First, we propose a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit. PReLU improves model fitting with nearly zero extra computational cost and little overfitting risk. Second, we derive a robust initialization method that particularly considers the rectifier nonlinearities. This method enables us to train extremely deep rectified models directly from scratch and to investigate deeper or wider network architectures. Based on the learnable activation and advanced initialization, we achieve…
Citation impact
- FWCI
- 574.38
- Percentile
- 100%
- References
- 62
Authors
4- KHKaiming HeCorresponding
Microsoft Research (United Kingdom), Microsoft (United States)
- XZXiangyu Zhang
Microsoft Research (United Kingdom), Xi'an Jiaotong University
- SRShaoqing Ren
Microsoft (United States), Microsoft Research (United Kingdom)
- JSJian Sun
Microsoft Research (United Kingdom), Microsoft (United States)
Topics & keywords
- Overfitting
- Initialization
- Computer science
- Scratch
- Artificial intelligence
- Rectifier (neural networks)
- Artificial neural network
- Contextual image classification