Mish: A Self Regularized Non-Monotonic Neural Activation Function
Abstract
The concept of non-linearity in a Neural Network is introduced by an activation function which serves an integral role in the training and performance evaluation of the network. Over the years of theoretical research, many activation functions have been proposed, however, only a few are widely used in mostly all applications which include ReLU (Rectified Linear Unit), TanH (Tan Hyperbolic), Sigmoid, Leaky ReLU and Swish. In this work, a novel neural activation function called as Mish is proposed. The experiments show that Mish tends to work better than both ReLU and Swish along with other standard activation functions in many deep networks across challenging datasets. For instance, in Squeeze Excite Net- 18…
Citation impact
492
total citations
- FWCI
- 53.89
- Percentile
- 100%
- References
- 0
Citations per year
Authors
1Topics & keywords
Topics
Keywords
- Activation function
- Sigmoid function
- Artificial neural network
- Computer science
- Function (biology)
- Artificial intelligence
- Monotonic function
- Similarity (geometry)
No related works found for this paper.