Recent Advances in Stochastic Gradient Descent in Deep Learning

Tian, Yingjie; Zhang, Yuqi; Zhang, Haibin

doi:10.3390/math11030682

articleMathematicsJan 29, 2023GOLD OA

Recent Advances in Stochastic Gradient Descent in Deep Learning

YTYingjie Tian YZYuqi Zhang HZHaibin Zhang

Chinese Academy of Sciences · Beijing Institute of Big Data Research · +2 more institutions

Indexed incrossrefdoaj

Abstract

In the age of artificial intelligence, the best approach to handling huge amounts of data is a tremendously motivating and hard problem. Among machine learning models, stochastic gradient descent (SGD) is not only simple but also very effective. This study provides a detailed analysis of contemporary state-of-the-art deep learning applications, such as natural language processing (NLP), visual data processing, and voice and audio processing. Following that, this study introduces several versions of SGD and its variant, which are already in the PyTorch optimizer, including SGD, Adagrad, adadelta, RMSprop, Adam, AdamW, and so on. Finally, we propose theoretical conditions under which these methods are applicable…

Citation impact

237

total citations

FWCI: 39.26
Percentile: 100%
References: 114

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Stochastic gradient descent
Computer science
Artificial intelligence
Deep learning
Bridge (graph theory)
Machine learning
Gradient descent
Simple (philosophy)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.

Funding

NN
National Natural Science Foundation of China
Awards: 12071458, 71731009