Gradient Descent Algorithm Survey

2511.20725v1 cs.LG, cs.AI 2025-11-27
Authors:

Deng Fucheng, Wang Wanjie, Gong Ao, Wang Xiaoqi, Wang Fan

Abstract

Motivated by the practical configuration needs of optimization algorithms in deep learning, this article examines five major algorithms: SGD, Mini-batch SGD, Momentum, Adam, and Lion. It systematically analyzes the core advantages and limitations of each algorithm and offers key practical recommendations. The aim is to build an in-depth understanding of these algorithms and to provide a standardized reference for selecting and tuning optimizers, and for improving their performance, in both academic research and engineering practice, helping to address optimization challenges across models of different scales and a variety of training scenarios.
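The abstract names the five optimizers without stating their update rules. As a rough orientation only, and not taken from the paper itself, the commonly cited single-step updates can be sketched for one scalar parameter as below. The function names, the default hyperparameters, and the toy gradient are all illustrative assumptions; SGD and Mini-batch SGD share the same step and differ only in how many samples the gradient `g` is averaged over.

```python
import math


def sign(x):
    # Returns -1, 0, or 1 depending on the sign of x.
    return (x > 0) - (x < 0)


def sgd_step(w, g, lr=0.1):
    # Plain (mini-batch) SGD: g is the gradient of one sample or a batch average.
    return w - lr * g


def momentum_step(w, g, v, lr=0.1, beta=0.9):
    # Heavy-ball momentum: accumulate a velocity, then step along it.
    v = beta * v + g
    return w - lr * v, v


def adam_step(w, g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    # Adam: bias-corrected first and second moment estimates (t starts at 1).
    m = beta1 * m + (1 - beta1) * g
    v = beta2 * v + (1 - beta2) * g * g
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    return w - lr * m_hat / (math.sqrt(v_hat) + eps), m, v


def lion_step(w, g, m, lr=1e-4, beta1=0.9, beta2=0.99, weight_decay=0.0):
    # Lion: step by the sign of an interpolated momentum, then update momentum.
    update = sign(beta1 * m + (1 - beta1) * g)
    w = w - lr * (update + weight_decay * w)
    m = beta2 * m + (1 - beta2) * g
    return w, m


if __name__ == "__main__":
    # Toy objective f(w) = 0.5 * (w - 3)^2, so the gradient at w is (w - 3).
    w, v = 0.0, 0.0
    for _ in range(50):
        w, v = momentum_step(w, w - 3.0, v)
    print(f"momentum after 50 steps: w = {w:.4f}")
```

On its first step Adam moves by roughly `lr` regardless of gradient scale, and Lion always moves by exactly `lr` per coordinate when weight decay is zero, which is one reason the two are usually run with quite different learning rates.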
