📞 +91-7667918914 | ✉️ iarjset@gmail.com
International Advanced Research Journal in Science, Engineering and Technology
International Advanced Research Journal in Science, Engineering and Technology A Monthly Peer-Reviewed Multidisciplinary Journal
ISSN Online 2393-8021ISSN Print 2394-1588Since 2014
IARJSET aligns to the suggestive parameters by the latest University Grants Commission (UGC) for peer-reviewed journals, committed to promoting research excellence, ethical publishing practices, and a global scholarly impact.
← Back to VOLUME 10, ISSUE 11, NOVEMBER 2023

A Comprehensive Overview of Gradient Descent and its Optimization Algorithms

Atharva Tapkir

👁 1 view📥 0 downloads
Share: 𝕏 f in

Abstract: This study explores machine learning gradient-based optimization algorithms, highlighting the critical importance of gradient descent and investigating adaptive strategies to improve its performance. The fundamental technique of optimization is gradient descent, although balancing convergence speed and accuracy can be difficult due to gradient descent's reliance on fixed learning rates. The study explores a variety of adaptive learning techniques, including as drop, decay, cyclic learning, and adaptive learning. These techniques are designed to modify learning rates in real-time during optimization, hence affecting stability and convergence. Additionally, the research delves into momentum-based methods like Adam, RMSProp, AdaGrad, and AdaDelta, clarifying their use in reducing the difficulties associated with traditional gradient descent. The study also clarifies gradient clipping methods, addressing the problem of expanding gradients and offering solutions to stabilize and enhance machine learning models. The goal of this thorough investigation is to provide practitioners with a sophisticated grasp of optimization techniques so they may guide machine learning models toward effectiveness, precision, and robustness in a variety of application domains.

Keywords: Gradient Descent, Optimizations, Learning Rate, Adam, Neural Networks

How to Cite:

[1] Atharva Tapkir, “A Comprehensive Overview of Gradient Descent and its Optimization Algorithms,” International Advanced Research Journal in Science, Engineering and Technology (IARJSET), DOI: 10.17148/IARJSET.2023.101106

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.