Javascript must be enabled to continue!
RLion: A Refined Lion Optimizer for Deep Learning
View through CrossRef
Abstract
Optimization algorithms play a fundamental role in training neural networks. The optimizer focuses on the updating weights ofmomentum and velocity on learning rates and losses, furthermore the complexity of the optimizer and the quantity of updatedparameters are considered. In this paper, a RLion(Refined Lion Optimizer) based on the Lion optimizer is denoted with ascaleable factor α and arctan operation to denote the update rule θt = θt−1 − ηt ( 2π arctan(α ∗ ˆmt ) + λ θt−1). arctan is continousmonotonic function and its expectation, variation are less than those of sign, also the θ ’s fluctuation of RLion is less than that ofLion. The higher α, the convey faster. The RLion is able to smooth out the fluctuations, converge faster and more reliable.The FasterNet, EfficientNetV2 and the YOLO_V8 with ImageNet1k dataset are trained without warm up for classificationleveraging the RLion optimizer. Object detection with Vision Transformers on Caltech 101 dataset and the DeepLabV3+ forsemantic segmentation on camera data are trained with AdamW, Lion and RLion optimizer too. Compared to the AdamWand Lion optimizer, the loss and accuracy present the RLion can promote the validation accuracy about 0 ∼ +20% higher thanAdamW on many models even the learning rate is as high as AdamW. The RLion has better convergence performance andversatility.
Title: RLion: A Refined Lion Optimizer for Deep Learning
Description:
Abstract
Optimization algorithms play a fundamental role in training neural networks.
The optimizer focuses on the updating weights ofmomentum and velocity on learning rates and losses, furthermore the complexity of the optimizer and the quantity of updatedparameters are considered.
In this paper, a RLion(Refined Lion Optimizer) based on the Lion optimizer is denoted with ascaleable factor α and arctan operation to denote the update rule θt = θt−1 − ηt ( 2π arctan(α ∗ ˆmt ) + λ θt−1).
arctan is continousmonotonic function and its expectation, variation are less than those of sign, also the θ ’s fluctuation of RLion is less than that ofLion.
The higher α, the convey faster.
The RLion is able to smooth out the fluctuations, converge faster and more reliable.
The FasterNet, EfficientNetV2 and the YOLO_V8 with ImageNet1k dataset are trained without warm up for classificationleveraging the RLion optimizer.
Object detection with Vision Transformers on Caltech 101 dataset and the DeepLabV3+ forsemantic segmentation on camera data are trained with AdamW, Lion and RLion optimizer too.
Compared to the AdamWand Lion optimizer, the loss and accuracy present the RLion can promote the validation accuracy about 0 ∼ +20% higher thanAdamW on many models even the learning rate is as high as AdamW.
The RLion has better convergence performance andversatility.
Related Results
Deep ocular tumor classification model using cuckoo search algorithm and Caputo fractional gradient descent
Deep ocular tumor classification model using cuckoo search algorithm and Caputo fractional gradient descent
While digital ocular fundus images are commonly used for diagnosing ocular tumors, interpreting these images poses challenges due to their complexity and the subtle features specif...
LION FIGURES AND ICONOGRAPHY ON THE DOOR KNOCKERS OF HOCA AHMED YESEVI TOMB
LION FIGURES AND ICONOGRAPHY ON THE DOOR KNOCKERS OF HOCA AHMED YESEVI TOMB
By blending old Turkish beliefs with Islam, Hodja Ahmet Yesevi ensured the formation of an understanding of Sufism based on Turkish wisdom, love of Allah, tolerance and human love....
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in ...
Alzheimer’s Disease Detection in Various Brain Anatomies Based on Optimized Vision Transformer
Alzheimer’s Disease Detection in Various Brain Anatomies Based on Optimized Vision Transformer
Alzheimer’s disease (AD) is a progressive neurodegenerative disorder and a growing public health concern. Despite significant advances in deep learning for medical image analysis, ...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
Enhancing Non-Formal Learning Certificate Classification with Text Augmentation: A Comparison of Character, Token, and Semantic Approaches
Enhancing Non-Formal Learning Certificate Classification with Text Augmentation: A Comparison of Character, Token, and Semantic Approaches
Aim/Purpose: The purpose of this paper is to address the gap in the recognition of prior learning (RPL) by automating the classification of non-formal learning certificates using d...
Effect of Learning Management Using Problem-based Learning on Fine Arts Basic Ability of Freshmen in Suzhou Arts and Design Institute, The People’s Republic of China
Effect of Learning Management Using Problem-based Learning on Fine Arts Basic Ability of Freshmen in Suzhou Arts and Design Institute, The People’s Republic of China
Background and Aim: Learning Management Using Problem-Based Learning students can have better development of creativity, the ability to apply in real-world situations, aesthetic ap...
Recent metaheuristic algorithms for solving some civil engineering optimization problems
Recent metaheuristic algorithms for solving some civil engineering optimization problems
Abstract
In this study, a novel hybrid metaheuristic algorithm, termed (BES–GO), is proposed for solving benchmark structural design optimization problems, including weld...

