RLion: A Refined Lion Optimizer for Deep Learning

Abstract: Optimization algorithms play a fundamental role in training neural networks. Optimizer design focuses on how the momentum and velocity terms are weighted in the update with respect to learning rates and losses; the complexity of the optimizer and the number of parameters it updates are also considered. In this paper, RLion (Refined Lion Optimizer), based on the Lion optimizer, is defined with a scalable factor α and an arctan operation, giving the update rule $\theta_t = \theta_{t-1} - \eta_t \left( \frac{2}{\pi} \arctan(\alpha \hat{m}_t) + \lambda \theta_{t-1} \right)$. The arctan function is continuous and monotonic, and its expectation and variance are smaller than those of sign; the fluctuation of θ under RLion is likewise smaller than under Lion. The larger α is, the faster the convergence. RLion smooths out fluctuations and converges faster and more reliably. FasterNet, EfficientNetV2 and YOLO_V8 are trained for classification on the ImageNet1k dataset, without warm-up, using the RLion optimizer. Object detection with Vision Transformers on the Caltech 101 dataset and DeepLabV3+ for semantic segmentation on camera data are also trained with the AdamW, Lion and RLion optimizers. Compared with the AdamW and Lion optimizers, the loss and accuracy results show that RLion can raise validation accuracy by about 0 to +20% over AdamW on many models, even when the learning rate is as high as that used for AdamW. RLion has better convergence performance and versatility.
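The update rule in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The abstract does not define how the momentum estimate m̂_t is computed, so the sketch assumes Lion's usual interpolation (β1 for the update direction, β2 for the stored momentum). The function names `smooth_sign` and `rlion_step` and the default `alpha=10.0` are hypothetical choices made for illustration.

```python
import math


def smooth_sign(x, alpha):
    """(2/pi) * arctan(alpha * x): a continuous stand-in for sign(x), bounded in (-1, 1)."""
    return (2.0 / math.pi) * math.atan(alpha * x)


def rlion_step(theta, grad, m, lr, alpha=10.0, weight_decay=0.0,
               beta1=0.9, beta2=0.99):
    """One RLion-style update on plain lists of floats.

    Assumption (not stated in the abstract): m_hat is Lion's interpolated
    momentum beta1 * m + (1 - beta1) * grad, and the stored momentum is
    refreshed afterwards with beta2, as in Lion.
    """
    new_theta, new_m = [], []
    for th, g, mom in zip(theta, grad, m):
        m_hat = beta1 * mom + (1.0 - beta1) * g
        # theta_t = theta_{t-1} - lr * ((2/pi) * arctan(alpha * m_hat) + lambda * theta_{t-1})
        update = smooth_sign(m_hat, alpha) + weight_decay * th
        new_theta.append(th - lr * update)
        new_m.append(beta2 * mom + (1.0 - beta2) * g)
    return new_theta, new_m


# Toy usage: a few steps on f(theta) = sum(theta_i ** 2), whose gradient is 2 * theta.
theta, m = [1.0, -2.0], [0.0, 0.0]
for _ in range(5):
    grad = [2.0 * th for th in theta]
    theta, m = rlion_step(theta, grad, m, lr=0.01)
print(theta)
```

As α grows, `smooth_sign` approaches the hard sign used by Lion, so α acts as a knob between a smoothed update and Lion's fixed-magnitude update. For small α·m̂_t the step shrinks with the momentum, which is consistent with the abstract's claim of reduced fluctuation.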

Springer Science and Business Media LLC

Jian Rong ChenHao Ma QingHui Zhang Yong Cao

2024

