
Enhancing deep neural network training efficiency and performance through linear prediction

Abstract: Deep neural networks have achieved remarkable success in various fields. However, training an effective deep neural network still poses challenges. This paper proposes a method to make the training of deep neural networks more effective, with the goal of improving their performance. First, based on the observation that the parameters (weights and biases) of a deep neural network change according to certain regular patterns during training, the potential of parameter prediction for improving training efficiency is identified. Second, it is shown that parameter prediction can also improve the performance of a deep neural network through the noise injection introduced by prediction errors. Then, taking the relevant limitations into account, a Parameters Linear Prediction method for deep neural networks is developed. Finally, performance and hyperparameter sensitivity validations are carried out on several representative backbones. Experimental results show that, compared with SGD, the proposed Parameters Linear Prediction method yields an increase of approximately 1% in accuracy for the optimal model, along with a reduction of about 0.01 in top-1/top-5 error. Moreover, it exhibits stable performance under various hyperparameter settings, demonstrating the effectiveness of the proposed method and validating its capacity to enhance training efficiency and performance.
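
The abstract does not state the prediction rule itself, so the following is only a minimal sketch of what linear parameter prediction could look like, assuming each parameter is extrapolated along the straight line through its two most recent values. The function name `linear_predict`, the `horizon` argument, and the toy parameter values are hypothetical illustrations and are not taken from the paper.

```python
from typing import Dict, List

# Hypothetical container: parameter name -> flat list of values.
Params = Dict[str, List[float]]


def linear_predict(prev: Params, curr: Params, horizon: int = 1) -> Params:
    """Extrapolate every parameter `horizon` steps ahead.

    Assumption (not from the paper): each value continues along the
    line through its previous and current value, i.e.
    predicted = curr + horizon * (curr - prev).
    """
    predicted: Params = {}
    for name, curr_values in curr.items():
        prev_values = prev[name]
        predicted[name] = [
            c + horizon * (c - p) for p, c in zip(prev_values, curr_values)
        ]
    return predicted


# Toy snapshots of the same parameters at two consecutive training steps.
prev = {"weight": [0.5, -0.25], "bias": [0.125]}
curr = {"weight": [0.75, -0.5], "bias": [0.125]}

predicted = linear_predict(prev, curr, horizon=2)
print(predicted)
```

In a training loop, such predicted values would be written back into the model in place of some ordinary optimizer steps. Any gap between the predicted and the actual trajectory then acts as the kind of noise injection the abstract refers to.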

Springer Science and Business Media LLC

Hejie Ying Mengmeng Song Yaohong Tang Shungen Xiao Zimin Xiao

Scientific Reports

2024




