Enhancing deep neural network training efficiency and performance through linear prediction
Abstract: Deep neural networks have achieved remarkable success in various fields. However, training an effective deep neural network still poses challenges. This paper proposes a method to optimize the training of deep neural networks, with the goal of improving their performance. First, based on the observation that the parameters (weights and biases) of a deep neural network change according to certain rules during training, the potential of parameter prediction for improving training efficiency is identified. Second, the potential of parameter prediction to improve network performance through the noise injected by prediction errors is revealed. Then, taking the limitations into account, a Parameters Linear Prediction method for deep neural networks is developed. Finally, performance and hyperparameter sensitivity validations are carried out on several representative backbones. Experimental results show that, compared with SGD, the proposed Parameters Linear Prediction method yields an increase of approximately 1% in accuracy for the optimal model, along with a reduction of about 0.01 in top-1/top-5 error. Moreover, it exhibits stable performance under various hyperparameter settings, demonstrating the effectiveness of the proposed method and validating its capacity to enhance training efficiency and performance.
Springer Science and Business Media LLC
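The abstract does not give the prediction rule itself, so the following is only a minimal sketch of the general idea, under stated assumptions: parameters are extrapolated along their most recent change, using two consecutive snapshots. The function names, the `horizon` and `predict_every` parameters, the toy linear model, and the use of full-batch gradient descent in place of SGD are all hypothetical choices made for illustration, not the authors' implementation.

```python
def linear_predict(prev_params, curr_params, horizon=1.0):
    """Extrapolate each parameter along its most recent change.

    Assumed rule (not taken from the paper):
        predicted = current + horizon * (current - previous)
    """
    return [c + horizon * (c - p) for p, c in zip(prev_params, curr_params)]


def gradient(params, data):
    """Gradient of the mean squared error for the toy model y = w * x + b."""
    w, b = params
    n = len(data)
    grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
    return [grad_w, grad_b]


def loss(params, data):
    """Mean squared error of the toy model on the data."""
    w, b = params
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


def train(data, steps=50, lr=0.05, predict_every=0, horizon=1.0):
    """Full-batch gradient descent with an optional linear prediction step.

    predict_every=0 disables prediction (plain gradient descent).
    Otherwise, after every `predict_every`-th update the parameters are
    replaced by their linear extrapolation from the last two snapshots.
    """
    params = [0.0, 0.0]
    for step in range(1, steps + 1):
        prev = list(params)  # snapshot before the update
        grads = gradient(params, data)
        params = [p - lr * g for p, g in zip(params, grads)]
        if predict_every and step % predict_every == 0:
            params = linear_predict(prev, params, horizon)
    return params


# Toy data drawn from y = 2x + 1 (hypothetical, for illustration only).
data = [(float(x), 2.0 * x + 1.0) for x in range(-2, 3)]
plain = train(data)
predicted = train(data, predict_every=5)
print("loss without prediction:", loss(plain, data))
print("loss with prediction:   ", loss(predicted, data))
```

On this smooth toy problem the extrapolation simply takes a longer step in a direction that is already productive. The noise-injection effect from prediction errors that the abstract mentions does not appear in such a simple setting.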
Related Results
Deep convolutional neural network and IoT technology for healthcare
Background: Deep learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in ...
Adaptive hybrid potential evapotranspiration (PET) prediction method based on automatic machine learning
Abstract
In arid areas, estimation of crop water demand through potential evapotranspiration (PET) forecast has a guiding effect on water-saving irrigation, to cope with th...
Prediction using Machine Learning
This chapter begins with a concise introduction to machine learning and the
classification of machine learning systems (supervised learning, unsupervised learning,
and reinforcemen...
Inversion using adaptive physics‐based neural network: Application to magnetotelluric inversion
Abstract: A new trend to solve geophysical problems aims to combine the advantages of deterministic inversion with neural network inversion. The neural networks applied to geophysica...
Fuzzy Chaotic Neural Networks
An understanding of the human brain’s local function has improved in recent years. But the cognition of human brain’s working process as a whole is still obscure. Both fuzzy logic ...
Deep Neural Networks for Human’s Fall-risk Prediction using Force-Plate Time Series Signal
Abstract: Early and accurate identification of the balance deficits could reduce falls, in particular for older adults, a prone population. Our work investigates deep neural networks...
Prediction of 131i Therapeutic Dose and Prognosis in Hyperthyroidism Patients Using Mechanical Learning Model
Abstract
Objective: Multiple mechanical learning models were used to predict the therapeutic dose of 131I radionuclide in patients with hyperthyroidism, and to compare the ca...
Traffic Prediction in 5G Networks Using Machine Learning
The advent of 5G technology promises a paradigm shift in the realm of
telecommunications, offering unprecedented speeds and connectivity. However, the
...

