Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Training Pi-Sigma Network by Online Gradient Algorithm with Penalty for Small Weight Update

View through CrossRef
A pi-sigma network is a class of feedforward neural networks with product units in the output layer. An online gradient algorithm is the simplest and most often used training method for feedforward neural networks. But there arises a problem when the online gradient algorithm is used for pi-sigma networks in that the update increment of the weights may become very small, especially early in training, resulting in a very slow convergence. To overcome this difficulty, we introduce an adaptive penalty term into the error function, so as to increase the magnitude of the update increment of the weights when it is too small. This strategy brings about faster convergence as shown by the numerical experiments carried out in this letter.
Title: Training Pi-Sigma Network by Online Gradient Algorithm with Penalty for Small Weight Update
Description:
A pi-sigma network is a class of feedforward neural networks with product units in the output layer.
An online gradient algorithm is the simplest and most often used training method for feedforward neural networks.
But there arises a problem when the online gradient algorithm is used for pi-sigma networks in that the update increment of the weights may become very small, especially early in training, resulting in a very slow convergence.
To overcome this difficulty, we introduce an adaptive penalty term into the error function, so as to increase the magnitude of the update increment of the weights when it is too small.
This strategy brings about faster convergence as shown by the numerical experiments carried out in this letter.

Related Results

L᾽«unilinguisme» officiel de Constantinople byzantine (VIIe-XIIe s.)
L᾽«unilinguisme» officiel de Constantinople byzantine (VIIe-XIIe s.)
&nbsp; <p>&Nu;ί&kappa;&omicron;&sigmaf; &Omicron;&iota;&kappa;&omicron;&nu;&omicron;&mu;ί&delta;&eta;&sigmaf;</...
Detection of Hidden Structures in Nonstationary Spike Trains
Detection of Hidden Structures in Nonstationary Spike Trains
We propose an algorithm for simultaneously estimating state transitions among neural states and nonstationary firing rates using a switching state-space model (SSSM). This algorith...
The Brightness-Weight Illusion
The Brightness-Weight Illusion
Bigger objects look heavier than smaller but otherwise identical objects. When hefted as well as seen, however, bigger objects feel lighter (the size-weight illusion), confirming t...
Optimizing Random Forests: Spark Implementations of Random Genetic Forests
Optimizing Random Forests: Spark Implementations of Random Genetic Forests
The Random Forest (RF) algorithm, originally proposed by Breiman [7], is a widely used machine learning algorithm that gains its merit from its fast learning speed as well as high ...
Forewarned is forearmed: The brave new world of (Creative) Writing online
Forewarned is forearmed: The brave new world of (Creative) Writing online
Online Writing courses, including Creative Writing programs, have been delivered in Australia for more than a decade. While most providers of online writing programs offer units in...
2nd c. CE defenses around small towns in Roman Britain structured by road network connectivity
2nd c. CE defenses around small towns in Roman Britain structured by road network connectivity
AbstractThe large-scale provision of defenses around small towns in Roman Britain during the 2nd c. CE is without parallel in the Roman Empire. Although the relationship between de...
Evaluation of the Quality of Robust Clustering Algorithm TCLUST on the Example of Dataset of Air Pollutants Emission in Krakow
Evaluation of the Quality of Robust Clustering Algorithm TCLUST on the Example of Dataset of Air Pollutants Emission in Krakow
Acquisition and data collection is currently a very dynamic processes. In order to obtain from data useful information, when huge quantities of data, the processing of the data is ...

Back to Top