Javascript must be enabled to continue!
<span class="word">Adaptive <span class="word"><span class="changedDisabled">Actor-<span class="word"><span class="changedDisabled">Critic <span class="word"><span class="changedDisabled">Optimal <span class="w
View through CrossRef
This study develops an adaptive optimal tracking control law using neural network (NN)-based reinforcement learning (RL) for high-order partially unknown nonlinear systems. By designing a cost function associated with the sliding mode surface (SMS), the original tracking control problem is equivalently transformed into solving the optimal control problem related to the tracking Hamilton-Jacobi-Bellman (HJB) equation. Since the analytical solution of the HJB equation is generally intractable, we employ a policy iteration algorithm derived from the HJB equation, where both the partial derivative of the optimal tracking cost function and the optimal control law are approximated by NNs. The proposed RL framework achieves simplification through actor-critic training laws derived under the condition that a simple function is zero. Finally, two simulative examples are provided to demonstrate the effectiveness and advantages of the proposed adaptive optimal tracking control method.
Title: <span class="word">Adaptive <span class="word"><span class="changedDisabled">Actor-<span class="word"><span class="changedDisabled">Critic <span class="word"><span class="changedDisabled">Optimal <span class="w
Description:
This study develops an adaptive optimal tracking control law using neural network (NN)-based reinforcement learning (RL) for high-order partially unknown nonlinear systems.
By designing a cost function associated with the sliding mode surface (SMS), the original tracking control problem is equivalently transformed into solving the optimal control problem related to the tracking Hamilton-Jacobi-Bellman (HJB) equation.
Since the analytical solution of the HJB equation is generally intractable, we employ a policy iteration algorithm derived from the HJB equation, where both the partial derivative of the optimal tracking cost function and the optimal control law are approximated by NNs.
The proposed RL framework achieves simplification through actor-critic training laws derived under the condition that a simple function is zero.
Finally, two simulative examples are provided to demonstrate the effectiveness and advantages of the proposed adaptive optimal tracking control method.
Related Results
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
<span style="font-size:11pt"><span style="background:#f9f9f4"><span style="line-height:normal"><span style="font-family:Calibri,sans-serif"><b><spa...
Crescimento de feijoeiro sob influência de carvão vegetal e esterco bovino
Crescimento de feijoeiro sob influência de carvão vegetal e esterco bovino
<p align="justify"><span style="color: #000000;"><span style="font-family: 'Times New Roman', serif;"><span><span lang="pt-BR">É indiscutível a import...
Automatic classification of paddy leaf disease
Automatic classification of paddy leaf disease
<span lang="EN-MY">Riceisastaple<span>f</span>o<span>o</span>din<span>m</span>ost<span>o</span>ft<span>h</span>e&l...
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Effects of a new land surface parametrization scheme on thermal extremes in a Regional Climate Model
Effects of a new land surface parametrization scheme on thermal extremes in a Regional Climate Model
<p><span>The </span><span>EFRE project Big Data@Geo aims at providing high resolution </span><span&...
Stress transfer process in doublet events studied by numerical TREMOL simulations: Study case Ometepec 1982 Doublet.
Stress transfer process in doublet events studied by numerical TREMOL simulations: Study case Ometepec 1982 Doublet.
<pre class="western"><span><span lang="en-US">Earthquake doublets are a characteristic rupture <...
Modeling gravitational instabilities in the partially molten crust with a Volume-Of-Fluid method
Modeling gravitational instabilities in the partially molten crust with a Volume-Of-Fluid method
<p><span>This work aims at </span><span>investigat</span><span>ing</span><s...
Cometary Physics Laboratory: spectrophotometric experiments
Cometary Physics Laboratory: spectrophotometric experiments
<p><strong><span dir="ltr" role="presentation">1. Introduction</span></strong&...

