Adaptive Actor-Critic Optimal ...

This study develops an adaptive optimal tracking control law for high-order, partially unknown nonlinear systems using neural network (NN)-based reinforcement learning (RL). By designing a cost function associated with the sliding mode surface (SMS), the original tracking control problem is transformed into an equivalent optimal control problem governed by the tracking Hamilton-Jacobi-Bellman (HJB) equation. Since the analytical solution of the HJB equation is generally intractable, we employ a policy iteration algorithm derived from the HJB equation, in which both the partial derivative of the optimal tracking cost function and the optimal control law are approximated by NNs. The proposed RL framework is simplified by deriving the actor-critic training laws under the condition that a simple function is zero. Finally, two simulation examples are provided to demonstrate the effectiveness and advantages of the proposed adaptive optimal tracking control method.
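To make the idea of policy iteration derived from an HJB equation concrete, the following is a minimal sketch on a scalar linear system, not the method of the study itself. It assumes the dynamics dx/dt = a*x + b*u, the quadratic cost integral of q*x^2 + r*u^2, a value function V(x) = p*x^2 and a linear policy u = -k*x. All function and parameter names are hypothetical. The study instead treats high-order nonlinear systems with an SMS-based cost and NN approximators, none of which appear here.

```python
import math


def policy_iteration_scalar_lqr(a, b, q, r, k0, iters=20):
    """Policy iteration for dx/dt = a*x + b*u with cost integral of q*x^2 + r*u^2.

    Illustrative assumptions: V(x) = p*x^2 and u = -k*x.
    k0 must stabilise the closed loop, i.e. a - b*k0 < 0.
    """
    k = k0
    if a - b * k >= 0:
        raise ValueError("initial gain k0 must make a - b*k0 negative")
    p = None
    for _ in range(iters):
        # Policy evaluation: solve q + r*k^2 + 2*p*(a - b*k) = 0 for p,
        # which is the HJB equation with the policy u = -k*x held fixed.
        p = (q + r * k * k) / (2.0 * (b * k - a))
        # Policy improvement: u = -(b / (2*r)) * dV/dx = -(b*p/r) * x.
        k = b * p / r
    return p, k


def riccati_solution(a, b, q, r):
    """Positive root of the scalar HJB (Riccati) equation q + 2*a*p - b^2*p^2/r = 0."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    p, k = policy_iteration_scalar_lqr(a=1.0, b=1.0, q=1.0, r=1.0, k0=3.0)
    print("iterated p:", p, "closed-form p:", riccati_solution(1.0, 1.0, 1.0, 1.0))
```

In this linear case the evaluation step has a closed form, so no approximator is needed. In a nonlinear setting such as the one the abstract describes, the evaluation and improvement steps are what the critic and actor NNs approximate.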

MDPI AG

Dengguo Xu, Xinsuo Li, Fapeng Li, Jingbei Tian

2026

