Javascript must be enabled to continue!
Enhancing Quadrotor Control Robustness with Multi-Proportional–Integral–Derivative Self-Attention-Guided Deep Reinforcement Learning
View through CrossRef
Deep reinforcement learning has demonstrated flexibility advantages in the control field of quadrotor aircraft. However, when there are sudden disturbances in the environment, especially special disturbances beyond experience, the algorithm often finds it difficult to maintain good control performance. Additionally, due to the randomness in the algorithm’s exploration of states, the model’s improvement efficiency during the training process is low and unstable. To address these issues, we propose a deep reinforcement learning framework guided by Multi-PID Self-Attention to tackle the challenges in the training speed and environmental adaptability of quadrotor aircraft control algorithms. In constructing the simulation experiment environment, we introduce multiple disturbance models to simulate complex situations in the real world. By combining the PID control strategy with deep reinforcement learning and utilizing the multi-head self-attention mechanism to optimize the state reward function in the simulation environment, this framework achieves an efficient and stable training process. This experiment aims to train a quadrotor simulation model to accurately fly to a predetermined position under various disturbance conditions and subsequently maintain a stable hovering state. The experimental results show that, compared with traditional deep reinforcement learning algorithms, this method achieves significant improvements in training efficiency and state exploration ability. At the same time, this study deeply analyzes the application effect of the algorithm in different complex environments, verifies its superior robustness and generalization ability in dealing with environmental disturbances, and provides a new solution for the intelligent control of quadrotor aircraft.
Title: Enhancing Quadrotor Control Robustness with Multi-Proportional–Integral–Derivative Self-Attention-Guided Deep Reinforcement Learning
Description:
Deep reinforcement learning has demonstrated flexibility advantages in the control field of quadrotor aircraft.
However, when there are sudden disturbances in the environment, especially special disturbances beyond experience, the algorithm often finds it difficult to maintain good control performance.
Additionally, due to the randomness in the algorithm’s exploration of states, the model’s improvement efficiency during the training process is low and unstable.
To address these issues, we propose a deep reinforcement learning framework guided by Multi-PID Self-Attention to tackle the challenges in the training speed and environmental adaptability of quadrotor aircraft control algorithms.
In constructing the simulation experiment environment, we introduce multiple disturbance models to simulate complex situations in the real world.
By combining the PID control strategy with deep reinforcement learning and utilizing the multi-head self-attention mechanism to optimize the state reward function in the simulation environment, this framework achieves an efficient and stable training process.
This experiment aims to train a quadrotor simulation model to accurately fly to a predetermined position under various disturbance conditions and subsequently maintain a stable hovering state.
The experimental results show that, compared with traditional deep reinforcement learning algorithms, this method achieves significant improvements in training efficiency and state exploration ability.
At the same time, this study deeply analyzes the application effect of the algorithm in different complex environments, verifies its superior robustness and generalization ability in dealing with environmental disturbances, and provides a new solution for the intelligent control of quadrotor aircraft.
Related Results
Event-triggered Asynchronous Distributed MPC for Multi-Quadrotor Systems with Communication Delays
Event-triggered Asynchronous Distributed MPC for Multi-Quadrotor Systems with Communication Delays
This paper investigates the formation composition and keeping problem
for multi-quadrotor systems under time-varying communication delays.
First, a distributed model predictive con...
Diseño de un controlador robusto basado en Lmis aplicado en un Quadrotor
Diseño de un controlador robusto basado en Lmis aplicado en un Quadrotor
Introducción— Una Aeronave No Tripulada UAV o UAVS, por sus siglas en inglés, es aquella capaz de realizar un vuelo sin necesidad de tener un piloto a bordo. Las técnicas de contro...
The Effect of Compression Reinforcement on the Shear Behavior of Concrete Beams with Hybrid Reinforcement
The Effect of Compression Reinforcement on the Shear Behavior of Concrete Beams with Hybrid Reinforcement
Abstract
This study examines the impact of steel compression reinforcement on the shear behavior of concrete beams reinforced with glass fiber reinforced polymer (GFRP) bar...
Quadrotor UAV Trajectory Tracking Control Based on Adaptive Non-singular Terminal Sliding Mode
Quadrotor UAV Trajectory Tracking Control Based on Adaptive Non-singular Terminal Sliding Mode
To address the issue of the quadrotor unmanned aerial vehicle (UAV) being susceptible to external disturbances and model uncertainties during flight, an adaptive non-singular termi...
DEEP AND REINFORCEMENT LEARNING
DEEP AND REINFORCEMENT LEARNING
The book “Deep and Reinforcement Learning” offers a comprehensive journey into the intricate realms of deep learning and reinforcement learning. Unit-1 sets the stage by delving in...
Study on Scheme Optimization of bridge reinforcement increasing ratio
Study on Scheme Optimization of bridge reinforcement increasing ratio
Abstract
The bridge reinforcement methods, each method has its advantages and disadvantages. The load-bearing capacity of bridge members is controlled by the ultimat...
Fault Tolerant Predictive Control Based on Discrete-Time Sliding Mode Observer for Quadrotor UAV
Fault Tolerant Predictive Control Based on Discrete-Time Sliding Mode Observer for Quadrotor UAV
An active fault-tolerant control scheme for a quadrotor unmanned aerial vehicle (UAV) with actuators faults is presented in this paper. The proposed scheme is based on model predic...
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in ...

