3D path planning of unmanned ground vehicles based on improved DDQN
Abstract
For safe and efficient path planning of unmanned ground vehicles in complex 3D environments, this paper proposes an improved deep reinforcement learning algorithm, Dual Experience Dynamic Target DDQN (DEDT DDQN), to address two difficulties that traditional DDQN algorithms face in complex maps: slow convergence under sparse rewards and over-estimation of action values. The algorithm improves DDQN performance in complex environments by dividing the input experiences according to their quality and by dynamically fusing the a priori knowledge of DDQN and average DDQN when training the network parameters. For unstructured 3D environments, the paper adopts a path planning strategy based on the digital elevation model (DEM) that considers environmental characteristics and time cost. Simulation experiments in 3D maps modeled on realistic environments show that DEDT DDQN reduces the number of inflection points and the average slope change by 40% and 16.7%, respectively, and improves optimization search performance and convergence speed by 5.34% and 60%, respectively. The improved algorithm and the adopted strategy work on two different types of maps, which verifies their effectiveness and robustness.
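The abstract does not state how the DDQN and average DDQN estimates are fused, so the following is only a minimal sketch of one plausible reading. It computes the standard Double DQN target, an averaged variant that uses the mean of several earlier target-network estimates, and a convex blend of the two. The function names, the `weight` parameter, and the blending rule are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def ddqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Double DQN target: the online network picks the next action,
    the target network supplies the value of that action."""
    best_action = int(np.argmax(q_online_next))
    bootstrap = 0.0 if done else gamma * float(q_target_next[best_action])
    return reward + bootstrap


def averaged_ddqn_target(reward, done, q_online_next, q_target_history, gamma=0.99):
    """Averaged variant: replace the single target estimate with the mean
    over several earlier target-network estimates (one row per snapshot)."""
    q_mean = np.mean(np.stack(q_target_history), axis=0)
    return ddqn_target(reward, done, q_online_next, q_mean, gamma)


def fused_target(reward, done, q_online_next, q_target_history, weight, gamma=0.99):
    """Hypothetical fusion rule (an assumption, not the paper's formula):
    a convex blend of the latest Double DQN target and the averaged target.
    `weight` in [0, 1] could be scheduled dynamically during training."""
    latest = ddqn_target(reward, done, q_online_next, q_target_history[-1], gamma)
    averaged = averaged_ddqn_target(reward, done, q_online_next, q_target_history, gamma)
    return weight * latest + (1.0 - weight) * averaged


# Toy next-state Q-values for three actions (illustrative numbers only).
q_online_next = np.array([0.2, 0.8, 0.5])
q_target_history = [np.array([1.0, 4.0, 3.0]), np.array([1.0, 2.0, 3.0])]

example = fused_target(1.0, False, q_online_next, q_target_history, weight=0.5, gamma=0.9)
```

Averaging over several target snapshots lowers the variance of the bootstrap value, which is one common way to dampen over-estimation; a blend weight then lets training lean on whichever estimate is more useful at a given stage.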
Related Results
Autonomous localized path planning algorithm for UAVs based on TD3 strategy
Abstract
Unmanned Aerial Vehicles are useful tools for many applications. However, autonomous path planning for Unmanned Aerial Vehicles in unfamiliar environments is a challenging ...
Persistent Unmanned Surface Vehicles for Subsea Support
Abstract
This paper discusses the role of unmanned systems in subsea support. Recent developments in mobile unmanned vehicle networks are reviewed, demonstrating ...
Research on Path Smoothing Optimization based on Improved RRT-Connect Algorithm and third-order Bezier curve
Abstract
Targeting the deficiencies of the original RRT-Connect path planning algorithm in dealing with obstacle avoidance, planning efficiency and path smoothing in static...
Observation Method for Autonomous Maneuver of Spacecraft under Emergency Conditions
Abstract
To deal with space threats with strong maneuverability such as kinetic energy interceptors, remote-sensing satellites need to perform autonomous avoidance while ca...
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstract
Introduction
Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Application of Unmanned Flying Vehicle for Obtaining Digital Orthofotomaps
Nowadays, surveys using unmanned aerial vehicles are becoming popular. The resulting orthophotomap is the final product for creating digital plans and cardboard. The objectives of t...
Path Planning Followed by Kinodynamic Smoothing for Multirotor Aerial Vehicles (MAVs)
Any obstacle-free path planning algorithm, in general, gives a sequence of waypoints that connect start and goal positions by a sequence of straight lines, which does not ensure th...
A Finite-Time Path Tracking Control Scheme for Unmanned Vehicles Based on Multidimensional Taylor Network and Adaptive Filtering
Aiming at the path tracking control problem of unmanned vehicles under the conditions of model uncertainty and measurement noise, a finite-time path tracking control scheme based o...

