3D path planning of unmanned ground vehicles based on improved DDQN
Abstract
For safe and efficient path planning of unmanned ground vehicles in complex 3D environments, this paper proposes an improved deep reinforcement learning algorithm, Dual Experience Dynamic Target DDQN (DEDT DDQN), to address two difficulties that traditional DDQN algorithms face in complex maps: convergence under sparse rewards and over-estimation. The algorithm improves the performance of DDQN in complex environments by dividing incoming experiences according to their quality and by dynamically fusing the prior knowledge of DDQN and average DDQN during network parameter training. For unstructured 3D environments, the paper adopts a path planning strategy based on the digital elevation model (DEM) that accounts for environmental characteristics and time cost. Simulation experiments in 3D maps modeled on realistic environments show that DEDT DDQN reduces the number of inflection points and the average slope change by 40% and 16.7%, respectively, and improves optimization-search performance and convergence speed by 5.34% and 60%, respectively. The improved algorithm and the adopted strategy can be applied to two different types of maps, which verifies their effectiveness and robustness.
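The abstract does not give the fusion rule, so the following is only a minimal sketch of how a DDQN target and an averaged target might be blended. It assumes the standard Double DQN target (action chosen by the online network, evaluated by the target network), an averaged variant that evaluates the same action over several past target-network snapshots, and a hypothetical weight `beta` for a convex combination. The function names, `beta`, and the snapshot history are illustrative assumptions, not the paper's definitions.

```python
from typing import Sequence


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value (first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def ddqn_target(reward: float, gamma: float, done: bool,
                q_online_next: Sequence[float],
                q_target_next: Sequence[float]) -> float:
    """Standard Double DQN target: the online network selects the action,
    the target network evaluates it."""
    if done:
        return reward
    a_star = argmax(q_online_next)
    return reward + gamma * q_target_next[a_star]


def averaged_ddqn_target(reward: float, gamma: float, done: bool,
                         q_online_next: Sequence[float],
                         q_target_next_history: Sequence[Sequence[float]]) -> float:
    """Assumed averaged variant: evaluate the selected action with the mean
    of K past target-network snapshots (K = len(q_target_next_history))."""
    if done:
        return reward
    a_star = argmax(q_online_next)
    mean_q = sum(q[a_star] for q in q_target_next_history) / len(q_target_next_history)
    return reward + gamma * mean_q


def fused_target(reward: float, gamma: float, done: bool,
                 q_online_next: Sequence[float],
                 q_target_next_history: Sequence[Sequence[float]],
                 beta: float) -> float:
    """Hypothetical fusion: convex combination of the two targets.
    beta is an assumed weight in [0, 1]; the most recent snapshot
    (index 0 here) plays the role of the current target network."""
    y_ddqn = ddqn_target(reward, gamma, done, q_online_next, q_target_next_history[0])
    y_avg = averaged_ddqn_target(reward, gamma, done, q_online_next, q_target_next_history)
    return beta * y_ddqn + (1.0 - beta) * y_avg


# Toy values for one transition with three actions.
q_online_next = [0.2, 0.8, 0.5]
history = [[0.3, 1.0, 0.9], [0.1, 0.6, 0.7], [0.2, 0.8, 0.4]]
y = fused_target(reward=1.0, gamma=0.9, done=False,
                 q_online_next=q_online_next,
                 q_target_next_history=history, beta=0.5)
print(round(y, 4))
```

A "dynamic" fusion could vary `beta` over training, for example by trusting the averaged estimate more early on, but the schedule is not stated in the abstract and is left open here.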
Related Results
RECOMMENDATIONS ON PREPARING UNITS OF UNMANNED GROUND COMPLEXES FOR USE
This work is devoted to practical recommendations on preparation for the use of unmanned ground systems as part of a unit. The materials of the article provide a theoretical justif...
Autonomous localized path planning algorithm for UAVs based on TD3 strategy
Abstract
Unmanned Aerial Vehicles are useful tools for many applications. However, autonomous path planning for Unmanned Aerial Vehicles in unfamiliar environments is a challenging ...
Persistent Unmanned Surface Vehicles for Subsea Support
Abstract
This paper discusses the role of unmanned systems in subsea support. Recent developments in mobile unmanned vehicle networks are reviewed, demonstrating ...
Nonlinear geometric multivariable control for unmanned aircraft flight system
Purpose
Due to the important role of unmanned aircraft in military and human’s normal practical application, this paper aims to extend the interesting research ...
Ground ice detection and implications for permafrost geomorphology
Most permafrost contains ground ice, often as pore ice or thin veins or lenses of ice. In certain circumstances, larger bodies of ice can form, such as ice wedges, or massive lenses...
An Energy Efficient Design of Computation Offloading Enabled by UAV
The data volume is exploding due to various newly-developing applications that call for stringent communication requirements towards 5th generation wireless systems. Fortunately, m...
Development of a control system for multiple unmanned aerial vehicles
Background. The study considers the problem associated with the development of a control system for many unmanned aerial vehicles. An analysis of the possibilities of improving the...
Observation Method for Autonomous Maneuver of Spacecraft under Emergency Conditions
Abstract
To deal with space threats with strong maneuverability such as kinetic energy interceptors, remote-sensing satellites need to perform autonomous avoidance while ca...

