Javascript must be enabled to continue!
Intelligent Ship Collision Avoidance Algorithm Based on DDQN with Prioritized Experience Replay under COLREGs
View through CrossRef
Ship collisions often result in huge losses of life, cargo and ships, as well as serious pollution of the water environment. Meanwhile, it is estimated that between 75% and 86% of maritime accidents are related to human factors. Thus, it is necessary to enhance the intelligence of ships to partially or fully replace the traditional piloting mode and eventually achieve autonomous collision avoidance to reduce the influence of human factors. In this paper, we propose a multi-ship automatic collision avoidance method based on a double deep Q network (DDQN) with prioritized experience replay. Firstly, we vectorize the predicted hazardous areas as the observation states of the agent so that similar ship encounter scenarios can be clustered and the input dimension of the neural network can be fixed. The reward function is designed based on the International Regulations for Preventing Collision at Sea (COLREGs) and human experience. Different from the architecture of previous collision avoidance methods based on deep reinforcement learning (DRL), in this paper, the interaction between the agent and the environment occurs only in the collision avoidance decision-making phase, which greatly reduces the number of state transitions in the Markov decision process (MDP). The prioritized experience replay method is also used to make the model converge more quickly. Finally, 19 single-vessel collision avoidance scenarios were constructed based on the encounter situations classified by the COLREGs, which were arranged and combined as the training set for the agent. The effectiveness of the proposed method in close-quarters situation was verified using the Imazu problem. The simulation results show that the method can achieve multi-ship collision avoidance in crowded waters, and the decisions generated by this method conform to the COLREGs and are close to the level of human ship handling.
Title: Intelligent Ship Collision Avoidance Algorithm Based on DDQN with Prioritized Experience Replay under COLREGs
Description:
Ship collisions often result in huge losses of life, cargo and ships, as well as serious pollution of the water environment.
Meanwhile, it is estimated that between 75% and 86% of maritime accidents are related to human factors.
Thus, it is necessary to enhance the intelligence of ships to partially or fully replace the traditional piloting mode and eventually achieve autonomous collision avoidance to reduce the influence of human factors.
In this paper, we propose a multi-ship automatic collision avoidance method based on a double deep Q network (DDQN) with prioritized experience replay.
Firstly, we vectorize the predicted hazardous areas as the observation states of the agent so that similar ship encounter scenarios can be clustered and the input dimension of the neural network can be fixed.
The reward function is designed based on the International Regulations for Preventing Collision at Sea (COLREGs) and human experience.
Different from the architecture of previous collision avoidance methods based on deep reinforcement learning (DRL), in this paper, the interaction between the agent and the environment occurs only in the collision avoidance decision-making phase, which greatly reduces the number of state transitions in the Markov decision process (MDP).
The prioritized experience replay method is also used to make the model converge more quickly.
Finally, 19 single-vessel collision avoidance scenarios were constructed based on the encounter situations classified by the COLREGs, which were arranged and combined as the training set for the agent.
The effectiveness of the proposed method in close-quarters situation was verified using the Imazu problem.
The simulation results show that the method can achieve multi-ship collision avoidance in crowded waters, and the decisions generated by this method conform to the COLREGs and are close to the level of human ship handling.
Related Results
3D path planning of unmanned ground vehicles based on improved DDQN
3D path planning of unmanned ground vehicles based on improved DDQN
Abstract
For safe and efficient path planning of unmanned ground vehicles in complex 3D environment, this paper proposes an improved deep reinforcement learning algorithm (...
Model predictive control approach to global air collision avoidance
Model predictive control approach to global air collision avoidance
PurposeMost of the existing approaches for flight collision avoidance are concerned with local traffic alone for which the separation is based on the pairwise analysis of aircraft ...
Evaluating hippocampal replay without a ground truth
Evaluating hippocampal replay without a ground truth
AbstractDuring rest and sleep, memory traces replay in the brain. The dialogue between brain regions during replay is thought to stabilize labile memory traces for long-term storag...
How Does Psilocybin Therapy Work? an Exploration of Experiential Avoidance as a Putative Mechanism of Change
How Does Psilocybin Therapy Work? an Exploration of Experiential Avoidance as a Putative Mechanism of Change
Although psilocybin therapy is currently receiving attention as a novel intervention for a wide range of mental health concerns, limited research has examined the underlying psycho...
Connecting Ship Operation and Architecture in Ship Design Processes
Connecting Ship Operation and Architecture in Ship Design Processes
It is challenging to deal with the operation of ships by crew members in ship design processes. This is important because the efficiency and safety of ship operations ultimately de...
Study on the assessment of absorbed energy of bulbous bow in ship collision
Study on the assessment of absorbed energy of bulbous bow in ship collision
The evaluation of energy absorption characteristics of the bulbous bow structure in ship collisions or grounding accidents is a crucial research area. Predicting dynamic reactions ...
Collision risk analysis of mega constellations in low Earth orbit
Collision risk analysis of mega constellations in low Earth orbit
Abstract
The LEO megaconstellations have thousands of satellites, which operate on similar orbital heights. Because of increasing space debris, the satellites accelerate th...
Exploring the roles of memory replay in targeted memory reactivation and birdsong development: Insights from computational models of complementary learning systems
Exploring the roles of memory replay in targeted memory reactivation and birdsong development: Insights from computational models of complementary learning systems
AbstractReplay facilitates memory consolidation in both biological and artificial systems. Using the complementary learning systems (CLS) framework, we study replay in both humans ...

