Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Intelligent Ship Collision Avoidance Algorithm Based on DDQN with Prioritized Experience Replay under COLREGs

View through CrossRef
Ship collisions often result in huge losses of life, cargo and ships, as well as serious pollution of the water environment. Meanwhile, it is estimated that between 75% and 86% of maritime accidents are related to human factors. Thus, it is necessary to enhance the intelligence of ships to partially or fully replace the traditional piloting mode and eventually achieve autonomous collision avoidance to reduce the influence of human factors. In this paper, we propose a multi-ship automatic collision avoidance method based on a double deep Q network (DDQN) with prioritized experience replay. Firstly, we vectorize the predicted hazardous areas as the observation states of the agent so that similar ship encounter scenarios can be clustered and the input dimension of the neural network can be fixed. The reward function is designed based on the International Regulations for Preventing Collision at Sea (COLREGs) and human experience. Different from the architecture of previous collision avoidance methods based on deep reinforcement learning (DRL), in this paper, the interaction between the agent and the environment occurs only in the collision avoidance decision-making phase, which greatly reduces the number of state transitions in the Markov decision process (MDP). The prioritized experience replay method is also used to make the model converge more quickly. Finally, 19 single-vessel collision avoidance scenarios were constructed based on the encounter situations classified by the COLREGs, which were arranged and combined as the training set for the agent. The effectiveness of the proposed method in close-quarters situation was verified using the Imazu problem. The simulation results show that the method can achieve multi-ship collision avoidance in crowded waters, and the decisions generated by this method conform to the COLREGs and are close to the level of human ship handling.
Title: Intelligent Ship Collision Avoidance Algorithm Based on DDQN with Prioritized Experience Replay under COLREGs
Description:
Ship collisions often result in huge losses of life, cargo and ships, as well as serious pollution of the water environment.
Meanwhile, it is estimated that between 75% and 86% of maritime accidents are related to human factors.
Thus, it is necessary to enhance the intelligence of ships to partially or fully replace the traditional piloting mode and eventually achieve autonomous collision avoidance to reduce the influence of human factors.
In this paper, we propose a multi-ship automatic collision avoidance method based on a double deep Q network (DDQN) with prioritized experience replay.
Firstly, we vectorize the predicted hazardous areas as the observation states of the agent so that similar ship encounter scenarios can be clustered and the input dimension of the neural network can be fixed.
The reward function is designed based on the International Regulations for Preventing Collision at Sea (COLREGs) and human experience.
Different from the architecture of previous collision avoidance methods based on deep reinforcement learning (DRL), in this paper, the interaction between the agent and the environment occurs only in the collision avoidance decision-making phase, which greatly reduces the number of state transitions in the Markov decision process (MDP).
The prioritized experience replay method is also used to make the model converge more quickly.
Finally, 19 single-vessel collision avoidance scenarios were constructed based on the encounter situations classified by the COLREGs, which were arranged and combined as the training set for the agent.
The effectiveness of the proposed method in close-quarters situation was verified using the Imazu problem.
The simulation results show that the method can achieve multi-ship collision avoidance in crowded waters, and the decisions generated by this method conform to the COLREGs and are close to the level of human ship handling.

Related Results

Complex Collision Tumors: A Systematic Review
Complex Collision Tumors: A Systematic Review
Abstract Introduction: A collision tumor consists of two distinct neoplastic components located within the same organ, separated by stromal tissue, without histological intermixing...
CORALL: A COLREGs-Guided Risk-Aware LLM for Decision-Making in Maritime Autonomous Surface Ships
CORALL: A COLREGs-Guided Risk-Aware LLM for Decision-Making in Maritime Autonomous Surface Ships
The paper introduces a novel approach to utilising Large Language Models (LLMs) for real-time COLREGs-based decision-making in collision encounters. The COLREGs (collision regulati...
3D path planning of unmanned ground vehicles based on improved DDQN
3D path planning of unmanned ground vehicles based on improved DDQN
Abstract For safe and efficient path planning of unmanned ground vehicles in complex 3D environment, this paper proposes an improved deep reinforcement learning algorithm (...
Design and Optimization for Ship Structure Based on Knowledge-Based Engineering
Design and Optimization for Ship Structure Based on Knowledge-Based Engineering
It is always pursued that the excellent ship structure is rapidly designed and modified on the premise of ensuring security in ship engineering. In this paper, design and optimizat...
Design of a Ship Path Planning Scheme Incorporating Wind and Current Effects with COLREGS Compliance
Design of a Ship Path Planning Scheme Incorporating Wind and Current Effects with COLREGS Compliance
<div class="section abstract"><div class="htmlview paragraph">Path planning algorithms are critical technologies for intelligent ship systems, as sc...
Theta-band phase locking during encoding leads to coordinated entorhinal-hippocampal replay
Theta-band phase locking during encoding leads to coordinated entorhinal-hippocampal replay
Abstract Precisely timed interactions between hippocampal and cortical neurons during replay epochs are thought to support memory consolidation. ...
Evaluating hippocampal replay without a ground truth
Evaluating hippocampal replay without a ground truth
AbstractDuring rest and sleep, memory traces replay in the brain. The dialogue between brain regions during replay is thought to stabilize labile memory traces for long-term storag...
Ship Collaborative Path Planning Method Based on CS-STHA
Ship Collaborative Path Planning Method Based on CS-STHA
Ship path planning is one of the key technologies for ship automation. Establishing a cooperative collision avoidance (CA) path for multi-ship encounters is of great value to marit...

Back to Top