基于改进DDPG算法的无人船自主避碰决策方法

Autonomous decision-making method of unmanned ship based on improved DDPG algorithm

作　　者：关巍[1] 郝淑慧崔哲闻王淼淼 GUAN Wei;HAO Shuhui;CUI Zhewen;WANG Miaomiao(Navigation College,Dalian Maritime University,Dalian 116026,China)

机构地区：[1]大连海事大学航海学院,辽宁大连116026

出　　处：《中国舰船研究》2025年第1期172-180,共9页Chinese Journal of Ship Research

基　　金：国家自然科学基金资助项目(51409033,52171342)。

摘　　要：[目的]针对传统深度确定性策略梯度(DDPG)算法数据利用率低、收敛性差的特点,改进并提出一种新的无人船自主避碰决策方法。[方法]利用优先经验回放(PER)自适应调节经验优先级,降低样本的相关性,并利用长短期记忆(LSTM)网络提高算法的收敛性。基于船舶领域和《国际海上避碰规则》(COLREGs),设置会遇情况判定模型和一组新定义的奖励函数,并考虑了紧迫危险以应对他船不遵守规则的情况。为验证所提方法的有效性,在两船和多船会遇局面下进行仿真实验。[结果]结果表明,改进的DDPG算法相比于传统DDPG算法在收敛速度上提升约28.8%,[结论]训练好的自主避碰模型可以使无人船在遵守COLREGs的同时实现自主决策和导航,为实现更加安全、高效的海上交通智能化决策提供参考。[Objectives]To enhance the safety and efficiency of maritime traffic,this paper proposes an autonomous collision avoidance decision-making method for unmanned ships based on an enhanced Deep Deterministic Policy Gradient(DDPG)algorithm.[Methods]In order to address the issues of low data utilization and poor convergence in traditional DDPG algorithms,we employ Priority Experience Replay(PER)to dynamically adjust experience priority,reduce sample correlation,and utilize the Long Short-Term Memory(LSTM)network to improve the algorithm convergence.Based on the domain knowledge of ships and adhering to the International Regulations for Preventing Collisions at Sea(COLREGs),a model for determining meeting situations and a novel set of reward functions that consider urgent scenarios when other ships fail to comply with the COLREGs are introduced.Generalization experiments are conducted involving two-ship and multi-ship encounters to validate the effectiveness of the proposed method.[Results]As the experimental results demonstrate,compared to traditional DDPG algorithms,our improved approach enhances the convergence speed by approximately 28.8%.[Conclusions]The trained model enables autonomous decision-making and navigation while ensuring compliance with the COLREGs,thereby providing valuable insights for intelligent decision-making in the field of maritime transportation.

关键词：无人船深度确定性策略梯度算法自主避碰决策优先经验回放国际海上避碰规则避碰

分类号：U664.82[交通运输工程—船舶及航道工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于改进DDPG算法的无人船自主避碰决策方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于改进DDPG算法的无人船自主避碰决策方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索