基于改进DDPG算法的无人船自主避碰决策方法  

Autonomous decision-making method of unmanned ship based on improved DDPG algorithm

作  者:关巍[1] 郝淑慧 崔哲闻 王淼淼 GUAN Wei;HAO Shuhui;CUI Zhewen;WANG Miaomiao(Navigation College,Dalian Maritime University,Dalian 116026,China)

机构地区:[1]大连海事大学航海学院,辽宁大连116026

出  处:《中国舰船研究》2025年第1期172-180,共9页Chinese Journal of Ship Research

基  金:国家自然科学基金资助项目(51409033,52171342)。

摘  要:[目的]针对传统深度确定性策略梯度(DDPG)算法数据利用率低、收敛性差的特点,改进并提出一种新的无人船自主避碰决策方法。[方法]利用优先经验回放(PER)自适应调节经验优先级,降低样本的相关性,并利用长短期记忆(LSTM)网络提高算法的收敛性。基于船舶领域和《国际海上避碰规则》(COLREGs),设置会遇情况判定模型和一组新定义的奖励函数,并考虑了紧迫危险以应对他船不遵守规则的情况。为验证所提方法的有效性,在两船和多船会遇局面下进行仿真实验。[结果]结果表明,改进的DDPG算法相比于传统DDPG算法在收敛速度上提升约28.8%,[结论]训练好的自主避碰模型可以使无人船在遵守COLREGs的同时实现自主决策和导航,为实现更加安全、高效的海上交通智能化决策提供参考。[Objectives]To enhance the safety and efficiency of maritime traffic,this paper proposes an autonomous collision avoidance decision-making method for unmanned ships based on an enhanced Deep Deterministic Policy Gradient(DDPG)algorithm.[Methods]In order to address the issues of low data utilization and poor convergence in traditional DDPG algorithms,we employ Priority Experience Replay(PER)to dynamically adjust experience priority,reduce sample correlation,and utilize the Long Short-Term Memory(LSTM)network to improve the algorithm convergence.Based on the domain knowledge of ships and adhering to the International Regulations for Preventing Collisions at Sea(COLREGs),a model for determining meeting situations and a novel set of reward functions that consider urgent scenarios when other ships fail to comply with the COLREGs are introduced.Generalization experiments are conducted involving two-ship and multi-ship encounters to validate the effectiveness of the proposed method.[Results]As the experimental results demonstrate,compared to traditional DDPG algorithms,our improved approach enhances the convergence speed by approximately 28.8%.[Conclusions]The trained model enables autonomous decision-making and navigation while ensuring compliance with the COLREGs,thereby providing valuable insights for intelligent decision-making in the field of maritime transportation.

关 键 词:无人船 深度确定性策略梯度算法 自主避碰决策 优先经验回放 国际海上避碰规则 避碰 

分 类 号:U664.82[交通运输工程—船舶及航道工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象