检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:关巍[1] 郝淑慧 崔哲闻 王淼淼 GUAN Wei;HAO Shuhui;CUI Zhewen;WANG Miaomiao(Navigation College,Dalian Maritime University,Dalian 116026,China)
出 处:《中国舰船研究》2025年第1期172-180,共9页Chinese Journal of Ship Research
基 金:国家自然科学基金资助项目(51409033,52171342)。
摘 要:[目的]针对传统深度确定性策略梯度(DDPG)算法数据利用率低、收敛性差的特点,改进并提出一种新的无人船自主避碰决策方法。[方法]利用优先经验回放(PER)自适应调节经验优先级,降低样本的相关性,并利用长短期记忆(LSTM)网络提高算法的收敛性。基于船舶领域和《国际海上避碰规则》(COLREGs),设置会遇情况判定模型和一组新定义的奖励函数,并考虑了紧迫危险以应对他船不遵守规则的情况。为验证所提方法的有效性,在两船和多船会遇局面下进行仿真实验。[结果]结果表明,改进的DDPG算法相比于传统DDPG算法在收敛速度上提升约28.8%,[结论]训练好的自主避碰模型可以使无人船在遵守COLREGs的同时实现自主决策和导航,为实现更加安全、高效的海上交通智能化决策提供参考。[Objectives]To enhance the safety and efficiency of maritime traffic,this paper proposes an autonomous collision avoidance decision-making method for unmanned ships based on an enhanced Deep Deterministic Policy Gradient(DDPG)algorithm.[Methods]In order to address the issues of low data utilization and poor convergence in traditional DDPG algorithms,we employ Priority Experience Replay(PER)to dynamically adjust experience priority,reduce sample correlation,and utilize the Long Short-Term Memory(LSTM)network to improve the algorithm convergence.Based on the domain knowledge of ships and adhering to the International Regulations for Preventing Collisions at Sea(COLREGs),a model for determining meeting situations and a novel set of reward functions that consider urgent scenarios when other ships fail to comply with the COLREGs are introduced.Generalization experiments are conducted involving two-ship and multi-ship encounters to validate the effectiveness of the proposed method.[Results]As the experimental results demonstrate,compared to traditional DDPG algorithms,our improved approach enhances the convergence speed by approximately 28.8%.[Conclusions]The trained model enables autonomous decision-making and navigation while ensuring compliance with the COLREGs,thereby providing valuable insights for intelligent decision-making in the field of maritime transportation.
关 键 词:无人船 深度确定性策略梯度算法 自主避碰决策 优先经验回放 国际海上避碰规则 避碰
分 类 号:U664.82[交通运输工程—船舶及航道工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.117.151.179