基于改进Sarsa算法的拖轮动态调度方法

Tugboat Dynamic Scheduling Method Based on Improved Sarsa Algorithm

作　　者：李佳琛段兴锋[1] LI Jiachen;DUAN Xingfeng(School of Navigation,Jimei University,Xiamen Fujian 361000 China)

出　　处：《重庆科技学院学报（自然科学版）》2024年第3期42-48,共7页Journal of Chongqing University of Science and Technology：Natural Sciences Edition

基　　金：福建省自然科学基金项目“海上智能调度”(2019J01325)。

摘　　要：从优化Sarsa算法的角度展开拖轮动态调度方法研究。采用强化学习框架并结合拖轮的状态和环境信息,建立状态-动作函数,进而搜索拖轮调度最佳策略。改进Sarsa算法中Q函数的更新方式,以克服收敛速度慢的问题。同时,根据学习率选择模式和动作选择方式,对探索策略与利用策略加以平衡,以提高算法的收敛速度和性能。算例仿真实验结果表明,采用该算法可有效缩短船舶等待时间,进而提升拖轮资源的利用效率。Aiming at the shortcomings of the traditional Sarsa algorithm,the optimization of tugboat dynamic scheduling method is studied.Based on the reinforcement learning framework and the state and environment information of tugboats,the state-action function is established to search the optimal strategy of tugboats scheduling decision.The update method of Q function in Sarsa algorithm is changed to overcome the problem of slow convergence.At the same time,according to the learning rate and action selection mode,the exploration strategy and utilization strategy are balanced to improve the convergence speed and performance of the algorithm.The simulation results show that the algorithm can effectively shorten the waiting time and improve the utilization efficiency of tugboat resources.

关键词：Sarsa算法拖轮自适应调度强化学习算法策略

分类号：TP391[自动化与计算机技术—计算机应用技术] U691[自动化与计算机技术—计算机科学与技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于改进Sarsa算法的拖轮动态调度方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于改进Sarsa算法的拖轮动态调度方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索