检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:曾斌[1] 王睿[2] 李厚朴[3] 樊旭 ZENG Bin;WANG Rui;LI Houpu;FAN Xu(Department of Management and Economics,Naval University of Engineering,Wuhan 430033,China;Teaching and Research Support Center,Naval University of Engineering,Wuhan 430033,China;Department of Navigation Engineering,Naval University of Engineering,Wuhan 430033,China)
机构地区:[1]海军工程大学管理工程与装备经济系,湖北武汉430033 [2]海军工程大学教研保障中心,湖北武汉430033 [3]海军工程大学导航工程系,湖北武汉430033
出 处:《系统工程与电子技术》2022年第1期199-208,共10页Systems Engineering and Electronics
基 金:国家自然科学基金(41771487);湖北省杰出青年科学基金(2019CFA086)资助课题。
摘 要:智能化后装保障调度是当前军事领域的研究热点之一,其中复杂多变的战场环境要求战时保障具有良好的自适应性。针对此问题,提出了基于马尔可夫决策过程的强化学习模型,能够主动学习最佳派遣策略,根据历史数据和当前态势预判后续变化。为了考虑不确定事件的影响,在模型求解算法中增加了基于概率统计模型的仿真流程;为了减少随机事件带来的计算复杂性,利用决策后状态变量重新设计了贝尔曼迭代方程;为了解决状态空间的维度灾问题,提出了基于基函数组合的近似函数。仿真实验表明,强化学习能力的引入能够显著提高战时保障调度性能。Intelligent logistics and equipment support is one of the research hotspots in the current military field, it is necessary for the wartime support to be adaptive in the complicated and changeable battlefield. Aiming at this problem, a reinforcement learning model based on Markov decision process(MDP) is proposed, which can adaptively learn the optimal assignment policy and obtain the scheduling scheme according to historical data and prediction based on current situation. A simulation procedure based on probability statistical model is adopted into the model solution to consider the impact of the uncertainty events. Furthermore, the post decision state is used in the design of Bellman iterative equation to decrease the computation complexity brought by the random incidents. Finally, the approximate function based on composition of basis functions is proposed to overcome the problem of dimensionality curse. Simulation experiment shows that the reinforcement learning capability can significantly improve the scheduling performance of support force.
分 类 号:TP301[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.216.45.231