Authors: ZHOU Yifan[1]; GUO Kai; LI Bangcheng (School of Mechanical Engineering, Southeast University, Nanjing 211189, China)
Source: Journal of Changsha University of Science and Technology: Natural Science (《长沙理工大学学报(自然科学版)》), 2023, No. 2, pp. 27-34 (8 pages)
Funding: National Natural Science Foundation of China (72071044).
Abstract: [Purposes] This paper investigates the effectiveness of multi-agent reinforcement learning algorithms for maintenance optimization of multi-component production systems, and the feasibility of applying domain knowledge of maintenance optimization in reinforcement learning. [Methods] The maintenance decision-making process of the production system was modeled as a Markov decision process (MDP), which was solved by a shaped reward distributed Q-learning (SR-DQL) algorithm. Domain knowledge of maintenance optimization was introduced into reinforcement learning through the design of the agents and through reward shaping. [Findings] The proposed SR-DQL algorithm was validated on a production system with five production units and four inventory buffers. Compared with the Q-learning algorithm, SR-DQL improved the average revenue by 6%. The average revenue obtained by SR-DQL was also higher than that obtained by the distributed Q-learning and deep reinforcement learning algorithms. [Conclusions] Multi-agent reinforcement learning can effectively handle the maintenance optimization problem of large-scale production systems, and adding reward shaping can improve algorithm performance and yield a better maintenance policy.
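Keywords: multi-component production system; reward shaping; distributed Q-learning; multi-agent reinforcement learning; deep reinforcement learning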
Classification: TH17 [Mechanical Engineering - Mechanical Manufacturing and Automation]
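
Illustrative note: the record above gives no implementation details for SR-DQL, so the following is only a minimal sketch of the general idea of combining Q-learning with reward shaping. It is not the authors' algorithm. It uses a single agent and a single machine rather than the multi-agent, five-unit system of the paper. The degradation model, the costs and revenues, the potential function `potential`, and all function names are hypothetical choices made for illustration. The shaping term used here is the standard potential-based form, gamma * Phi(s') - Phi(s), which may differ from the shaping used in the article.

```python
import random

# All numbers below are illustrative assumptions, not values from the paper.
N_STATES = 4        # degradation levels: 0 (as new) .. 3 (failed)
ACTIONS = (0, 1)    # 0 = keep producing, 1 = maintain
GAMMA = 0.95        # discount factor


def step(state, action, rng):
    """Toy single-machine dynamics: returns (next_state, reward)."""
    failed = N_STATES - 1
    if action == 1:
        # Maintenance restores the machine; repairing a failure costs more.
        cost = 5.0 if state == failed else 2.0
        return 0, -cost
    if state == failed:
        # A failed machine that is not repaired incurs a downtime penalty.
        return state, -3.0
    # A working machine earns revenue and may degrade by one level.
    next_state = state + 1 if rng.random() < 0.3 else state
    return next_state, 1.0


def potential(state):
    """Assumed domain knowledge: healthier states are more valuable."""
    return -float(state)


def train(episodes=2000, horizon=50, alpha=0.1, epsilon=0.1,
          shaped=True, seed=0):
    """Tabular epsilon-greedy Q-learning, optionally with shaped rewards."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(horizon):
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            next_state, reward = step(state, action, rng)
            if shaped:
                # Potential-based shaping term added to the raw reward.
                reward += GAMMA * potential(next_state) - potential(state)
            target = reward + GAMMA * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action per degradation level under the learned Q-table."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES)]
```

Calling `greedy_policy(train())` returns one action per degradation level. Passing `shaped=False` to `train` gives a plain Q-learning baseline for comparison on this toy problem only; it says nothing about the results reported in the article.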