Research on Task Allocation of Process Planning Based on Reinforcement Learning and Neural Network  Cited by: 3

Authors: Su Yingying[1], Wang Wanshan[1], Wang Jianrong[1], Tang Liang[1]

Affiliation: [1] School of Mechanical Engineering and Automation, Northeastern University, Shenyang 110004, Liaoning, China

Source: Journal of Northeastern University (Natural Science), 2009, No. 2, pp. 279-282 (4 pages)

Funding: Specialized Research Fund for the Doctoral Program of Higher Education, Ministry of Education (20060145017)

Abstract: In task allocation problems, a Markov decision process model whose state-action space is very large suffers from the curse of dimensionality. To address this, a reinforcement learning strategy based on a BP (backpropagation) neural network is proposed. The generalization ability of the BP network is used to store and approximate the Q-values of state-action pairs, and an optimal action-selection strategy based on Q-learning is designed together with a BP neural network model and algorithm for Q-learning. The method is applied to task allocation in process planning. Simulation experiments in Matlab confirm that it performs well and approximates the target behavior closely. The method thereby increases the practical value of reinforcement learning for task allocation problems.
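The idea summarized in the abstract, a BP network standing in for the Q-table, can be illustrated with a minimal Python sketch. It is not the authors' Matlab model: the cost matrix `COST`, the one-hot state encoding, the network size, the epsilon-greedy exploration rule, and all hyperparameters are assumptions chosen only to keep the example small. Each step applies the usual Q-learning target r + γ·max Q(s', a') and takes one backpropagation step toward it.

```python
import math
import random

# Hypothetical toy data: COST[t][m] is the cost of giving task t to resource m.
COST = [[4.0, 1.0, 3.0],
        [2.0, 5.0, 1.0],
        [1.0, 3.0, 4.0]]
N_TASKS, N_RES, N_HIDDEN = 3, 3, 8
GAMMA, ALPHA, EPSILON = 0.9, 0.05, 0.3  # assumed hyperparameters


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class QNetwork:
    """One-hidden-layer BP network: one-hot state in, one Q-value per action out."""

    def __init__(self, rng):
        self.w1 = [[rng.uniform(-1.0, 1.0) for _ in range(N_TASKS)]
                   for _ in range(N_HIDDEN)]
        self.b1 = [0.0] * N_HIDDEN
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(N_HIDDEN)]
                   for _ in range(N_RES)]
        self.b2 = [0.0] * N_RES

    def forward(self, state):
        # With a one-hot input, the hidden pre-activation is w1[j][state] + b1[j].
        hidden = [sigmoid(self.w1[j][state] + self.b1[j]) for j in range(N_HIDDEN)]
        q = [sum(self.w2[a][j] * hidden[j] for j in range(N_HIDDEN)) + self.b2[a]
             for a in range(N_RES)]
        return hidden, q

    def update(self, state, action, target):
        """One gradient step on 0.5 * (target - Q(state, action)) ** 2."""
        hidden, q = self.forward(state)
        delta = target - q[action]
        for j in range(N_HIDDEN):
            # Backpropagate through the sigmoid before touching w2[action][j].
            grad_hidden = delta * self.w2[action][j] * hidden[j] * (1.0 - hidden[j])
            self.w2[action][j] += ALPHA * delta * hidden[j]
            self.w1[j][state] += ALPHA * grad_hidden
            self.b1[j] += ALPHA * grad_hidden
        self.b2[action] += ALPHA * delta


def train(episodes=8000, seed=0):
    rng = random.Random(seed)
    net = QNetwork(rng)
    for _ in range(episodes):
        for state in range(N_TASKS):          # tasks are allocated in a fixed order
            _, q = net.forward(state)
            if rng.random() < EPSILON:        # explore
                action = rng.randrange(N_RES)
            else:                             # exploit the current Q estimate
                action = max(range(N_RES), key=q.__getitem__)
            reward = -COST[state][action]
            if state + 1 < N_TASKS:
                _, q_next = net.forward(state + 1)
                target = reward + GAMMA * max(q_next)
            else:
                target = reward               # last task: no bootstrap term
            net.update(state, action, target)
    return net


def greedy_allocation(net):
    """Pick, for every task, the resource with the highest learned Q-value."""
    return [max(range(N_RES), key=net.forward(t)[1].__getitem__)
            for t in range(N_TASKS)]


net = train()
allocation = greedy_allocation(net)
total_cost = sum(COST[t][m] for t, m in enumerate(allocation))
print("allocation:", allocation, "total cost:", total_cost)
```

In this toy setting the next state does not depend on the chosen action, so the state-action space is tiny and a plain table would do just as well. The sketch only shows where the network replaces the table; the benefit described in the abstract would appear with state encodings too large to enumerate.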

Keywords: task allocation; process planning; reinforcement learning; Q-learning; neural network

Classification number: TH164 [Mechanical Engineering: Machinery Manufacturing and Automation]

 
