一类基于启发式搜索的激励学习算法被引量：2

A Class of Reinforcement Learning Algorithm Based on Heuristic Search

出　　处：《计算机技术与发展》2006年第8期41-43,共3页Computer Technology and Development

摘　　要：激励学习已被证明是在控制领域中一种可行的新方法。相比其他的方法,它能较好地处理未知环境问题,但它仍然不是一种有效的方法。幸运的是,在现实世界中,智能体总是会有一些环境的先验知识,这些能形成启发式信息。启发式搜索是一种常用的搜索方法,有很快的搜索速度,但需要精确的启发式信息,这在有些时候难以得到。文中分析比较了启发式搜索和激励学习的各自特点,提出一类新的基于启发式搜索的激励学习算法,初步的实验结果显示了较好的性能。The reinforcement learning has been proved to be a new applicable method in control field. It can solve the problems of unknown environment better than the others. But it isn＇t a very effective method yet. Fortunately in real world,the agent often has some knowledge of the environment,which can be used as heuristic information. The heuristic search is a very effective search method,which can search very quickly. But it need very precise heuristic information, which may be hard to get in complex environment. The characteristics of heuristic search and reinforcement learning are compared and a class of reinforcement learning algorithm on heuristic search is introduced. The preliminary empirical result shows better than the previous.

关键词：启发式搜索激励学习启发式SARSA

分类号：TP301.6[自动化与计算机技术—计算机系统结构]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一类基于启发式搜索的激励学习算法被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一类基于启发式搜索的激励学习算法 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

一类基于启发式搜索的激励学习算法被引量：2