一类控制受约束非线性系统的基于单网络贪婪迭代DHP算法的近似最优镇定被引量：1

Near-optimal Stabilization for a Class of Nonlinear Systems with Control Constraint Based on Single Network Greedy Iterative DHP Algorithm

作　　者：罗艳红[1,2] 张化光[1,2] 曹宁[2] 陈兵[3]

机构地区：[1]东北大学流程工业综合自动化教育部重点实验室,沈阳110004 [2]东北大学信息科学与工程学院,沈阳110004 [3]青岛大学复杂性科学研究所,青岛266071

出　　处：《自动化学报》2009年第11期1436-1445,共10页Acta Automatica Sinica

基　　金：国家自然科学基金(60534010;60774048;60728307);国家高技术研究发展计划(863计划)(2006AA04Z183);长江学者和创新团队发展计划(60521003);高等学校学科创新引智计划(B08015)资助~~

摘　　要：提出一种贪婪迭代DHP(Dual heuristic programming)算法,解决了一类控制受约束非线性系统的近似最优镇定问题.针对系统的控制约束,首先引入一个非二次泛函把约束问题转换为无约束问题,然后基于协状态函数提出一种贪婪迭代DHP算法以求解系统的HJB(Hamilton-Jacobi-Bellman)方程.在算法的每个迭代步,利用一个神经网络来近似系统的协状态函数,而后根据协状态函数直接计算系统的最优控制策略,从而消除了常规近似动态规划方法中的控制网络.最后通过两个仿真例子证明了本文提出的最优控制方案的有效性和可行性.The near-optimal stabilization problem for nonlinear constrained systems is solved by greedy iterative DHP （Dual heuristic programming） algorithm, Considering the control constraint of the system, a nonquadratic functional is first introduced in order to transform the constrained problem into a unconstrained problem. Then based on the costate function, the greedy iterative DHP algorithm is proposed to solve the Hamilton-Jacobi-Bellman （HJB） equation of the system. At each step of the iterative algorithm, a neural network is utilized to approximate the costate function, and then the optimal control policy of the system can be computed directly according to the costate function, which removes the action network appearing in the ordinary approximate dynamic programming （ADP） method. Finally, two examples are given to demonstrate the validity and feasibility of the proposed optimal control scheme.

关键词：贪婪迭代约束非二次泛函最优控制神经网络

分类号：TP18[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一类控制受约束非线性系统的基于单网络贪婪迭代DHP算法的近似最优镇定被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一类控制受约束非线性系统的基于单网络贪婪迭代DHP算法的近似最优镇定 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

一类控制受约束非线性系统的基于单网络贪婪迭代DHP算法的近似最优镇定被引量：1