未知饱和控制系统有穷域最优控制被引量：2

Finite-horizon optimal control for unknown systems with saturating control inputs

机构地区：[1]东北大学信息科学与工程学院,辽宁沈阳110819 [2]牡丹江师范学院数学科学学院,黑龙江牡丹江157011

出　　处：《控制理论与应用》2016年第5期631-637,共7页Control Theory & Applications

基　　金：牡丹江市科学技术计划项目(G2015k1991);牡丹江师范学院一般项目(YB201605);国家自然科学基金项目(61104010);中国博士后自然科学基金项目(2012M510825;2014T70260);中央高校基本科研基金项目(N140404004)资助~~

摘　　要：针对带有饱和执行器且局部未知的非线性连续系统的有穷域最优控制问题,设计了一种基于自适应动态规划(ADP)的在线积分增强学习算法,并给出算法的收敛性证明.首先,引入非二次型函数处理控制饱和问题.其次,设计一种由常量权重和时变激活函数构成的单一网络,来逼近未知连续的值函数,与传统双网络相比减少了计算量.同时,综合考虑神经网络产生的残差和终端误差,应用最小二乘法更新神经网络权重,并且给出基于神经网络的迭代值函数收敛到最优值的收敛性证明.最后,通过两个仿真例子验证了算法的有效性.An adaptive dynamic programming （ADP）-based online integral reinforcement learning algorithm is designed for firdte-hodzon optimal control of nonlinear continuous-time systems with saturating control inputs and partially unknown dynamics. Moreover, the convergence of the algorithm is proved. Firstly, the control constraints are handled through non- quadratic function. Secondly, a single neural network （NN） with constant weights and time-dependent activation functions is designed in order to approximate the unknown and continuous value function. Compared with the traditional dual neural networks, the burden of computation by the single NN is lessened. Meanwhile, the NN weights are updated by the least square method with considering both the residual error and terminal error. Furthermore, the convergence of iterative value function on the base of NN is proved. Lastly, two simulation examples show the effectiveness of the proposed algorithm.

关键词：有穷域最优控制神经网络自适应动态规划

分类号：TP273[自动化与计算机技术—检测技术与自动化装置]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

未知饱和控制系统有穷域最优控制被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

未知饱和控制系统有穷域最优控制 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

未知饱和控制系统有穷域最优控制被引量：2