A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture 被引量：6

A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture

作　　者：SONG RuiZhuo XIAO WenDong SUN ChangYin

机构地区：[1]School of Automation and Electrical Engineering,University of Science and Technology Beijing

出　　处：《Science China(Information Sciences)》2014年第6期280-289,共10页中国科学（信息科学）（英文版）

基　　金：supported by the National Natural Science Foundation of China(Grant Nos.61304079,61125306,61034002);Beijing Natural Science Foundation(Grant No.4143065);China Postdoctoral Science Foundation(Grant No.2013M530527);the Open Research Project from SKLMCCS(Grant No.20120106)

摘　　要：A novel self-learning optimal control method for a class of discrete-time nonlinear systems is proposed based on iteration adaptive dynamic programming（ADP）algorithm.It is proven that the iteration costate functions converge to the optimal one,and a detailed convergence analysis of the iteration ADP algorithm is given.Furthermore,echo state network（ESN）architecture is used as the approximator of the costate function for each iteration.To ensure the reliability of the ESN approximator,the ESN mean square training error is constrained in the satisfactory range.Two simulation examples are given to demonstrate that the proposed control method has a fast response speed due to the special structure and the fast training process.A novel self-learning optimal control method for a class of discrete-time nonlinear systems is proposed based on iteration adaptive dynamic programming（ADP）algorithm.It is proven that the iteration costate functions converge to the optimal one,and a detailed convergence analysis of the iteration ADP algorithm is given.Furthermore,echo state network（ESN）architecture is used as the approximator of the costate function for each iteration.To ensure the reliability of the ESN approximator,the ESN mean square training error is constrained in the satisfactory range.Two simulation examples are given to demonstrate that the proposed control method has a fast response speed due to the special structure and the fast training process.

关键词：adaptive dynamic programming DISCRETE-TIME optimal control ESN costate function

分类号：TP13[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture 被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture 被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索