A New Data-Driven Neural Dynamic Programming Method (cited by: 1)

A New Data-Driven Neural Dynamic Programming Algorithm

Source: Artificial Intelligence and Robotics Research (《人工智能与机器人研究》), 2019, No. 2, pp. 46-56 (11 pages)

Funding: Natural Science Foundation of Guangdong Province (No. 2018A030313505); Guangdong Province Science and Technology Program (No. 2017B010124003, No. 2017B090909001).

Abstract: To achieve optimal control of model-free discrete-time nonlinear dynamic systems, this paper proposes a new data-driven neural dynamic programming method. The method requires the inner product of the Q-function residual with the basis functions to be zero, and likewise the inner product of the control-policy residual with the basis functions; these two conditions yield the governing equations. The coefficients of the neural network are then updated iteratively using an offline data set together with online data, which gives an approximately optimal control policy. The paper also proves that the algorithm converges.
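The zero-inner-product condition on the Q-function residual can be illustrated with a small sketch. This is not the paper's algorithm: the abstract gives no equations, so everything below is an assumption made for illustration, including the scalar linear system `x_next = a*x + b*u`, the quadratic cost, the three quadratic basis functions in `phi`, and the names `evaluate_policy`, `improve_policy`, and `run`. The sketch also uses only an offline batch of transitions and replaces the paper's policy-residual condition with a closed-form minimisation over `u`, which is available only because the assumed Q-function is quadratic.

```python
import numpy as np


def phi(x, u):
    """Assumed quadratic basis, so that Q(x, u) is approximated by w . phi(x, u)."""
    return np.array([x * x, x * u, u * u])


def evaluate_policy(data, k, q=1.0, r=1.0):
    """Find w such that the Q-function residual is orthogonal to every basis function.

    Residual for one transition (x, u, x_next) under the policy u = -k * x:
        w . phi(x, u) - cost(x, u) - w . phi(x_next, -k * x_next)
    Setting sum_i phi_i * residual_i = 0 gives the linear system A w = rhs.
    """
    A = np.zeros((3, 3))
    rhs = np.zeros(3)
    for x, u, x_next in data:
        f = phi(x, u)
        f_next = phi(x_next, -k * x_next)
        A += np.outer(f, f - f_next)
        rhs += f * (q * x * x + r * u * u)
    return np.linalg.solve(A, rhs)


def improve_policy(w):
    """Minimise w . phi(x, u) over u, which gives u = -(w[1] / (2 * w[2])) * x."""
    return w[1] / (2.0 * w[2])


def run(a=1.2, b=1.0, k0=0.5, n_samples=200, n_iters=10, seed=0):
    """Alternate policy evaluation and improvement on a fixed batch of transitions."""
    rng = np.random.default_rng(seed)
    xs = rng.uniform(-1.0, 1.0, n_samples)
    us = rng.uniform(-1.0, 1.0, n_samples)
    # a and b are used only to generate data; the learner sees just the tuples.
    data = [(x, u, a * x + b * u) for x, u in zip(xs, us)]
    k = k0  # initial gain, assumed to be stabilising (|a - b*k0| < 1)
    w = None
    for _ in range(n_iters):
        w = evaluate_policy(data, k)
        k = improve_policy(w)
    return k, w


if __name__ == "__main__":
    gain, weights = run()
    print("learned feedback gain:", gain)
    print("Q-function coefficients:", weights)
```

Because the true Q-function of this toy problem lies exactly in the span of the assumed basis, the orthogonality condition recovers it without approximation error; for a genuinely nonlinear system the same condition would only give a best fit within the chosen basis.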

Keywords: optimal control; neural dynamic programming; Q-function; neural network

Classification: TP2 [Automation and Computer Technology: Detection Technology and Automatic Devices]
