基于策略迭代的脉冲系统最优控制

Discrete-Time Impulsive Optimal Control on Policy Iteration

作　　者：高洋李媛[1] GAO Yang;LI Yuan(School of Science,Shenyang University of Technology,Shenyang 110870)

出　　处：《系统科学与数学》2024年第11期3228-3238,共11页Journal of Systems Science and Mathematical Sciences

基　　金：国家自然科学基金项目(62103289)资助课题。

摘　　要：针对离散时间非线性系统的最优脉冲控制问题,提出了一种基于策略迭代(PI)的自适应动态规划(ADP)算法.首先引入脉冲区间集的约束条件,将系统转换为离散时间非线性脉冲控制系统,并根据哈密顿-雅可比-贝尔曼方程得到脉冲控制下的最优性能指标函数.其次提出了一种基于PI的ADP算法解决了脉冲系统最优控制问题,并给出了脉冲系统的收敛性分析.相比于值迭代(VI)算法,PI在保证系统稳定的同时收敛速度更快.然后提出了一种策略评估算法,放宽了PI算法的初始条件,解决了初始值选取困难的问题.最后通过仿真实例验证了该算法的有效性.An adaptive dynamic programming(ADP)algorithm based on strategy iteration(PI)is proposed for optimal pulse control of discrete-time nonlinear systems.Firstly,the system is transformed into a discrete-time nonlinear pulse control system by introducing the constraint of pulse interval set,and the optimal performance index function under pulse control is obtained according to the Hamilton-Jacobi-Bellman equation.Secondly,an ADP algorithm based on PI is proposed to solve the optimal control problem of the pulse system,and the convergence analysis of the pulse system is given.Compared with the value iteration(VI)algorithm,PI converges faster while ensuring system stability.Then a strategy evaluation algorithm is proposed,which relaxes the initial conditions of PI algorithm and solves the difficult problem of initial value selection.Finally,a simulation example is given to verify the effectiveness of the proposed algorithm.

关键词：脉冲系统策略迭代最优控制自适应动态规划

分类号：O232[理学—运筹学与控制论]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于策略迭代的脉冲系统最优控制

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于策略迭代的脉冲系统最优控制

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索