Authors: Gao Xiaoge (高晓格) [1]; Han Shuyun (韩淑云) [2]
Affiliations: [1] Automotive College, Anyang Vocational and Technical College, Anyang 455000, Henan, China; [2] Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan 430079, Hubei, China
Source: Computer Applications and Software (《计算机应用与软件》), 2025, No. 2, pp. 287-291, 298 (6 pages in total)
Funding: National Natural Science Foundation of China (61972173).
Abstract: For the optimal tracking control problem of a class of nonlinear systems with input constraints, an adaptive dynamic programming control strategy based on reinforcement learning is proposed. A suitable performance index function is designed to handle the input constraint of the control system. A critic (evaluation) neural network is designed to estimate the optimal performance index function of the system, so that the Hamilton-Jacobi-Bellman (HJB) equation of the control system can be solved and the optimal control input obtained. The weight update law of the critic network is derived using the Lyapunov method, and the tracking error of the system and the weight estimation error of the critic network are proved to be uniformly ultimately bounded (UUB). Numerical simulation experiments verify the effectiveness of the proposed control strategy.
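The abstract does not give the form of the performance index or of the resulting control law. As a rough illustration only, the sketch below shows one formulation that is widely used for input-constrained adaptive dynamic programming; it is an assumption, not taken from the paper. The error dynamics $\dot e = F(e) + G(e)u$, the input bound $\lambda$, the weighting matrices $Q$ and $R$ (with $R$ diagonal and positive definite), the basis vector $\varphi$, and the critic weights $\hat W$ are all hypothetical symbols introduced here.

```latex
% Assumed setting: tracking-error dynamics with a bounded input, |u_i| <= \lambda
\dot e = F(e) + G(e)\,u

% A non-quadratic performance index commonly used to encode the input bound
V(e(t)) = \int_t^{\infty} \Big( e^{\top} Q\, e + U(u) \Big)\, d\tau,
\qquad
U(u) = 2 \int_0^{u} \lambda \,\big(\tanh^{-1}(v/\lambda)\big)^{\top} R \, dv

% HJB equation for the optimal value function V^*
0 = \min_{u} \Big[ e^{\top} Q\, e + U(u) + (\nabla V^*)^{\top} \big( F(e) + G(e)\,u \big) \Big]

% Stationarity in u gives a control that respects the bound by construction
u^* = -\lambda \tanh\!\Big( \tfrac{1}{2\lambda}\, R^{-1} G(e)^{\top} \nabla V^* \Big)

% Critic approximation of the value function and the resulting approximate control
\hat V(e) = \hat W^{\top} \varphi(e),
\qquad
\hat u = -\lambda \tanh\!\Big( \tfrac{1}{2\lambda}\, R^{-1} G(e)^{\top} \nabla\varphi(e)^{\top} \hat W \Big)
```

In a formulation of this kind the $\tanh$ keeps every component of the control within $\pm\lambda$, which is how the choice of performance index, rather than an explicit saturation block, can enforce the input constraint. The performance index and weight update law actually used by the authors may differ.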
Classification code: TP3 [Automation and Computer Technology: Computer Science and Technology]