检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:于子航 王改云[1] YU Zi-hang;WANG Gai-yun(School of Electrical Engineering and Automation,Guilin University of Electronic Technology,Guilin Guangxi 541000,China)
机构地区:[1]桂林电子科技大学花江校区电子工程与自动化学院,广西桂林541000
出 处:《计算机仿真》2022年第5期344-347,379,共5页Computer Simulation
摘 要:为解决依赖受控系统数学模型而导致的数据驱动控制性能不完善,控制跟踪结果误差较大的问题,提出一种基于近似Q-学习算法的数据驱动控制方法。为使Q被充分学习,在时间轴的立即回报序列上估计训练值,因此修改确定性规则,使其采用当前Q值和修正后估计衰减值计算加权平均值计算,实现算法收敛。采用以Q-学习算法为结构的控制器代替受控数据当前工作点的一般非线性模型,并且仅使用被控对象提供的数据来评估模型中的伪偏导函数,实现无模型数据驱动控制。仿真结果证明,所提方法的信号扰动较小,且控制跟踪结果误差较小,整体性能要优于传统方法。A data-driven control method based on approximate Q-learning algorithm is proposed to solve the problems of imperfect data-driven control performance and large error of control tracking results caused by relying on the mathematical model of the controlled system.In order to make Q be fully learned,the training value was estimated on the immediate return sequence of the time axis,so that the deterministic rule was modified.Then,the weighted average value was calculated based on current Q value and the estimated attenuation value after the correction,so that the convergence of algorithm was achieved.The controller based on Q-learning algorithm was used to replace the general nonlinear model of the current working point of the controlled data.Meanwhile,only the data provided by the controlled object were used to evaluate the pseudo partial derivative in model.Finally,the data-driven control without model was completed.Simulation results show that the proposed method has smaller signal disturbance and smaller error in control and track,so its overall performance is better than the traditional method.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.118.32.150