Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control  


Authors: Hao Zhang, Yan Li, Zhuping Wang, Yi Ding, Huaicheng Yan

Affiliations: [1] Department of Control Science and Engineering, Tongji University, Shanghai 200092, China; [2] Key Laboratory of Advanced Control and Optimization for Chemical Processes of Ministry of Education, School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China

Source: IEEE/CAA Journal of Automatica Sinica (自动化学报(英文版)), 2024, Issue 4, pp. 1060-1062 (3 pages)

Funding: Supported by the National Natural Science Foundation of China (61922063, 62273255, 62150026); in part by the Shanghai International Science and Technology Cooperation Project (21550760900, 22510712000); the Shanghai Municipal Science and Technology Major Project (2021SHZDZX0100); and the Fundamental Research Funds for the Central Universities.

Abstract: Dear Editor, In this letter, the multi-objective optimal control problem of nonlinear discrete-time systems is investigated. A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy. In the policy improvement process, a policy-gradient-based method is employed.
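The evaluate-then-improve loop described in the abstract can be illustrated with a minimal sketch. This is not the letter's algorithm. It assumes a hypothetical scalar linear plant (`step`), a weighted-sum scalarization of two made-up objectives (`cost`), a linear policy `u = -theta * x`, and a rollout-based estimate of the action-state value function. Every name and constant below is an assumption chosen for illustration. The learner only queries the plant for samples and never reads its coefficients.

```python
# Illustrative sketch only: a toy scalar problem, not the method of the letter.

def step(x, u):
    """Hypothetical plant, used only as a data source for rollouts."""
    return 0.9 * x + 0.5 * u

def cost(x, u, weights=(0.5, 0.5)):
    """Weighted sum of two assumed objectives: state error and control effort."""
    return weights[0] * x * x + weights[1] * u * u

def policy(theta, x):
    """Assumed linear state-feedback policy."""
    return -theta * x

def q_value(theta, x, u, horizon=60, gamma=0.95):
    """Policy evaluation: discounted cost of applying u once, then following the policy."""
    total, discount = 0.0, 1.0
    for _ in range(horizon):
        total += discount * cost(x, u)
        x = step(x, u)
        u = policy(theta, x)
        discount *= gamma
    return total

def improve(theta, states, lr=0.05, eps=1e-4):
    """Policy improvement: gradient step using dQ/du from central differences."""
    grad = 0.0
    for x in states:
        u = policy(theta, x)
        dq_du = (q_value(theta, x, u + eps) - q_value(theta, x, u - eps)) / (2 * eps)
        grad += dq_du * (-x)  # chain rule: du/dtheta = -x
    return theta - lr * grad / len(states)

def train(theta=0.0, iterations=200):
    """Alternate evaluation and improvement over a fixed set of sample states."""
    states = [-1.0, -0.5, 0.5, 1.0]
    for _ in range(iterations):
        theta = improve(theta, states)
    return theta
```

Calling `train()` returns a feedback gain whose estimated cost from `x = 1.0` is lower than that of the initial zero-gain policy. Changing `weights` in `cost` shifts the trade-off between the two assumed objectives.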

Keywords: policy, gradient, optimal

Classification code: O232 [Science: Operations Research and Cybernetics]

 
