Online Pareto optimal control of mean-field stochastic multi-player systems using policy iteration

作　　者：Xiushan JIANG Yanshuang WANG Dongya ZHAO Ling SHI

机构地区：[1]College of New Energy,China University of Petroleum East China,Qingdao 266580,China [2]Department of Electronic and Computer Engineering,Hong Kong University of Science and Technology,Hong Kong 999077,China

出　　处：《Science China(Information Sciences)》2024年第4期17-33,共17页中国科学（信息科学）（英文版）

基　　金：supported by National Natural Science Foundation of China(Grant Nos.62103442,12326343,62373229);Natural Science Foundation of Shandong Province(Grant No.ZR2021QF080);Fundamental Research Funds for the Central Universities(Grant No.23CX06024A);Outstanding Youth Innovation Team in Shandong Higher Education Institutions(Grant No.2023KJ061)。

摘　　要：In this study,the Pareto optimal strategy problem was investigated for multi-player mean-field stochastic systems governed by It?differential equations using the reinforcement learning(RL)method.A partially model-free solution for Pareto-optimal control was derived.First,by applying the convexity of cost functions,the Pareto optimal control problem was solved using a weighted-sum optimal control problem.Subsequently,using on-policy RL,we present a novel policy iteration(PI)algorithm based on the Hrepresentation technique.In particular,by alternating between the policy evaluation and policy update steps,the Pareto optimal control policy is obtained when no further improvement occurs in system performance,which eliminates directly solving complicated cross-coupled generalized algebraic Riccati equations(GAREs).Practical numerical examples are presented to demonstrate the effectiveness of the proposed algorithm.

关键词：mean-field stochastic systems Pareto optimal control policy iteration scheme H-representation

分类号：O232[理学—运筹学与控制论]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Online Pareto optimal control of mean-field stochastic multi-player systems using policy iteration

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Online Pareto optimal control of mean-field stochastic multi-player systems using policy iteration

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索