检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Yun Zhang Lulu Zhang Yunze Cai
机构地区:[1]the Department of Automation,Shanghai Jiao Tong University,Shanghai 200240 [2]the Key Laboratory of System Control and Information Processing,Ministry of Education of China,Shanghai 200240,China
出 处:《IEEE/CAA Journal of Automatica Sinica》2024年第3期690-697,共8页自动化学报(英文版)
基 金:supported by the Industry-University-Research Cooperation Fund Project of the Eighth Research Institute of China Aerospace Science and Technology Corporation (USCAST2022-11);Aeronautical Science Foundation of China (20220001057001)。
摘 要:This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence proof.The players are divided into two groups in the learning process and adapt their policies sequentially.Our method removes the dependence of admissible initial policies,which is one of the main drawbacks of the PI-based frameworks.Furthermore,this algorithm enables the players to adapt their control policies without full knowledge of others’ system parameters or control laws.The efficacy of our method is illustrated by three examples.
关 键 词:Adaptive dynamic programming incomplete information multi-player differential game value iteration
分 类 号:O221.3[理学—运筹学与控制论] O232[理学—数学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.185