ZERO-SUM

作品数:34被引量:49H指数:5
导出分析报告
相关领域:理学更多>>
相关作者:吴臻于志勇杨涛更多>>
相关机构:东北师范大学更多>>
相关期刊:《Chinese Physics B》《Applied Mathematics and Mechanics(English Edition)》《Acta Mathematica Scientia》《Philosophy Study》更多>>
相关基金:国家自然科学基金霍英东教育基金国家重点基础研究发展计划国家教育部博士点基金更多>>
-

检索结果分析

结果分析中...
条 记 录,以下是1-10
视图:
排序:
On-Policy and Off-Policy Value Iteration Algorithms for Stochastic Zero-Sum Dynamic Games
《Journal of Systems Science & Complexity》2025年第1期421-435,共15页GUO Liangyuan WANG Bing-Chang ZHANG Ji-Feng 
supported by the National Natural Science Foundation of China under Grant Nos.62122043,62192753,62433020,T2293770;Natural Science Foundation of Shandong Province for Distinguished Young Scholars under Grant No.ZR2022JQ31.
This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown dynamics.On-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,...
关键词:Approximate dynamic programming on-policy off-policy stochastic zero-sum games valueiteration 
A zero-sum hybrid stochastic differential game with impulse controls
《Science China(Information Sciences)》2024年第11期281-295,共15页Siyu LV Zhen WU Jie XIONG 
supported by National Key R&D Program of China(Grant Nos.2023YFA1009200,2022YFA-1006102);National Natural Science Foundation of China(Grant Nos.12471414,11831010,61961160732,12471418);Natural Science Foundation of Jiangsu Province(Grant No.BK20242023);Natural Science Foundation of Shandong Province(Grant No.ZR2019ZD42);Taishan Scholars Climbing Program of Shandong(Grant No.TSPD20210302);Fundamental Research Funds for the Central Universities(Grant No.2242024K40018);Jiangsu Province Scientific Research Center of Applied Mathematics(Grant No.BK20233002)。
In this paper,we study a zero-sum stochastic differential game with the following salient features:(i)the system state is dictated by a hybrid diffusion,(ii)both players use impulse controls,and(iii)the game takes pla...
关键词:stochastic differential game Markov chain impulse control HJBI equation viscosity solution verification theorem 
An Online Q-Learning Method for Linear-Quadratic Nonzero-Sum Stochastic Differential Games with Completely Unknown Dynamics
《Journal of Systems Science & Complexity》2024年第5期1907-1922,共16页ZHANG Bao-Qiang WANG Bing-Chang CAO Ying 
supported in part by the National Natural Science Foundation of China under Grant Nos.62122043,62192753;in part by Natural Science Foundation of Shandong Province for Distinguished Young Scholars under Grant No.ZR2022JQ31;in part by the Innovative Research Groups of the National Natural Science Foundation of China under Grant No.61821004.
In this paper,the authors design a reinforcement learning algorithm to solve the adaptive linear-quadratic stochastic n-players non-zero sum differential game with completely unknown dynamics.For each player,a critic ...
关键词:Actor-critic algorithm model-free adaptive control nonzero-sum stochastic game reinforcement learning 
Constructive Approximate Nash Equilibrium for In-Orbit Target Enclosing with Collision Avoidance and Full-state Constraint via Nonzero-Sum Differential Games
《Guidance, Navigation and Control》2024年第3期166-190,共25页Bosong Wei Xiaokui Yue Zhiwei Hao Zongcheng Liu 
supported in part by the National Natural Science Foundation of China(Grant Nos. 12172288 and 12472046);the National Key Research and Development Program of China (Grant No. 2021YFC2202600)
The problem of in-orbit cooperative target enclosing involving N thrust-limited satellites under collision avoidance and maneuver amplitude constraints is studied. In order to find global optimal trajectories for targ...
关键词:Approximate Nash equilibrium differential game invariant manifold asymptotic stability target enclosing 
Computational intelligence interception guidance law using online off-policy integral reinforcement learning
《Journal of Systems Engineering and Electronics》2024年第4期1042-1052,共11页WANG Qi LIAO Zhizhong 
Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-f...
关键词:two-person zero-sum differential games Hamilton–Jacobi–Isaacs(HJI)equation off-policy integral reinforcement learning(IRL) online learning computational intelligence inter-ception guidance(CIIG)law 
Stationary Almost Markov ε-Equilibria for Discounted Stochastic Games with Borel Spaces and Unbounded Payoffs
《Journal of Systems Science & Complexity》2024年第4期1672-1684,共13页WU Yiting ZHANG Junyu HUANG Song 
supported by the National Key Research and Development Program of China under Grant No.2022YFA1004600;the National Natural Science Foundation of China under Grant No.11931018;the Guangdong Basic and Applied Basic Research Foundation under Grant No.2021A1515010057;the Guangdong Province Key Laboratory of Computational Science at the Sun Yat-sen University under Grant No.2020B1212060032。
This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion.The payoff function can be unbounded.The transition probability i...
关键词:Almost Markovε-equilibrium Borel state space expected discounted payoff criterion nonzero-sum stochastic games unbounded payoffs 
Zero-Sum Continuous-Time Markov Games with One-Side Stopping
《Journal of the Operations Research Society of China》2024年第1期169-187,共19页Yurii Averboukh 
The article was prepared within the framework of the HSE University Basic Research Program in 2023。
The paper is concerned with a variant of the continuous-time finite state Markov game of control and stopping where both players can affect transition rates,while only one player can choose a stopping time.The dynamic...
关键词:Continuous-time Markov games Dynamic programming Verification theorem Stopping time 
Minimax Q-learning design for H_(∞) control of linear discrete-time systems
《Frontiers of Information Technology & Electronic Engineering》2022年第3期438-451,共14页Xinxing LI Lele XI Wenzhong ZHA Zhihong PENG 
supported by the National Natural Science Foundation of China (No. U1613225)。
The H_(∞)control method is an effective approach for attenuating the effect of disturbances on practical systems, but it is difficult to obtain the H_(∞)controller due to the nonlinear Hamilton-Jacobi-Isaacs equatio...
关键词:H_(∞)control Zero-sum dynamic game Reinforcement learning Adaptive dynamic programming Minimax Q-learning Policy iteration 
Inverse problems associated with subsequence sums in Cp■Cp
《Frontiers of Mathematics in China》2020年第5期985-1000,共16页Jiangtao PENG Yongke QU Yuanlin LI 
supported in part by the Fundamental Research Funds for the Central Universities(No.3122019152);the National Natural Science Foundation of China(Grant Nos.11701256,11871258);the Youth Backbone Teacher Foundation of Henan's University(No.2019GGJS196);the China Scholarship Council(Grant No.201908410132);was also supported in part by a Discovery Grant from the Natural Sciences and Engineering Research Council of Canada(Grant No.RGPIN 2017-03903).
Let G be a finite abelian group and S be a sequence with elements of G.We say that S is a regular sequence over G if|SH|≤|H|-1 holds for every proper subgroup H of G,where SH denotes the subsequence of S consisting o...
关键词:Inverse problems subsequence sums regular sequences zZero-sumfree sequences 
Updating Western Democracies for the 21st Century:A Move for Replacing a Zero-Sum System by a Win-Win One
《Journal of Philosophy Study》2020年第2期95-106,共12页Amos Avny 
The surrounding world is changing.Uncertainty is the only certain thing projected.The Western Civilization is moving from the 20th century orderly and stable Modernity to the chaotic and unstable reality of the 21st c...
关键词:DEMOCRACY technology change REFERENDUM grand-coalition amendments improvements Digital Era 
检索报告 对象比较 聚类工具 使用帮助 返回顶部