supported by the National Natural Science Foundation of China under Grant Nos. 62122043, 62192753, 62433020, and T2293770; the Natural Science Foundation of Shandong Province for Distinguished Young Scholars under Grant No. ZR2022JQ31.
This paper considers value iteration algorithms for stochastic zero-sum linear quadratic games with unknown dynamics. On-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,...
supported by the National Key R&D Program of China (Grant Nos. 2023YFA1009200, 2022YFA-1006102); the National Natural Science Foundation of China (Grant Nos. 12471414, 11831010, 61961160732, 12471418); the Natural Science Foundation of Jiangsu Province (Grant No. BK20242023); the Natural Science Foundation of Shandong Province (Grant No. ZR2019ZD42); the Taishan Scholars Climbing Program of Shandong (Grant No. TSPD20210302); the Fundamental Research Funds for the Central Universities (Grant No. 2242024K40018); the Jiangsu Province Scientific Research Center of Applied Mathematics (Grant No. BK20233002).
In this paper, we study a zero-sum stochastic differential game with the following salient features: (i) the system state is dictated by a hybrid diffusion, (ii) both players use impulse controls, and (iii) the game takes pla...
supported in part by the National Natural Science Foundation of China under Grant Nos. 62122043 and 62192753; in part by the Natural Science Foundation of Shandong Province for Distinguished Young Scholars under Grant No. ZR2022JQ31; and in part by the Innovative Research Groups of the National Natural Science Foundation of China under Grant No. 61821004.
In this paper, the authors design a reinforcement learning algorithm to solve the adaptive linear-quadratic stochastic n-player nonzero-sum differential game with completely unknown dynamics. For each player, a critic ...
supported in part by the National Natural Science Foundation of China (Grant Nos. 12172288 and 12472046); the National Key Research and Development Program of China (Grant No. 2021YFC2202600).
The problem of in-orbit cooperative target enclosing involving N thrust-limited satellites under collision avoidance and maneuver amplitude constraints is studied. In order to find global optimal trajectories for targ...
The missile interception problem can be regarded as a two-person zero-sum differential game, which depends on the solution of the Hamilton-Jacobi-Isaacs (HJI) equation. It has been proved impossible to obtain a closed-f...
supported by the National Key Research and Development Program of China under Grant No. 2022YFA1004600; the National Natural Science Foundation of China under Grant No. 11931018; the Guangdong Basic and Applied Basic Research Foundation under Grant No. 2021A1515010057; the Guangdong Province Key Laboratory of Computational Science at the Sun Yat-sen University under Grant No. 2020B1212060032.
This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion. The payoff function can be unbounded. The transition probability i...
The article was prepared within the framework of the HSE University Basic Research Program in 2023.
The paper is concerned with a variant of the continuous-time finite state Markov game of control and stopping where both players can affect transition rates, while only one player can choose a stopping time. The dynamic...
supported by the National Natural Science Foundation of China (No. U1613225).
The H∞ control method is an effective approach for attenuating the effect of disturbances on practical systems, but it is difficult to obtain the H∞ controller due to the nonlinear Hamilton-Jacobi-Isaacs equatio...
supported in part by the Fundamental Research Funds for the Central Universities (No. 3122019152); the National Natural Science Foundation of China (Grant Nos. 11701256, 11871258); the Youth Backbone Teacher Foundation of Henan's University (No. 2019GGJS196); the China Scholarship Council (Grant No. 201908410132); and also in part by a Discovery Grant from the Natural Sciences and Engineering Research Council of Canada (Grant No. RGPIN 2017-03903).
Let G be a finite abelian group and S be a sequence with elements of G. We say that S is a regular sequence over G if |S_H| ≤ |H| - 1 holds for every proper subgroup H of G, where S_H denotes the subsequence of S consisting o...
The surrounding world is changing. Uncertainty is the only certain thing projected. Western civilization is moving from the orderly and stable Modernity of the 20th century to the chaotic and unstable reality of the 21st c...