Learning in games:a systematic review  

在线阅读下载全文

作  者:Rong-Jun QIN Yang YU 

机构地区:[1]National Key Laboratory for Novel Software Technology,Nanjing University,Nanjing 210023,China [2]Polixir Technologies,Nanjing 210023,China

出  处:《Science China(Information Sciences)》2024年第7期129-155,共27页中国科学(信息科学)(英文版)

基  金:supported by National Key Research and Development Program of China(Grant No.2020AAA0107200);National Natural Science Foundation of China(Grant No.61921006).

摘  要:Game theory studies the mathematical models for self-interested individuals.Nash equilibrium is arguably the most central solution in game theory.While finding the Nash equilibrium in general is known as polynomial parity arguments on directed graphs(PPAD)-complete,learning in games provides an alternative to approximate Nash equilibrium,which iteratively updates the player’s strategy through interactions with other players.Rules and models have been developed for learning in games,such as fictitious play and no-regret learning.Particularly,with recent advances in online learning and deep reinforcement learning,techniques from these fields greatly boost the breakthroughs in learning in games from theory to application.As a result,we have witnessed many superhuman game AI systems.The techniques used in these systems evolve from conventional search and learning to purely reinforcement learning(RL)-style learning methods,gradually getting rid of the domain knowledge.In this article,we systematically review the above techniques,discuss the trend of basic learning rules towards a unified framework,and recap applications in large games.Finally,we discuss some future directions and make the prospect of future game AI systems.We hope this article will give some insights into designing novel approaches.

关 键 词:non-cooperative games learning in games no-regret learning reinforcement learning superhuman AI 

分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象