检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Rong-Jun QIN Yang YU
机构地区:[1]National Key Laboratory for Novel Software Technology,Nanjing University,Nanjing 210023,China [2]Polixir Technologies,Nanjing 210023,China
出 处:《Science China(Information Sciences)》2024年第7期129-155,共27页中国科学(信息科学)(英文版)
基 金:supported by National Key Research and Development Program of China(Grant No.2020AAA0107200);National Natural Science Foundation of China(Grant No.61921006).
摘 要:Game theory studies the mathematical models for self-interested individuals.Nash equilibrium is arguably the most central solution in game theory.While finding the Nash equilibrium in general is known as polynomial parity arguments on directed graphs(PPAD)-complete,learning in games provides an alternative to approximate Nash equilibrium,which iteratively updates the player’s strategy through interactions with other players.Rules and models have been developed for learning in games,such as fictitious play and no-regret learning.Particularly,with recent advances in online learning and deep reinforcement learning,techniques from these fields greatly boost the breakthroughs in learning in games from theory to application.As a result,we have witnessed many superhuman game AI systems.The techniques used in these systems evolve from conventional search and learning to purely reinforcement learning(RL)-style learning methods,gradually getting rid of the domain knowledge.In this article,we systematically review the above techniques,discuss the trend of basic learning rules towards a unified framework,and recap applications in large games.Finally,we discuss some future directions and make the prospect of future game AI systems.We hope this article will give some insights into designing novel approaches.
关 键 词:non-cooperative games learning in games no-regret learning reinforcement learning superhuman AI
分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7