基于深度强化学习的对手建模方法研究综述  被引量:4

Research Progress of Opponent Modeling Based on Deep Reinforcement Learning

在线阅读下载全文

作  者:徐浩添 秦龙 曾俊杰 胡越 张琪 Xu Haotian;Qin Long;Zeng Junjie;Hu Yue;Zhang Qi(College of Systems Engineering,National University of Denfense Technology,Changsha 410073,China)

机构地区:[1]国防科技大学系统工程学院,湖南长沙410073

出  处:《系统仿真学报》2023年第4期671-694,共24页Journal of System Simulation

基  金:国家自然科学基金(61273300,62102432,62103420);国家社科基金军事学(2020-SKJJ-C-102);湖南省自然科学基金(2021JJ40697,2021JJ40702)。

摘  要:深度强化学习是一种兼具深度学习特征提取能力和强化学习序列决策能力的智能体建模方法,能够弥补传统对手建模方法存在的非平稳性适应差、特征选取复杂、状态空间表示能力不足等问题。将基于深度强化学习的对手建模方法分为显式建模和隐式建模两类,按照类别梳理相应的理论、模型、算法,以及适用场景;介绍基于深度强化学习的对手建模技术在不同领域的应用情况;总结亟需解决的关键问题以及发展方向,为基于深度强化学习的对手建模方法提供较全面的研究综述。Deep reinforcement learning is an agent modeling method with both deep learning feature extraction ability and reinforcement learning sequence decision-making ability,which can make up for the depleted non-stationary adaptation,complex feature selection and insufficient state-space representation ability of traditional opponent modeling.The deep reinforcement learning-based opponent modeling methods are divided into two categories,explicit modeling and implicit modeling,and the corresponding theories,models,algorithms and applicable scenarios are sorted out according to the categories.The applications of deep reinforcement learning-based opponent modeling techniques on different fields are introduced.The key problems and future development are summarized to provide a comprehensive research review for the deep reinforcement learning-based opponent modeling methods.

关 键 词:深度强化学习 对手建模 博弈论 心智理论 表征学习 元学习 

分 类 号:TP391.9[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象