Study on the Training Method of Bilateral Group Game Strategy Based on Reinforcement Learning and Enhanced Rule Base

Authors: CHEN Zhuo, ZHOU Yihua, PU Junsong, ZHANG Xiushan [1], ZHANG Dian [2]

Affiliations: [1] School of Electronic Engineering, Naval University of Engineering, Wuhan 430033, Hubei, China; [2] School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, Hubei, China

Source: Software Guide (《软件导刊》), 2025, No. 1, pp. 15-20 (6 pages)

Funding: Natural Science Foundation of Hubei Province (2022CFB012).

Abstract: Traditional reinforcement learning methods generalize poorly and often perform badly when applied directly to specific tasks, especially in adversarial two-sided game scenarios, where the situation is more complex. To address this problem, this paper proposes a reinforcement learning-based strategy training method for two-sided games and, building on it, a two-sided group game strategy training method based on reinforcement learning and rule-base enhancement. Experiments show that the method significantly improves the agents' behavioral decision-making, with the total reward obtained by the agents approaching 14.5. In a simulated capture task, the agents' behavioral decisions were effectively optimized and improved. In addition, configuring different rule bases increases the uncertainty of the simulated environment and better reflects the complexity of real-world environments.

Keywords: reinforcement learning; rule base; group game strategy; agent decision-making

CLC number: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

 
