基于行为树的多星轨道追逃博弈方法

Multi-Satellite Orbital Pursuit-Evasion Game Method Based on Behavior Trees

作　　者：苏浩季明江郭鹏宇曹璐 SU Hao;JI Mingjiang;GUO Pengyu;CAO Lu(National Innovation Institute of Defense Technology,Academy of Military Sciences,Beijing 100071,China;Intelligent Game and Decision Laboratory,Beijing 100000,China)

机构地区：[1]军事科学院国防科技创新研究院,北京100071 [2]智能博弈与决策实验室,北京100000

出　　处：《智能安全》2024年第3期82-91,共10页Artificial Intelligence Security

基　　金：国家自然科学基金资助项目(11972373,52005506)。

摘　　要：多智能体强化学习是解决空间追逃博弈问题的一类有效方法,但在多星追逃博弈场景下存在复杂性高、训练时间长、难以收敛等问题。本文提出一种基于行为树的多星轨道追逃博弈方法,将对多个目标的复杂追逃博弈问题分解为对单一目标的追逃博弈问题。利用行为树构建多星追逃任务分配与博弈决策框架,以最大化追击成功概率为目标建立最优任务分配模型,并利用遗传算法进行求解,实现多星追逃任务快速分解;对于分配的追击任务,各卫星自主选择多智能体深度确定性策略梯度算法训练得到的博弈策略开展博弈决策。结果表明,本文所提方法能将多星轨道博弈任务有效分解,并在行为树的驱动下成功完成对目标的追击。Multi-agent reinforcement learning is an effective approach for solving spatial pursuit-evasion games.In multi-star pursuit-evasion scenarios,however,there are challenges such as long training time and difficulty in convergence.This paper proposes a multi-star orbital pursuit-evasion method based on behavior trees,decomposing the complex pursuit-evasion problem involving multiple targets into individual pursuit-evasion problems for each target.By utilizing behavior trees to construct the framework for task allocation and game decision-making in multi-star pursuit-evasion scenarios,the optimal task allocation model is established with the objective of maximizing the probability of successful pursuit.Genetic algorithms are employed for solving,enabling rapid decomposition of multi-star pursuit-evasion tasks.For the allocated pursuit tasks,each satellite autonomously selects game strategies obtained through training with the Multi-Agent Deep Deterministic Policy Gradient algorithm.The results demonstrate that the proposed method effectively decomposes multi-star orbital game tasks and successfully achieves target pursuit under the guidance of behavior trees.

关键词：多星轨道追逃博弈行为树任务分配多智能体强化学习

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于行为树的多星轨道追逃博弈方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于行为树的多星轨道追逃博弈方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索