检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:姚昌华 万中妨 张建照 李家强[1] 陈金立[1] YAO Changhua;WAN Zhongfang;ZHANG Jianzhao;LI Jiaqiang;CHEN Jinli(School of Electronic&Information Engineering,Nanjing University of Information Science&Technology,Nanjing 210044,China;The 63rd Research Institute,National University of Defense Technology,Nanjing 210007,China)
机构地区:[1]南京信息工程大学电子与信息工程学院,南京210044 [2]国防科技大学第六十三研究所,南京210007
出 处:《电讯技术》2024年第9期1353-1360,共8页Telecommunication Engineering
基 金:国家自然科学基金资助项目(61971439,U22B2002);通信抗干扰全国重点实验室基础科研创新基金(稳定支持)项目(IFN20230207)。
摘 要:针对无人机集群对多个异构电磁目标进行协同干扰时面临的多智能体协同学习决策问题,提出了基于动态联盟的无人机集群协同干扰方法。运用马尔可夫决策对协同干扰进行建模,构建干扰联盟博弈模型,优化集群干扰中的频率和功率联合决策以及干扰资源的动态调配,实现对多个用频模式不同、压制功率需求不同的电磁目标进行干扰的系统效能优化。首先,利用协同强化学习算法对任务目标的信道决策进行学习,联盟内无人机协同干扰;然后,根据干扰效果进行干扰联盟结构的动态调整。所提算法可从强化学习进程中根据干扰效能的差异动态调整干扰资源,并能适应目标功率的变化,相同场景下,系统干扰效能值较基线算法提升约12.5%。For the multi-agent collaborative and learning decision-making problem encountered by unmanned aerial vehicle(UAV)cluster when coordinating jamming against multiple heterogeneous electromagnetic targets,a collaborative jamming method based on dynamic alliances is proposed.The method employs Markov decision processes to model coordinated jamming and constructs an jamming coalition game model.It optimizes joint decisions on frequency and power in cluster jamming and dynamically allocates jamming resources,in order to optimize system performance in interfering with multiple electromagnetic targets with different frequency modes and suppression power requirements.Initially,reinforcement learning algorithms are utilized to learn the channel decisions of task objectives,enabling UAVs within the alliance to collaboratively select channels.Subsequently,adjustments to the alliance structure among alliances are made based on jamming effects.The proposed algorithm can dynamically adjust jamming resources based on differences in jamming effectiveness through the reinforcement learning process and adapt to changes in target power.In the same scenario,the system jamming effectiveness value is improved by approximately 12.5% compared with that of baseline algorithms.
分 类 号:TN929.5[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.63