检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:徐浩南 林立岚 蔡霞[1] XU Haonan;LIN Lilan;CAI Xia(School of Computer Science and Technology,Zhejiang Sci-Tech University,Hangzhou 310018,China)
机构地区:[1]浙江理工大学计算机科学与技术学院,浙江杭州310018
出 处:《软件工程》2024年第11期1-5,共5页Software Engineering
摘 要:针对多智能体通过无线传感器网络与目标接收器通信时可能遭遇的信息窃取问题,提出了一种创新的多智能体波束成形方法。该方法旨在通过动态调整智能体的分布及传输信号状态,确保接收器能收到高质量的信号,最大限度地避免被潜在的窃听者窃取信息。首先将联合优化问题定义为部分可观测马尔可夫决策过程(POMDP),其次基于深度强化学习算法解决此优化问题。通过引入集中式训练、分布式执行的框架,智能体可以根据局部观测进行协同决策,从而调整全局通信状态。为了验证所提方法的有效性,基于多智能体粒子环境(MPE)设计了仿真环境,并在多个场景下进行了训练及测试,实验结果验证了该方法的有效性。This research addresses the potential issue of information theft during communication between multiple agents and target receivers through wireless sensor networks.An innovative multi-agent beamforming method is proposed to dynamically adjust the distribution of agents and the transmission signal states,ensuring the receiver receives high-quality signals while minimizing the risk of information being intercepted by potential eavesdroppers.Firstly,the joint optimization problem is defined as a Partially Observable Markov Decision Process(POMDP).Then,the optimization problem is solved using the deep reinforcement learning algorithm.By introducing a centralized training and distributed execution framework,agents can make collaborative decisions based on local observations,adjusting the overall communication state.To verify the effectiveness of the proposed method,a simulation environment based on the Multi-Agent Particle Environment(MPE)is designed,in which training and testing are conducted in multiple scenarios.Experimental results verify the effectiveness of the method.
关 键 词:多智能体系统 波束成形 防窃听通信 深度强化学习
分 类 号:TP301.6[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.13