基于DRL的飞行自组网自适应多模式路由算法  被引量:1

Adaptive Multi-Mode Routing Algorithm for FANET Based on Deep Reinforcement Learning

在线阅读下载全文

作  者:黄凯 邱修林 殷俊[2] 杨余旺[1] HUANG Kai;QIU Xiulin;YIN Jun;YANG Yuwang(School of Computer Science and Engineering,Nanjing University of Science and Technology,Nanjing 210094,China;School of Computer Science and Technology,Nanjing University of Posts and Telecommunications,Nanjing 210003,China)

机构地区:[1]南京理工大学计算机科学与工程学院,南京210094 [2]南京邮电大学计算机科学与技术学院,南京210003

出  处:《计算机工程与应用》2023年第14期268-274,共7页Computer Engineering and Applications

基  金:国家自然科学基金(61973161,61991404);江苏省教育厅未来网络科研基金(FNSRFP-2021-YB-05)。

摘  要:针对传统飞行自组网协议自适应能力不强、大规模网络应用场景效果不佳的问题,提出了一种基于深度强化学习的多模式路由算法。该算法综合利用系统吞吐量、分组递交率和平均端到端时延等参数构建价值函数,通过智能体自动调节各个无人机的路由工作模式,将大型网络分解为主体网络和数个与之相连的小型异构网络,降低了系统复杂度,局部性能达到最优,提升了整个网络的性能。使用NS3仿真平台测试了算法和传统协议AODV、DSDV的性能指标。仿真结果表明,算法显著优于传统协议,且网络规模越大、负载越高则优势越明显,平均吞吐量提升了55.46%,分组递交率提升了39.85%,平均端到端时延降低了60.94%。Aiming at the problems of weak adaptability of traditional flying ad hoc network protocols and poor effect in large-scale network application scenarios,a multi-mode routing algorithm based on deep reinforcement learning is proposed.The algorithm constructs the value function by comprehensively using the parameters such as system throughput,packet delivery rate and average end-to-end delay.The agent automatically adjusts the routing mode of each UAV,decomposes the large network into the main network and several small heterogeneous networks connected with it,reduces the system complexity,optimizes the local performance,and improves the performance of the whole network.The agent automatically adjusts the routing mode of each UAV,decomposes the large network into the main network and several small heterogeneous networks connected with it,reduces the system complexity,optimizes the local performance,and improves the performance of the whole network.Simulation results show that the algorithm is significantly better than the traditional protocol,and the larger the network scale and the higher the load,the more obvious the advantage is.The average throughput is increased by 55.46%,the packet delivery rate is increased by 39.85%,and the average end-to-end delay is reduced by 60.94%.

关 键 词:飞行自组网 深度强化学习 自适应路由算法 混合路由 

分 类 号:TN929.52[电子电信—通信与信息系统] TP391[电子电信—信息与通信工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象