An Intelligent Routing Technology Based on Deep Reinforcement Learning

Cited by: 6

Authors: SUN Peng-hao; LAN Ju-long[1]; SHEN Juan[1]; HU Yu-xiang[1]

Affiliation: [1] PLA Strategic Support Force Information Engineering University, Zhengzhou, Henan 450002, China

Source: Acta Electronica Sinica, 2020, No. 11, pp. 2170-2177 (8 pages)

Funding: National Natural Science Foundation of China (No.61521003, No.61702547, No.61872382); National Key R&D Program of China (No.2017YFB0803204); Guangdong Province Key-Area R&D Program (No.2018B010113001).

Abstract: As networks grow in scale and complexity, traditional routing algorithms struggle to balance computational complexity against performance when the spatial-temporal distribution of network traffic fluctuates sharply. In recent years, with the rise of Software-Defined Networking (SDN) and Artificial Intelligence (AI), automatic generation of routing strategies based on machine learning has attracted growing attention. This paper proposes SmartPath, an intelligent routing technology based on Deep Reinforcement Learning (DRL). SmartPath dynamically collects network state and uses DRL to generate routing policies automatically, so that the routing policy adapts to changes in network traffic. Experimental results show that the proposed scheme updates network routing dynamically without relying on manual traffic modeling and, in the test environment, reduces the average end-to-end transmission delay by at least 10% compared with current state-of-the-art schemes.

Keywords: routing optimization; software-defined networking; artificial intelligence; deep reinforcement learning

Classification: TP393 [Automation and Computer Technology: Computer Application Technology]

 
