基于近端策略优化算法的车载边缘计算网络频谱资源分配  被引量:1

Spectrum Resource Allocation of Vehicle Edge Computing Network Based on Proximal Policy Optimization Algorithm

在线阅读下载全文

作  者:赵佳楠 胡晓辉[1] 杜欣欣 ZHAO Jianan;HU Xiaohui;DU Xinxin(Department of Electronics&Information Engineering,Lanzhou Jiaotong University,Lanzhou,Gansu 730070,China)

机构地区:[1]兰州交通大学电子与信息工程学院,甘肃兰州730070

出  处:《数据与计算发展前沿》2022年第4期142-155,共14页Frontiers of Data & Computing

基  金:国家科学自然基金(11461038)。

摘  要:【目的】在车载网络边缘计算中,合理地分配频谱资源对改善车辆通讯质量具有重要意义。频谱资源稀缺是影响车辆通讯质量的重要原因之一,车辆的高移动性以及在基站处准确收集信道状态信息的困难给频谱资源分配带来了挑战性。【方法】针对以上问题,优化目标设定为车对车(Vehicle-to-Vehicle,V2V)链路传输速率和车对基础设施(Vehicle-to-Infrastructure,V2I)容量大小,提出一种基于近端策略优化(Proximal Policy Optimization,PPO)强化学习算法的多智能体频谱资源动态分配方案。【结果】面对多个V2V链路共享V2I链路所占用的频谱资源从而缓解频谱稀缺问题。这一问题被进一步制定为马尔可夫决策过程(Markov Decision Process,MDP),并对状态、动作和奖励进行了设计,以优化频谱分配策略。【结论】仿真结果表明,在信道传输速率和车辆信息传递成功率方面,所提出的基于PPO算法的优化方案与基线算法相比具有更优的效果。[Objective]In the edge computing of vehicles,a reasonable allocation of spectrum resources is of great significance to improving the quality of vehicle communication.The scarcity of spectrum resources is a crucial issue that affects the quality of vehicle communication.The high mobility of vehicles and the difficulty of accurately collecting channel state information at the base station are challenging for spectrum resource allocation.[Methods]In view of the above problems,the optimization goal is set to the transmission rate of the vehicle-to-vehicle(V2V)link and the capacity of the vehicle-to-infrastructure(V2I)link.This paper proposed a optimization based on the Proximal Policy Optimization(PPO)reinforcement learning algorithm for multi-agent dynamic allocation of spectrum resources.[Results]Multiple V2V links sharing the spectrum resources occupied by V2I links can alleviate the problem of spectrum scarcity.Thus,this problem is further formulated as a Markov Decision Process,and the state,action,and reward are designed to optimize the spectrum allocation strategy.[Conclusions]The simulation results show that,compared with the baseline algorithm,the optimization scheme based on the PPO algorithm proposed in this paper has better performance in terms of channel transmission rate and vehicle information transmission success rate.

关 键 词:车载网络边缘计算 频谱分配 马尔可夫决策过程 近端策略优化 

分 类 号:U463.6[机械工程—车辆工程] TN929.5[交通运输工程—载运工具运用工程] TP18[交通运输工程—道路与铁道工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象