Authors: ZHAO Jianan (赵佳楠); HU Xiaohui (胡晓辉)[1]; DU Xinxin (杜欣欣). Department of Electronics & Information Engineering, Lanzhou Jiaotong University, Lanzhou, Gansu 730070, China
Affiliation: [1] School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou, Gansu 730070, China
Source: Frontiers of Data & Computing (《数据与计算发展前沿》), 2022, No. 4, pp. 142-155 (14 pages)
Funding: National Natural Science Foundation of China (11461038).
Abstract: [Objective] In vehicular network edge computing, allocating spectrum resources sensibly is important for improving the quality of vehicle communication. Spectrum scarcity is one of the main factors that degrade this quality, and the high mobility of vehicles, together with the difficulty of accurately collecting channel state information at the base station, makes spectrum allocation challenging. [Methods] To address these problems, the optimization objectives are set to the transmission rate of the vehicle-to-vehicle (V2V) links and the capacity of the vehicle-to-infrastructure (V2I) links, and a multi-agent dynamic spectrum allocation scheme based on the Proximal Policy Optimization (PPO) reinforcement learning algorithm is proposed. [Results] Multiple V2V links share the spectrum resources occupied by V2I links, which alleviates spectrum scarcity. The problem is further formulated as a Markov Decision Process (MDP), and the state, action, and reward are designed to optimize the spectrum allocation policy. [Conclusions] Simulation results show that, compared with the baseline algorithm, the proposed PPO-based scheme performs better in terms of channel transmission rate and vehicle information transmission success rate.
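The abstract names PPO but does not state the objective the agents optimize. As a minimal sketch, assuming the standard clipped surrogate objective from the general PPO literature (not confirmed as the authors' exact formulation), the per-batch quantity can be computed as below. The function name `ppo_clip_objective`, its parameters, and the default `clip_eps=0.2` are illustrative assumptions and do not come from the paper.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch (to be maximized).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy. All names are hypothetical.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the new and old policy.
        ratio = math.exp(lp_new - lp_old)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic bound: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

In a multi-agent setting such as the one described, each V2V agent would presumably evaluate a quantity of this kind on its own sampled transitions; how states, actions, and rewards are defined is left to the paper itself.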
Keywords: vehicular network edge computing; spectrum allocation; Markov decision process; proximal policy optimization
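Classification: U463.6 [Mechanical Engineering: Vehicle Engineering]; TN929.5 [Transportation Engineering: Vehicle Operation Engineering]; TP18 [Transportation Engineering: Road and Railway Engineering]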