A Dynamic Spectrum Access Method Based on Explainable Deep Reinforcement Learning

Authors: GENG Kai (耿凯); ZHANG Jianzhao (张建照); YAO Changhua (姚昌华) (School of Electronic & Information Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China; The 63rd Research Institute, National University of Defense Technology, Nanjing 210007, China)

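Affiliations: [1] School of Electronic & Information Engineering, Nanjing University of Information Science & Technology, Nanjing 210044; [2] The 63rd Research Institute, National University of Defense Technology, Nanjing 210007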

Source: Telecommunication Engineering (《电讯技术》), 2024, No. 12, pp. 1981-1989 (9 pages)

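Funding: National Natural Science Foundation of China (62131005, 62231012, 61971439, U22B2002); Basic Research Innovation Fund (Stable Support) of the National Key Laboratory of Communication Anti-Jamming (IFN20230207).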

Abstract: To address the limited performance and poor explainability of reinforcement-learning-based dynamic spectrum access models, a dynamic spectrum access method based on weight analysis is proposed. A reservoir computing (RC) network replaces the conventional Deep Q-Learning Network (DQN) to simplify the network structure and improve computational efficiency. A weight-analysis explainability method is also introduced: it generates heat maps that reflect the neural network's cognition of, and preference for, different channels, thereby improving the explainability of the model. Simulation results show that, in a multi-user environment, the proposed algorithm significantly outperforms traditional reinforcement learning algorithms such as Q-Learning on key metrics including average success rate, average collision rate, and average reward. Compared with the DQN+MLP algorithm, it converges faster while achieving comparable performance on the same metrics, with an average success rate of 0.8 and an average collision rate close to 0.
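The two ideas named in the abstract, a reservoir network standing in for the DQN and a weight-based heat map over channels, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the class `ReservoirQNetwork`, its hyperparameters, the echo-state-style leaky update, the toy environment in which one channel is always free, and in particular the heat-map proxy `|W_out| @ |W_in|`, which ignores the recurrent weights and the nonlinearity. The paper's actual network and weight-analysis procedure may differ.

```python
import numpy as np


class ReservoirQNetwork:
    """Hypothetical echo-state-style Q estimator: fixed random reservoir, trained linear readout."""

    def __init__(self, n_channels, n_reservoir=64, spectral_radius=0.9, leak=0.3, seed=0):
        rng = np.random.default_rng(seed)
        self.leak = leak
        # Input and recurrent weights are random and never trained.
        self.w_in = rng.uniform(-0.5, 0.5, size=(n_reservoir, n_channels))
        w = rng.uniform(-0.5, 0.5, size=(n_reservoir, n_reservoir))
        # Rescale so the largest eigenvalue magnitude equals spectral_radius.
        w *= spectral_radius / np.max(np.abs(np.linalg.eigvals(w)))
        self.w_res = w
        # Readout rows: action 0 = stay idle, action k = access channel k - 1.
        self.w_out = np.zeros((n_channels + 1, n_reservoir))
        self.state = np.zeros(n_reservoir)

    def step(self, observation):
        """Advance the reservoir by one observation; return (Q-values, feature copy)."""
        pre = self.w_in @ observation + self.w_res @ self.state
        self.state = (1 - self.leak) * self.state + self.leak * np.tanh(pre)
        return self.w_out @ self.state, self.state.copy()

    def update(self, features, action, td_target, lr=0.01):
        """Semi-gradient Q-learning step; only the readout row of `action` changes."""
        q = self.w_out[action] @ features
        self.w_out[action] += lr * (td_target - q) * features

    def channel_heatmap(self):
        """Crude weight-analysis proxy: an (actions x channels) map of |W_out| @ |W_in|."""
        return np.abs(self.w_out) @ np.abs(self.w_in)


# Toy usage (assumed environment): channel index 2 is always free, the others always busy.
net = ReservoirQNetwork(n_channels=4)
rng = np.random.default_rng(1)
q, feat = net.step(np.zeros(4))
for _ in range(200):
    # Epsilon-greedy action selection with epsilon = 0.1.
    action = int(np.argmax(q)) if rng.random() > 0.1 else int(rng.integers(5))
    # Reward: +1 for accessing the free channel, 0 for idling, -1 for a collision.
    reward = 1.0 if action == 3 else (0.0 if action == 0 else -1.0)
    next_q, next_feat = net.step(np.array([0.0, 0.0, 1.0, 0.0]))
    net.update(feat, action, reward + 0.9 * np.max(next_q))
    q, feat = next_q, next_feat

heatmap = net.channel_heatmap()  # rows: actions, columns: channels
```

Because only the linear readout is trained, each update touches a single weight row, which is one plausible reading of the simpler structure and lower computational cost the abstract attributes to the RC network. The `heatmap` array can be rendered with any plotting library to give a picture of the kind the abstract describes.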

Keywords: dynamic spectrum access; explainable artificial intelligence; reservoir computing; deep reinforcement learning

Classification: TN925 [Electronics and Telecommunications: Communication and Information Systems]
