基于ET-PPO的双变跳频图案智能决策  

Intelligent anti-jamming decision algorithm of bivariate frequency hopping pattern based on ET-PPO

在线阅读下载全文

作  者:陈一波 赵知劲[1] CHEN Yibo;ZHAO Zhijin(School of Communication Engineering,Hangzhou Dianzi University,Hangzhou 310018,China)

机构地区:[1]杭州电子科技大学通信工程学院,浙江杭州310018

出  处:《电信科学》2022年第11期86-95,共10页Telecommunications Science

基  金:国家自然科学基金资助项目(No.U19B2016)。

摘  要:为进一步提高双变跳频系统在复杂电磁环境中的抗干扰能力,提出了一种基于资格迹的近端策略优化(proximal policy optimization with eligibility traces,ET-PPO)算法。在传统跳频图案的基础上,引入时变参数,通过状态-动作-奖励三元组的构造将“双变”跳频图案决策问题建模为马尔可夫决策问题。针对PPO算法“行动器”网络样本更新方式的高方差问题,引入加权重要性采样减小方差;采用Beta分布的动作选择策略,增强学习阶段的稳定性。针对“评判器”网络收敛速度慢的问题,引入资格迹方法,较好地平衡了收敛速度和全局最优解求解。在不同电磁干扰环境下的算法对比仿真结果表明,ET-PPO有更好的适应性和稳定性,对抗阻塞干扰和扫频干扰表现较好。In order to further improve its anti-interference ability in complex electromagnetic environment,a PPO algorithm based on weighted importance sampling and eligibility traces(ET-PPO)was proposed.On the basis of the traditional frequency hopping pattern,time-varying parameters were introduced,and the bivariate frequency hopping pattern decision problem was modeled as a Markov decision problem through the construction of the state-action-reward triple.Aiming at the high variance problem of the sample update method of an actor network of the PPO algorithm,weighted importance sampling was introduced to reduce the variance,and the action selection strategy of Beta distribution was used to enhance the stability of the learning stage.Aiming at the problem of slow convergence speed of the evaluator network,the eligibility trace method was introduced,which better balanced the convergence speed and the global optimal solution.The algorithm comparison simulation results in different electromagnetic interference environments show that ET-PPO has better adaptability and stability,and has better performance against obstruction interference and sweep frequency interference.

关 键 词:复杂电磁环境 双变跳频图案 近端策略优化 资格迹 

分 类 号:TN914[电子电信—通信与信息系统] TP181[电子电信—信息与通信工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象