物联网流量无冗余特征集的筛选算法  

A Selection Algorithm for Redundancy-free Feature Sets inInternet of Things Traffic

在线阅读下载全文

作  者:赵志远 李鹏[1,2] 胡素君[1] ZHAO Zhi-yuan;LI Peng;HU Su-jun(School of Computer Science,Nanjing University of Posts and Telecommunications,Nanjing 210023,China;Institute of Network Security and Trusted Computing,Nanjing University of Posts and Telecommunications,Nanjing 210023,China)

机构地区:[1]南京邮电大学计算机学院,江苏南京210023 [2]南京邮电大学网络安全与可信计算研究所,江苏南京210023

出  处:《计算机技术与发展》2024年第12期48-56,共9页Computer Technology and Development

基  金:国家自然科学基金(61872196,61872194,61902196);江苏省科技支撑计划项目(BE2019740,BK20200753,20KJB520001);江苏省高等学校自然科学研究重大项目(18KJA520008);江苏省六大人才高峰高层次人才项目(RJFW-111)。

摘  要:目前物联网流量异常检测研究存在忽视特征筛选重要性的问题,筛选出一个无冗余的特征集有助于异常检测模型的训练与精简化。为了高效地提取物联网流量数据集中的无冗余特征集,文章提出了一种基于差分进化算法的两步走特征筛选算法。该算法首先使用基于线性相关系数和最大信息系数的双过滤器对数据集进行过滤式特征筛选,得到初筛结果特征集,再在此数据集基础上使用文章提出的一种包裹式特征筛选算法——DEWFS(Wrapped Feature Selection based on Differential Evolution),用极限学习机作为模型,经过预先定义的迭代次数,最终得到保留原始特征集异常检测性能的无冗余特征集。DEWFS算法基于差分进化算法,但对其初始化与中间迭代步骤进行了相应优化,使之能够适应流量特征筛选领域的优化任务。实验结果证明,该两步走算法能高效地筛选出物联网流量无冗余特征集,显著降低了后续流量异常检测算法的计算时间。Currently,research on anomaly detection in IoT traffic often overlooks the importance of feature selection.Selecting a redundancy-free feature set is crucial for efficiently training and simplifying anomaly detection models.To efficiently extract a redundancy-free feature set from IoT traffic datasets,we propose a two-step feature selection algorithm based on the differential evolution algorithm.Initially,a dual-filter approach utilizing linear correlation coefficients and maximum information coefficients is applied for filter-based feature selection to achieve preliminary screening results.Subsequently,it applies a novel wrapper-based feature selection algorithm—DEWFS(Differential Evolution Wrapped Feature Selection),with extreme learning machine as the model,through predefined iterations,ultimately obtaining a redundancy-free feature set that preserves the original feature set's anomaly detection capabilities.The DEWFS algorithm,while grounded in differential evolution,has been optimized in its initialization and intermediate iterative steps to adapt to the optimization tasks specific to traffic feature selection.Experimental results demonstrate that the proposed two-step algorithm efficiently selects a redundancy-free feature set for IoT traffic,significantly reducing the computational time for subsequent anomaly detection algorithms.

关 键 词:差分进化算法 物联网流量异常检测 特征筛选 最大信息系数 极限学习机 

分 类 号:TP393.08[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象