基于随机森林的物联网设备流量分类算法  被引量:8

Traffic classification algorithm of Internet of things devices based on random forest

在线阅读下载全文

作  者:李锐光 段鹏宇[1] 沈蒙 祝烈煌[1] LI Ruiguang;DUAN Pengyu;SHEN Meng;ZHU Liehuang(School of Cyberspace Science and Technology,Beijing Institute of Technology,Beijing 100081,China;National Computer Network Emergency Response Technical Team/Coordination Center of China,Beijing 100029,China)

机构地区:[1]北京理工大学网络空间安全学院,北京100081 [2]国家计算机网络应急技术处理协调中心,北京100029

出  处:《北京航空航天大学学报》2022年第2期233-239,共7页Journal of Beijing University of Aeronautics and Astronautics

摘  要:物联网(IoT)设备流量分类对网络资产管理有重要意义,基于流量统计的分类技术是当前研究热点。已有算法主要基于流信息建立特征向量,而对数据包信息利用较少。改进了基于随机森林的物联网设备流量分类算法,基于流信息和流数据包信息共同建立特征向量。实验结果表明:所提算法与其他算法相比,所提算法的平均分类准确率由56%提高到82%,平均召回率由47%提高到67%,平均F_(1)得分由0.43提高到0.74,混淆矩阵对比也有明显提升,因此具备更好的分类效果。The traffic classification of Internet of things(IoT)devices is very important to the management of cyberspace assets.The classification technology based on statistical identification is a hot spot in current academic research.The previous algorithms were mainly based on the flow information to set up the feature vectors,but lesson the packet information.In this paper,we improve the traffic classification algorithm of IoT devices based on random forest.We set up the feature vectors with both the flow information and the flow's packet information.The experimental results show that,compared with previous algorithms,the classification accuracy of the proposed algorithm increases from 56%to 82%,the recall rate improves from 47%to 67%,the F_(1) score increases from 0.43 to 0.74,and the confusion matrix correlation is also significantly improved.As a result,the proposed algorithm has a better classification effect than previous ones.

关 键 词:物联网(IoT) 流量分类算法 随机森林 特征向量 流信息 数据包信息 

分 类 号:TP393[自动化与计算机技术—计算机应用技术] TN919[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象