检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:崔弘[1] 赵双 张广胜 苏金树[2] CUI Hong;ZHAO Shuang;ZHANG Guang-sheng;SU Jin-shu(FiberHome Telecommunication Technologies Co.,Ltd.,Wuhan 430074;College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;Investigation Technology Center of PLA,Beijing 100080,China)
机构地区:[1]烽火通信科技股份有限公司,湖北武汉430074 [2]国防科技大学计算机学院,湖南长沙410073 [3]中央军委政法委员会,北京100080
出 处:《计算机工程与科学》2022年第4期654-664,共11页Computer Engineering & Science
摘 要:随着移动网络的迅速发展,越来越多的用户选择使用代理应用,以保护个人网络隐私,隐藏上网行为或绕开网络活动限制,给网络管理与审计带来了新的挑战。与此同时,恶意攻击者可利用代理应用隐藏身份,使得恶意行为更难以检测和防范。因此,代理应用流量识别对网络管理与安全具有重要的作用,但目前该问题并未得到充分的研究。由于代理应用流量通常经过加密或混淆处理,传统的流量识别技术无法被有效应用。为实现准确、快速的移动代理应用流量识别,提出一组与负载无关的流量特征,并首次加入TCP层option字段用于刻画流量。基于4种机器学习算法训练的分类器和2种流量识别对象,验证提出的特征对识别移动代理应用流量的有效性,并对各类特征的重要性进行分析。实验结果表明,提出的特征能有效识别代理应用流量。在识别流量是否经由代理时,基于随机森林的分类器可达到99%以上的整体准确率。识别流量所属代理应用时,整体准确率高于94%。在公开数据集ISCX VPN-nonVPN上与其他方法相比,提出的方法识别准确率更高,并具有更快的识别速度,适合实时流量识别场景。With the rapid development of mobile networks,more users choose to protect privacy,hide online behavior and bypass the restrictions of networks by using proxy applications.As a result,new challenges are brought to network management and auditing.In addition,malicious attackers can use proxy to hide their identity,making it more difficult to detect and prevent such malicious behavior.Therefore,proxy application traffic identification plays an important role in network management and security,while this issue has not been fully studied at present.Because the proxy application traffic is usually encrypted and obfuscated,the traditional traffic identification methods can not be applied effectively.To achieve accurate and fast traffic identification of mobile proxy applications,a set of side-channel traffic features that are independent of the payload is proposed.The option field in the TCP header is used for the first time to describe the traffic characteristics.Four machine learning algorithms with two kinds of identification objects are utilized to validate the effectiveness and importance of the proposed feature set.The experimental results show that the proposed features can effectively identify proxy application traffic.More than 99%accuracy can be achieved when identifying whether traffic is forwarded by proxy applications based on random forest.Moreover,the average accuracy is higher than 94%when identifying which proxy application the traffic belongs to.Compared with other methods,the proposed method has better accuracy and faster classification speed on the public dataset ISCX VPN-nonVPN.Hence,it is more suitable for real-time traffic identification scenarios.
关 键 词:代理应用流量识别 移动应用 机器学习 流量特征 决策树
分 类 号:TP393[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.30