检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:余飞 陈乾 刘峻源 YU Fei;CHEN Qian;LIU Junyuan(Guang'an Vocational&Technical College,Guang'an Sichuan 638000,China;Guang'an Public Security Bureau Cyber SecurityDefense Detachmen,Guang'an Sichuan 638000,China)
机构地区:[1]广安职业技术学院,四川广安638000 [2]广安市公安局网络安全保卫支队,四川广安638000
出 处:《信息与电脑》2022年第2期64-66,共3页Information & Computer
摘 要:为了降低恶意网页脚本检测存在的漏检和误检问题,提高检测的查准率和查全率,笔者提出基于机器学习的恶意网页脚本检测方法。首先,采用网络爬虫爬取网站中存在的正常网络脚本和恶意网络脚本信息,作为机器学习算法迭代训练样本数据集;其次,引入N-gram特征模型提取网页脚本潜在特征;最后,使用机器学习中的神经网络算法迭代训练样本数据集,当网络训练误差达到最小时,在网络输入层输入恶意网页脚本,通过迭代训练得到恶意网页脚本检测结果。实验结果表明,当N-gram特征的N=4且样本特征数量值取700时,利用研究方法检测恶意网页脚本的漏检率和误检率低于1%,查全率和查准率均高于95%。In order to reduce the missed detection and false detection of malicious web scripts, and improve the accuracy and recall rate of detection, a malicious web script detection method based on machine learning is proposed. Firstly, the normal network script and malicious network script information in the website are crawled by the web crawler as the iterative training sample data set of the machine learning algorithm;Secondly, n-gram feature model is introduced to extract the potential features of Web script;Finally, the neural network algorithm in machine learning is used to iterate the training sample data set. When the network training error reaches the minimum, the malicious web page script is input in the network input layer, and the detection results of malicious web page script are obtained through iterative training. The experimental results show that when the n-gram feature n = 4 and the number of sample features is 700, the missed detection rate and false detection rate of malicious web script detected by the research method are less than 1%, and the recall rate and precision rate are higher than 95%.
分 类 号:TP393.081[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7