检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]南京邮电大学计算机学院,南京210003 [2]南京邮电大学自动化学院,南京210003
出 处:《计算机科学》2017年第B11期338-341,361,共5页Computer Science
摘 要:随着计算机技术的发展和普及,计算机病毒带来的危害日趋严重。传统N-Gram算法难以提取不同长度的特征,导致有效特征缺失,并产生庞大的特征集合,造成空间的浪费。针对这些问题,提出一种改进的基于N-Gram的特征码自动提取方法。该方法在原有N-Gram特征提取算法的基础上引入变长N-Gram特征,提取不同长度的有效特征,生成不定长病毒特征码。综合考虑特征频率的相关性,利用特征浓度对N-Gram特征进行有向筛选,生成数据字典,节省存储空间。实验结果表明,与单纯使用定长N-Gram的算法相比,该方法能有效降低特征码自动提取的误报率。Wi th the rapid development of computer technology, security threats brought by computer virus have become more and more serious. The tradit ional N-Gram algorithm is difficult to capture bytes of dif ferent length,leading to the lack of effective signature and the geheration of huge signature sets, and creating a waste of storage space. Instead of using f ixed-length N-Gram feature that the tradit ional way dose, an improved computer virus signature automatic ex-tract ion algorithm based on variable-length N-Gram was proposed to solve these problems. It extracts the effective sig-nature to generate variable-length virus signature. Taking the correlation of signature frequency into account, the algo-ri thm uses signature concentration to extract the N-Gram feature of malware samples and generates a data dictionary to save the storage space. In the experiment results, compared with the tradit ional algorithm which uses f ixed-length N- Gram feature, the proposed method can effectively decrease the false rate of signature extraction.
分 类 号:TP309.5[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28