Hybrid algorithm of similarity detection for assembly language programs  (Cited by: 2)


Authors: SHI Lukui (石陆魁)[1], ZHANG Jun (张军)[1], CHEN Fei (陈飞)[1], LI Jinzhao (李金钊)[1]

Affiliation: [1] School of Computer Science and Software, Hebei University of Technology, Tianjin 300401

Source: Journal of Hebei University of Science and Technology (《河北科技大学学报》), 2011, No. 2, pp. 138-142 (5 pages)

Funding: Tianjin Research Program of Application Foundation and Advanced Technology (10JCZDJC16000)

Abstract: Plagiarism often occurs in programming assignments submitted by students, and similarity detection techniques can help teachers find suspicious submissions. Most such techniques apply the same algorithm to every programming language, which makes the checking algorithms redundant and degrades detection accuracy. This paper presents a hybrid similarity detection algorithm adapted to the characteristics of assembly language, combining attribute counting with structure metrics. The structure information consists of the number of program segments, the number of subroutine definitions and calls, the number of occurrences of the loop instruction, and the number of occurrences of branch (transfer) instructions. The attribute information consists of 73 frequently used keywords. After this information is extracted from the assembly programs, a chi-square test is used to judge the similarity of two programs. Experiments show that the results of the hybrid algorithm are consistent with those of manual checking and are better than those obtained from attribute counting or structure metrics alone.
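The pipeline described in the abstract (count structural and attribute features, then compare two programs with a chi-square statistic) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption rather than the authors' implementation: the abstract does not list the 73 keywords, so `KEYWORDS` is a small hypothetical subset; the MASM-style directives (`segment`, `proc`) and the list of jump mnemonics in `BRANCHES` are guesses at what the structural counts would look for; and the statistic is the standard Pearson chi-square on a 2 x k table of counts, which may differ from the exact form used in the paper.

```python
import re
from collections import Counter

# Hypothetical subset; the paper uses 73 keywords that the abstract does not list.
KEYWORDS = {"mov", "add", "sub", "cmp", "push", "pop", "int", "lea", "inc", "dec"}
# Assumed set of branch (transfer) mnemonics.
BRANCHES = {"jmp", "je", "jne", "jz", "jnz", "ja", "jb", "jg", "jl", "jge", "jle"}
# Assumed mapping from tokens to the structural counts named in the abstract.
STRUCTURE = {"segment": "segment", "proc": "proc_def",
             "call": "proc_call", "loop": "loop"}
STRUCTURE.update({b: "branch" for b in BRANCHES})

TOKEN = re.compile(r"\b[a-z_][a-z0-9_]*\b")


def extract_features(source: str) -> Counter:
    """Count structural and attribute features in one assembly source text."""
    counts = Counter()
    for line in source.lower().splitlines():
        code = line.split(";", 1)[0]  # drop the trailing comment, if any
        for tok in TOKEN.findall(code):
            if tok in STRUCTURE:
                counts["struct:" + STRUCTURE[tok]] += 1
            elif tok in KEYWORDS:
                counts["attr:" + tok] += 1
    return counts


def chi_square(a: Counter, b: Counter):
    """Pearson chi-square statistic and degrees of freedom for two count vectors."""
    total_a, total_b = sum(a.values()), sum(b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("both programs need at least one counted feature")
    grand = total_a + total_b
    keys = [k for k in set(a) | set(b) if a[k] + b[k] > 0]
    stat = 0.0
    for k in keys:
        column = a[k] + b[k]
        for observed, row in ((a[k], total_a), (b[k], total_b)):
            expected = row * column / grand
            stat += (observed - expected) ** 2 / expected
    return stat, len(keys) - 1


if __name__ == "__main__":
    prog_a = """
    data segment
    data ends
    code segment
    start: mov cx, 5      ; loop counter
    next:  add ax, cx
           loop next
           jmp done
    done:  int 21h
    code ends
    """
    prog_b = prog_a.replace("add ax, cx", "push ax\n pop bx")
    fa, fb = extract_features(prog_a), extract_features(prog_b)
    print(chi_square(fa, fa))
    print(chi_square(fa, fb))
```

A smaller statistic means the two feature distributions are closer. Deciding whether a pair counts as "similar" would require comparing the statistic against a chi-square critical value for the returned degrees of freedom; the significance level is not given in the abstract, so the sketch leaves that choice to the caller.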

Keywords: assembly language; similarity detection; plagiarism; attribute counting; structure metrics

CLC number: TP313 [Automation and Computer Technology: Computer Software and Theory]

 
