Plagiarism Detection Method and Supporting System for Java Source Code and Bytecode  Cited by: 6

Method and its system of Java source and byte code plagiarism detection

Authors: Li Hu (李虎)[1], Liu Chao (刘超)[1], Liu Nan (刘楠)[1], Li Xiaoli (李晓丽)[1]

Affiliation: [1] School of Computer Science, Beijing University of Aeronautics and Astronautics, Beijing 100191

Source: Journal of Beijing University of Aeronautics and Astronautics (《北京航空航天大学学报》), 2010, No. 4, pp. 424-428 (5 pages)

Funding: National Natural Science Foundation of China (60573057)

Abstract: This paper proposes a plagiarism detection method that applies to both Java source code and bytecode, and implements a supporting system. The method takes the Java file or class file of a class as the unit of comparison, extracts from it five kinds of feature vectors that represent the syntactic and semantic features of the program, and combines them to compute the similarity between two class files. The similarity helps judge whether one class file has been wholly or partly plagiarized from the other. The tool built on this technique analyzes plagiarism in Java source code or class files, and its statistical analysis and graphical visualizations aid in interpreting the results. The evaluation comprises an experimental comparison with a typical commercial source code plagiarism detection tool and a case study that applies the tool to a set of manually modified programs. The results show that the tool is more efficient in this comparison and has high detection performance, and that the method recognizes both exact and approximate copies, including most types of code transformations applied to Java source files in program plagiarism.
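The combination step described in the abstract can be illustrated with a minimal sketch. The abstract does not name the five feature vectors, the per-feature similarity measure, or the way the scores are combined, so everything below is an assumption made for illustration: the feature names are hypothetical, cosine similarity stands in for the per-feature measure, and a weighted average (equal weights by default) stands in for the combination.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length numeric vectors (assumed measure)."""
    if len(u) != len(v):
        raise ValueError("feature vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # Two all-zero vectors are treated as identical, otherwise as unrelated.
        return 1.0 if norm_u == norm_v else 0.0
    return dot / (norm_u * norm_v)


def class_similarity(features_a, features_b, weights=None):
    """Weighted average of per-feature similarities between two classes.

    features_a / features_b map a feature name to a numeric vector.
    The weighting scheme is an assumption, not taken from the paper.
    """
    names = sorted(features_a)
    if sorted(features_b) != names:
        raise ValueError("both classes must provide the same features")
    if weights is None:
        weights = {name: 1.0 for name in names}
    total = sum(weights[name] for name in names)
    score = sum(
        weights[name] * cosine(features_a[name], features_b[name])
        for name in names
    )
    return score / total


if __name__ == "__main__":
    # Hypothetical feature names and values; the paper's actual features differ.
    original = {
        "opcode_histogram": [12, 4, 0, 7],
        "method_signatures": [3, 1, 2],
        "field_types": [2, 2, 0],
        "control_structures": [5, 1, 1],
        "invoked_apis": [4, 0, 6],
    }
    suspect = {
        "opcode_histogram": [11, 4, 1, 7],
        "method_signatures": [3, 1, 2],
        "field_types": [2, 2, 0],
        "control_structures": [5, 2, 1],
        "invoked_apis": [4, 0, 5],
    }
    print(round(class_similarity(original, suspect), 3))
```

A score close to 1 would flag the pair of classes for manual review. Choosing the threshold and the weights is left open here, since the abstract gives no values for either.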

Keywords: plagiarism detection; Java source code; Java bytecode; similarity measure

Classification: TP311.5 [Automation and Computer Technology: Computer Software and Theory]

 
