基于分布式集群的语料库防篡改检索方法  被引量:2

Tamper Proof Retrieval Method for Corpus Based on Distributed Cluster

在线阅读下载全文

作  者:安玉香[1] 李檀[2] AN Yu-xiang;LI Tan(International School,Shenyang Jianzhu University,ShenyangLiaoning 110168,China;Shenyang JianzhuUniversity,Liaoning Shenyang 110168,China)

机构地区:[1]沈阳建筑大学国际学院,辽宁沈阳110168 [2]沈阳建筑大学,辽宁沈阳110168

出  处:《计算机仿真》2021年第9期460-464,共5页Computer Simulation

摘  要:为了保证语料库的安全性,提出基于分布式集群的语料库防篡改检索方法。通过分布式集群运行结构,采用决策树ID3算法分析节点的检测属性,构建语料库行为样本判定数,挖掘语料库浏览行为数据;采用数据关键特征防篡改检索方法,将监控代码植入到语料库浏览行为数据中,对语料库资源数据各特征函数进行加密处理,从而实现资源数据加密及防篡改检索。实验结果表明,上述方法的浏览行为挖掘效率较高,且能够将篡改信息全部检索出来,防篡改检索错误率较低,应用该方法后可较大程度提高语料库的安全性。This paper proposes a tamper proof retrieval method based on distributed cluster for ensuring the security of corpus.According to the operation structure of distributed cluster,ID3 algorithm was introduced to analyze the detection attributes of nodes in detail to construct the decision number of corpus behavior samples,thus mining the behavior data of corpus browsing.The key features of data tamper resistant retrieval method were applied.Meanwhile,the feature functions of the corpus resource data were encrypted.Finally,the encryption and tamper proof retrieval of the resource data was achieved.The experimental results show that this method has high efficiency of browsing behavior mining,low tamper proof retrieval error rate,and good application prospect in the field of improving the security performance of corpus.

关 键 词:分布式集群 语料库 防篡改 检索引擎 监控代码 

分 类 号:TP302[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象