基于D-S证据理论的XML文档潜在信息获取算法被引量：3

XML document latent information extraction algorithm based on D-S evidence theory

出　　处：《计算机应用研究》2013年第4期1187-1190,共4页Application Research of Computers

基　　金：国家"973"计划资助项目(2011CB311801);河南省科技创新人才计划资助项目(114200510001)

摘　　要：传统的XML文档检索方法主要是基于关键词匹配的检索,忽略了关键词的语义信息和蕴涵于信息组合中的潜在信息。针对上述问题,提出了基于D-S证据理论的XML文档潜在信息的获取算法。该算法通过引入本体定义了概念间的语义关系和信息的组合方式,提出了基于D-S证据理论的检索模型和指标权重的计算方法,并结合似然函数设计了一个动态的阈值,有效地消除语义匹配过程中存在的不确定性,解决了信息组合中潜在信息的获取问题。此外,还将该算法应用于电子政务领域个人和企业敏感信息的检测中,实验证明了该算法比传统的方法有着更高的查准率和查全率。Traditional XML document retrieval methods are mainly based on keywords＇＇ match,which ignore Key words＇ semantics and latent information contained in information combination.This paper proposed an algorithm of XML document latent information extraction based on D-S evidence theory.Firstly it used ontology to define the relationships between semantic concepts and the combination mode,and next proposed a retrieval model based on D-S evidence theory.Then it presented the computation of evidence weight,and finally designed a dynamic threshold with plausible function.It solved the problems of uncertainty in semantic match and retrieve of latent information.Furthermore,it presented the algorithm＇s application in the detection of personal and enterprises＇ sensitive information in e-government domain.The experiment proves that the proposed algorithm has higher precision and recall.

关键词：D-S证据理论可扩展标记语言潜在信息本体动态阈值

分类号：TP393.08[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于D-S证据理论的XML文档潜在信息获取算法被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于D-S证据理论的XML文档潜在信息获取算法 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于D-S证据理论的XML文档潜在信息获取算法被引量：3