Research on a Deep Web Integration System Based on Ontology and Bayesian Networks


Authors: Zhu Guojin (朱国进)[1], Huang Qiqi (黄琪琪)

Affiliation: [1] School of Computer Science and Technology, Donghua University, Shanghai 201620

Source: Intelligent Computer and Applications (《智能计算机与应用》), 2018, No. 1, pp. 6-13 (8 pages)

Abstract: The Deep Web refers to content hidden in back-end databases that cannot be retrieved simply through search engines or Web crawlers, yet this content is often rich in information and data. An effective way to obtain this information is to build a Deep Web integration framework. Because the query interface is the only access point to a Deep Web source, the key to a Deep Web integration system is constructing an integrated query interface. The goal of this research is to represent Deep Web interface information by automatically building a domain-specific ontology, so that Deep Web interfaces in that domain can be identified automatically, indexed, and used to extract the resources held in the underlying databases. The whole process runs without human intervention. The proposed method extracts Deep Web interface information and derives the domain ontology fully automatically, then uses an ontology-based Bayesian network to identify new Deep Web interfaces and match them. Within a given domain, a new method automatically extracts attributes from Deep Web interfaces and uses WordNet to build an ontology semantic tree; the resulting domain semantic ontology tree is combined with a Bayesian network to perform domain classification, and after classification the schemas of the query interfaces are matched against the integrated interface. The method is validated by comparing its classification and schema-matching results with those obtained from a semantic tree built from manually extracted attributes, which demonstrates its usability and applicability.
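The classification step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the structure of the ontology-based Bayesian network, so this example substitutes a naive Bayes classifier, the simplest Bayesian network, in which attribute labels are treated as independent given the domain. The training data, the domain names, and the function names `train` and `classify` are all hypothetical and are not taken from the paper; the WordNet-based semantic tree is omitted entirely.

```python
import math
from collections import Counter, defaultdict


def train(interfaces):
    """Count domains and attribute labels.

    interfaces: list of (domain, [attribute labels]) pairs, where each
    pair stands for one query interface with already-extracted attributes.
    """
    domain_counts = Counter()
    attr_counts = defaultdict(Counter)
    vocab = set()
    for domain, attrs in interfaces:
        domain_counts[domain] += 1
        for attr in attrs:
            attr_counts[domain][attr.lower()] += 1
            vocab.add(attr.lower())
    return domain_counts, attr_counts, vocab


def classify(model, attrs):
    """Return the domain with the highest posterior log-probability."""
    domain_counts, attr_counts, vocab = model
    total = sum(domain_counts.values())
    best, best_score = None, -math.inf
    for domain, n in domain_counts.items():
        score = math.log(n / total)  # log prior P(domain)
        denom = sum(attr_counts[domain].values()) + len(vocab)
        for attr in attrs:
            # Laplace-smoothed log likelihood P(attr | domain)
            count = attr_counts[domain][attr.lower()]
            score += math.log((count + 1) / denom)
        if score > best_score:
            best, best_score = domain, score
    return best


# Hypothetical training interfaces (assumed data, not from the paper).
TRAINING = [
    ("books", ["title", "author", "isbn", "publisher"]),
    ("books", ["title", "author", "keyword"]),
    ("flights", ["from", "to", "departure date", "passengers"]),
    ("flights", ["origin", "destination", "departure date"]),
]

model = train(TRAINING)
predicted = classify(model, ["Author", "ISBN"])
print(predicted)
```

In the paper's pipeline the attribute labels would first be mapped onto nodes of the WordNet-derived semantic tree, so that synonyms such as "origin" and "from" count as the same evidence; this sketch matches labels only by exact lowercase string.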

Keywords: Deep Web; query interface; integration system; attribute extraction; semantic ontology tree; Bayesian network

Classification code: TP393.092 [Automation and Computer Technology: Computer Application Technology]

 
