Heterogeneous data sharing technology based on two-layer metadata and ontology (cited by: 4)


Authors: LI Xiaotao (李小涛)[1], HU Xiaohui (胡晓惠)[1], LI Binquan (李斌全)[1]

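Affiliation: [1] School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China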

Source: Journal of Beijing University of Aeronautics and Astronautics (《北京航空航天大学学报》), 2015, No. 8, pp. 1476-1484 (9 pages)

Funding: National Natural Science Foundation of China (61273350)

Abstract: To address the difficulty of sharing multi-source, multi-class, heterogeneous data simultaneously, an information sharing technology based on two-layer metadata combined with ontology is proposed. First, the structure of the two-layer metadata is analyzed, and the way it provides a uniform description of multi-class heterogeneous data is introduced. Second, because metadata lacks semantic information and cannot describe the implicit relationships between data classes, an ontology layer is built on top of the metadata to describe the metadata semantically and to support ontology reasoning. Finally, for data retrieval, the Lucene full-text search engine is combined with the SPARQL (SPARQL Protocol and RDF Query Language) ontology query language: a SPARQL retrieval step is added to the keyword query process and performed before the Lucene keyword query, which improves the recall rate and optimizes retrieval time. Soccer match data from the 2014-2015 UEFA Champions League season were selected as test data. The experimental results demonstrate the effectiveness of the method for sharing heterogeneous data and the improvement in metadata query performance, in terms of both recall and timeliness.
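The retrieval idea in the abstract, consulting the ontology before running the keyword query, can be illustrated with a minimal sketch. It is not the authors' implementation: plain Python dictionaries stand in for both the SPARQL query and the Lucene index, and the class hierarchy (`SUBCLASS_OF`), the records (`RECORDS`), and all function names are hypothetical. The sketch only shows why expanding a query term with its ontology subclasses can raise recall.

```python
from collections import defaultdict

# Hypothetical ontology fragment: child class -> parent class (rdfs:subClassOf).
SUBCLASS_OF = {
    "Forward": "Player",
    "Goalkeeper": "Player",
    "Player": "Person",
    "Coach": "Person",
}

# Hypothetical metadata records: record id -> descriptive text.
RECORDS = {
    "r1": "Forward scored twice in the group stage",
    "r2": "Goalkeeper saved a penalty in the final",
    "r3": "Coach announced the squad",
    "r4": "Player transfer list published",
}


def expand_with_subclasses(term, subclass_of):
    """Return the term plus every class that is transitively a subclass of it.

    Stand-in for a SPARQL query over rdfs:subClassOf (an assumption here).
    """
    expanded = {term}
    changed = True
    while changed:
        changed = False
        for child, parent in subclass_of.items():
            if parent in expanded and child not in expanded:
                expanded.add(child)
                changed = True
    return expanded


def build_index(records):
    """Build a lowercase inverted index: token -> set of record ids.

    Stand-in for a full-text index such as Lucene (an assumption here).
    """
    index = defaultdict(set)
    for record_id, text in records.items():
        for token in text.lower().split():
            index[token].add(record_id)
    return index


def search(term, index, subclass_of=None):
    """Keyword lookup; if an ontology is given, expand the term first."""
    terms = expand_with_subclasses(term, subclass_of) if subclass_of else {term}
    hits = set()
    for t in terms:
        hits |= index.get(t.lower(), set())
    return sorted(hits)


INDEX = build_index(RECORDS)
plain_hits = search("Player", INDEX)                   # keyword match only
expanded_hits = search("Player", INDEX, SUBCLASS_OF)   # ontology step first
```

In this toy data set, the plain keyword query for "Player" matches only the record that contains that word, while the expanded query also returns the records that mention "Forward" and "Goalkeeper", because the hypothetical ontology declares them subclasses of "Player".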

Keywords: heterogeneous data; metadata; ontology; information sharing; semantic retrieval

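Classification: V219 [Aerospace Science and Technology: Aerospace Propulsion Theory and Engineering]; TP393 [Automation and Computer Technology: Computer Application Technology]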

 
