KWSDS:关系数据库中Top-k关键词搜索系统  被引量:2

KWSDS: A Top-k Keyword Search System in Relational Databases

在线阅读下载全文

作  者:唐明珠[1] 杨艳[1] 郭雪泉[1] 沈中辉[2] 钟颖莉[1] 

机构地区:[1]黑龙江大学计算机科学技术学院,哈尔滨150080 [2]黑龙江大学科学技术处,哈尔滨150080

出  处:《计算机研究与发展》2012年第10期2251-2259,共9页Journal of Computer Research and Development

基  金:国家自然科学基金项目(60973081);黑龙江省自然科学基金项目(F201011);黑龙江省教育厅科学技术研究基金项目(11551352;12511401);黑龙江大学青年科学基金项目(QL201029)

摘  要:关系数据库中的关键词搜索技术已经成为信息检索领域的研究热点,它为没有任何SQL语法知识的用户提供了一个简单友好的接口.但是现存的关键词搜索系统主要依赖于数据图或模式图,而单独使用数据图或模式图的算法搜索效率不高,结果准确率也较低.设计实现了一个Top-k关键词搜索系统(keyword search system based on database graph and schema graph,KWSDS),用户提交关键词后,系统对关键词进行预处理,消除一些脏关键词.首次提出使用数据图与模式图相结合的方法,设计了同表查询算法和异表查询算法,分析了算法的正确性和时间复杂度,并且提出了相关性结果排序方法.KWSDS系统的搜索算法运行时间短,搜索结果准确性高,具有良好的查询性能.最后通过实验验证了KWSDS的效率.Keyword search technology over relational databases has become one of hot topics in the field of information retrieval. It can provide the users with little SQL knowledge a simple and friendly interface. But the algorithms of some existing keyword search systems are mainly based on database graph or schema graph. However, the efficiency of them which use database graph or schema graph separately is low. The accurate rate of results is also not high. This paper devises and implements a top-k keyword search system KWSDS (keyword search system based on database graph and schema graph). After users input the keywords, it can eliminate some dirty keywords through pre- processing. The method of combining database graph and schema graph together to solve the problem of keyword search is proposed for the first time. This paper also devises the search algorithms between same table and different tables, prove the correctness of the algorithms and analyze time complexity of them. A sorting method based on relevance is proposed at the same time. The algorithms of KWSDS system run in shorter time than the existing algorithms, the results are output by KWSDS system with high accuracy. The system has excellent query performance. Finally, KWSDS is verified by abundant experiments.

关 键 词:关系数据库 信息检索 关键词搜索 数据图 模式图 

分 类 号:TP311.13[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象