基于集合覆盖的分布式信息检索资源选择  被引量:1

Resource Selection in Distributed Information Retrieval Based on Set-covering

在线阅读下载全文

作  者:王秀红[1] 

机构地区:[1]江苏大学科技信息研究所,镇江212013

出  处:《计算机工程》2010年第4期36-38,共3页Computer Engineering

基  金:江苏大学博士生创新基金资助项目(CX08B_18x)

摘  要:考虑到不同的数据资源(数据集)之间存在的覆盖问题,基于集合覆盖理论,针对提问Q的检索结果在融合排序后位置的不同,对其赋以不同的权值,用来计算该项检索结果对其所在的数据集的贡献。若检索结果在先选的数据集中出现过,则不再计入后选的数据集得分内。通过加权求和得到待选数据集的得分,从而确定资源选择的先后顺序。由此优选出的资源集合可用于检索与问题Q同类或类似的提问Q’,缩短由于数据库之间的覆盖而重复检索的时间。Considering overlapping extent between resources, a set-covering-based algorithm for resource selection in Distributed Information Retrieval(DIR) is proposed. Different document with different weight according to its position in merged results for question Q is given. Only results that have not appeared in some earlier selected resource are focused on in later selected resources. The score of each resource is decided by the total weights of those merged results included in, and only the resource with max score is selected in each selecting step. The selecting order is the actual rank of selected resources which are used to answer the question Q, which is similar to question Q. The approach makes time cost decreased in DIR.

关 键 词:分布式信息检索 集合选择 资源选择 集合覆盖 

分 类 号:TP301.6[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象