Transitivity-preserving Skylines for Partially Ordered Domains (偏序域上的传递保持Skyline计算)

Cited by: 1


Authors: Yang Jing (杨婧)[1], Zhang Yanchun (张彦春)[2], Yu Yonghong (余永红)[3], Jiang Haixin (江海新)[4]

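Affiliations: [1] Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; [2] School of Computer Science, Fudan University, Shanghai 200433; [3] Tongda College, Nanjing University of Posts and Telecommunications, Nanjing 210003; [4] Research Center on Fictitious Economy and Data Science, University of Chinese Academy of Sciences, Beijing 100190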

Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), 2014, No. 12, pp. 2727-2733 (7 pages)

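Funding: Supported by the 51st batch of the China Postdoctoral Science Foundation General Program (2012M510594) and the National Natural Science Foundation of China Youth Program (61303049)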

Abstract: The skyline of a set P of multi-dimensional points (tuples) consists of those points in P for which no clearly better point in P exists, using component-wise comparison on the domains of interest. The guiding idea is to prune large data sets to a more manageable size while ensuring that points of interest are preserved. However, when domains are only partially ordered, the skyline can easily be nearly as large as the original set (or at least of the same order of magnitude), since most of the time points are incomparable in at least one dimension. To obtain a smaller, more useful skyline set that better reflects actual user preferences, we propose a richer notion of dominance based on two assumptions: that preference specifications are often incomplete, and that actual preferences are transitive. Experiments on both real and synthetic data sets show that our new skyline notion scales well and is highly accurate in terms of user expectations.

Keywords: partially ordered domains; transitivity preservation; skyline computation

CLC number: TP311 [Automation and Computer Technology: Computer Software and Theory]
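
The skyline definition in the abstract, and the effect of assuming that preferences are transitive, can be illustrated with a minimal Python sketch. This is not the paper's algorithm or its richer dominance notion, which the abstract does not specify. It only shows classical component-wise dominance over partially ordered domains, with and without the transitive closure of the stated preferences. The function names, the airline/price data, and the encoding of preferences as sets of (better, worse) pairs are all hypothetical.

```python
from itertools import product


def transitive_closure(pairs):
    """Return every (a, b) implied by chaining stated 'a is better than b' pairs."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow inside the loop.
        for (a, b), (c, d) in product(list(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return closure


def dominates(p, q, better):
    """p dominates q: equal or better in every dimension, strictly better in one.

    better[i] is the set of (x, y) pairs meaning x is preferred to y in dimension i.
    Values that are neither equal nor related are treated as incomparable.
    """
    strictly_better = False
    for i, (x, y) in enumerate(zip(p, q)):
        if x == y:
            continue
        if (x, y) in better[i]:
            strictly_better = True
        else:
            return False  # worse or incomparable in dimension i
    return strictly_better


def skyline(points, better):
    """Keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p, better) for q in points)]


# Hypothetical data: dimension 0 is an airline, dimension 1 is a price band.
stated = [
    {("A", "B"), ("B", "C")},             # A preferred to B, B preferred to C
    {("low", "mid"), ("mid", "high")},    # low preferred to mid, mid to high
]
closed = [transitive_closure(s) for s in stated]
points = [("A", "mid"), ("C", "mid"), ("B", "high")]

print(skyline(points, stated))  # [('A', 'mid'), ('C', 'mid')]
print(skyline(points, closed))  # [('A', 'mid')]
```

With only the stated pairs, airlines A and C are incomparable, so ("C", "mid") stays in the skyline. Once the implied preference of A over C is added by the closure, ("A", "mid") dominates it and the skyline shrinks, which is the kind of reduction the abstract motivates.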

 
