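Authors: Yang Jing (杨婧)[1], Zhang Yanchun (张彦春)[2], Yu Yonghong (余永红)[3], Jiang Haixin (江海新)[4]
Affiliations: [1] Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; [2] School of Computer Science, Fudan University, Shanghai 200433; [3] Tongda College, Nanjing University of Posts and Telecommunications, Nanjing 210003; [4] Research Center on Fictitious Economy and Data Science, University of Chinese Academy of Sciences, Beijing 100190
Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), 2014, No. 12, pp. 2727-2733 (7 pages)
Funding: Supported by the 51st batch of the China Postdoctoral Science Foundation General Program (2012M510594) and by the National Natural Science Foundation of China, Young Scientists Fund (61303049)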
Abstract: The skyline of a set P of multi-dimensional points (tuples) consists of those points in P for which no clearly better point in P exists, using component-wise comparison on the domains of interest. The guiding idea is to prune large data sets to a more manageable size while ensuring that points of interest are preserved. However, when domains are only partially ordered, the skyline can easily be nearly as large as the original set (or at least of the same order of magnitude), because most of the time points are incomparable in at least one dimension. To obtain a smaller, more useful skyline set that better reflects actual user preferences, we propose a richer, more general notion of dominance based on two assumptions: that preference specifications are often incomplete, and that actual preferences are transitive. Experiments on both real and synthetic data sets show that our new skyline notion scales well and is highly accurate in terms of user expectations.
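To make the baseline notion concrete, the following is a minimal Python sketch of a skyline over partially ordered domains. It is not the algorithm proposed in the paper, whose richer dominance relation is not given in the abstract. It only illustrates the standard definition and how completing incomplete preferences by transitivity can shrink the skyline. All names (`transitive_closure`, `dominates`, `skyline`) and the sample data are hypothetical, and each dimension's partial order is assumed to be given as a set of (better, worse) pairs.

```python
from itertools import product


def transitive_closure(pairs):
    """Return the transitive closure of a set of (better, worse) pairs."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow inside the loop.
        for (a, b), (c, d) in product(tuple(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return closure


def dominates(p, q, prefs):
    """True if p is equal or better than q in every dimension and
    strictly better in at least one. prefs[i] is the set of
    (better, worse) pairs assumed for dimension i."""
    strictly_better = False
    for i, (x, y) in enumerate(zip(p, q)):
        if x == y:
            continue
        if (x, y) in prefs[i]:
            strictly_better = True
        else:
            return False  # p is worse than, or incomparable to, q here
    return strictly_better


def skyline(points, prefs):
    """Return the points that no other point dominates."""
    return [p for p in points
            if not any(dominates(q, p, prefs) for q in points)]


# Hypothetical data: (brand, color) tuples with incomplete preferences.
points = [("A", "red"), ("C", "red"), ("B", "blue")]
stated = [{("A", "B"), ("B", "C")}, {("red", "blue")}]
closed = [transitive_closure(dim) for dim in stated]

raw_skyline = skyline(points, stated)
closed_skyline = skyline(points, closed)
```

With only the stated pairs, brands A and C are incomparable, so `("C", "red")` stays in the skyline. Once A over C is inferred by transitivity, `("A", "red")` dominates it and the skyline shrinks to a single point.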
Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]