检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:范青蓝[1] 汪林[1] 刘亚清[2] 白惇 鲁明羽[2] Qinglan Fan;Lin Wang;Yaqing Liu;Dun Bai;Mingyu Lu(Research Institute of Highway Ministry of Transport,Beijing 100088,China;School of Information Science & Technology,Dalian Maritime University,Dalian 116026,China)
机构地区:[1]交通运输部公路科学研究院,北京100088 [2]大连海事大学信息科学技术学院,辽宁大连116026
出 处:《信息工程期刊(中英文版)》2015年第6期173-176,共4页Scientific Journal of Information Engineering
基 金:受“面向ITS体系框架的交通运输数据资源规划研究”支持资助.
摘 要:主题数据库规划一直是信息资源规划领域研究的重点,而实体聚合算法是影响主题数据库规划质量的关键。但是现有的计算实体聚合毖方法很容易陷入聚簇偏置,影响了规划质量。针对这一问题,作者首先计算实体对的亲和毖,然后将实体对的亲和关系看作网页之间的链接关系,使用PageRaxtk算法对实体对重要性排序,进而使用K—means算法迭代来聚合实体。实验结果表明本文提出的方法能够避免聚簇偏置,进而改善了主题数据库规划质量。Subject database planning is always the emphasis of information resource planning. Algorithm for entities aggregation has heavy impact on the quality of subject database planning. However, the existing approaches to entities aggregation computation are inclined to fall into cluster offset, which does great harm to the quality of subject database planning. Against the problem, we firstly calculate the degree of aggregation between entities. Secondly, we view the relations of aggregation as the relations of links between web pages. We apply PageRank algorithm to sort all entity pairs by importance. At last, we exploit K-means algorithm to aggregate entities iteratively. The results of experiments show that our approach avoids cluster offset and improves the quality of subject database planning.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222