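Authors: Liu Zhaoqing [1], Fu Yuchen [1], Ling Xinghong [1], Xiong Xiangyun [1]
Affiliation: [1] School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China
Source: Journal of Computer Applications (《计算机应用》), 2013, No. 1, pp. 189-191, 198 (4 pages)
Funding: National Natural Science Foundation of China (61070122)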
Abstract: The trawling algorithm has several problems: it discovers too many Web communities, pages are repeated at a high rate across communities, and its strict definition of a community produces isolated communities. To address these problems, a blog community detection algorithm based on Formal Concept Analysis (FCA) is proposed. First, a concept lattice is constructed from the link relations between blogs. Next, the original lattice is partitioned into equivalence classes through algebraic reduction of the lattice. Finally, within each partition, the structural similarity between the extents and intents of concepts is measured, and community cores are merged to form communities. Experimental results show that community cores with a network density greater than 40% account for 83.420% of all cores in the test data set, that the network diameter of the merged communities is 3, and that the content of the communities becomes richer. The proposed algorithm can be applied effectively to community detection in blogs, microblogs, and other social networks, and has significant application value and practical significance.
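Keywords: blog community; community detection; formal concept analysis; link analysis; social network
Classification: TP391.3 [Automation and Computer Technology: Computer Application Technology]; TP393.094 [Automation and Computer Technology: Computer Science and Technology]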
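
Illustrative note: the abstract's first step, building formal concepts from blog link relations, can be sketched in a few lines of Python. This is a minimal sketch on a toy context, not the authors' implementation. The blog names, the `formal_concepts` helper, and the Jaccard-based `concept_similarity` are all assumptions introduced for illustration, since the record does not state how the paper measures similarity.

```python
def formal_concepts(links):
    """Enumerate formal concepts of a link context.

    links: dict mapping each blog (object) to the set of blogs it links to
    (attributes). Returns a list of (extent, intent) frozenset pairs.
    """
    attributes = set().union(*links.values()) if links else set()
    # Every intent is the full attribute set or an intersection of object rows.
    intents = {frozenset(attributes)}
    for row in links.values():
        row = frozenset(row)
        intents |= {row & intent for intent in intents}
    concepts = []
    for intent in intents:
        # The extent is every blog whose out-links contain the whole intent.
        extent = frozenset(b for b, out in links.items() if intent <= out)
        concepts.append((extent, intent))
    return concepts


def jaccard(s, t):
    """Jaccard similarity of two sets (1.0 when both are empty)."""
    union = s | t
    return len(s & t) / len(union) if union else 1.0


def concept_similarity(c1, c2):
    """Assumed stand-in measure: mean Jaccard similarity of extents and intents."""
    return 0.5 * (jaccard(c1[0], c2[0]) + jaccard(c1[1], c2[1]))


# Hypothetical toy data: blogs a, b, c linking to blogs x, y, z.
links = {
    "a": {"x", "y"},
    "b": {"x", "y", "z"},
    "c": {"y", "z"},
}
concepts = formal_concepts(links)
for extent, intent in sorted(concepts, key=lambda c: len(c[0])):
    print(sorted(extent), sorted(intent))
```

Each printed pair is one candidate community core: a group of blogs (the extent) that all link to the same set of blogs (the intent). A merging step in the spirit of the abstract would compare such pairs with a similarity measure like the assumed `concept_similarity` and merge those above a chosen threshold.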