检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]大连海事大学信息科学技术学院,辽宁大连116026
出 处:《大连海事大学学报》2008年第4期52-54,58,共4页Journal of Dalian Maritime University
基 金:国家自然科学基金资助项目(60473135;60773084;J0724003;60603023);教育部博士点基金资助项目(20070151009)
摘 要:为解决单个帖子线索的多话题性问题,识别聚类中的孤立点,提出一种基于模糊聚类的网络论坛(BBS)热点话题挖掘算法.采用模糊聚类进行话题识别,使得一个帖子线索可以隶属于多个话题,而对于隶属度远小于类内平均隶属度的帖子线索,则当作孤立点来处理.此外,还给出了一种面向BBS文本的特征表示方法,并结合隶属度给出基于模糊划分的话题热度评分公式.实验结果验证了该算法的有效性.A bulletin board system(BBS) hot topic mining algorithm based on fuzzy clustering was developed to solve the problem of the post thread with multiple topics and identifying the outliers in clustering. Fuzzy clustering was used to make one post thread belonging to many topics and the post thread whose membership degree being far less than the in-class average membership degree was treated as the outlier. Moreover, a kind of feature representation for BBS texts was given, and the formula to evaluate the hotness of topics based on fuzzy partition was also given in consideration of the membership degree. Experimental results verify the efficiency of the proposed algorithm.
关 键 词:网络论坛(BBS) 热点话题挖掘 模糊聚类 帖子线索
分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.62