基于模糊聚类的网络论坛热点话题挖掘  被引量:20

BBS hot topic mining algorithm based on fuzzy clustering

在线阅读下载全文

作  者:鲁明羽[1] 姚晓娜[1] 魏善岭[1] 

机构地区:[1]大连海事大学信息科学技术学院,辽宁大连116026

出  处:《大连海事大学学报》2008年第4期52-54,58,共4页Journal of Dalian Maritime University

基  金:国家自然科学基金资助项目(60473135;60773084;J0724003;60603023);教育部博士点基金资助项目(20070151009)

摘  要:为解决单个帖子线索的多话题性问题,识别聚类中的孤立点,提出一种基于模糊聚类的网络论坛(BBS)热点话题挖掘算法.采用模糊聚类进行话题识别,使得一个帖子线索可以隶属于多个话题,而对于隶属度远小于类内平均隶属度的帖子线索,则当作孤立点来处理.此外,还给出了一种面向BBS文本的特征表示方法,并结合隶属度给出基于模糊划分的话题热度评分公式.实验结果验证了该算法的有效性.A bulletin board system(BBS) hot topic mining algorithm based on fuzzy clustering was developed to solve the problem of the post thread with multiple topics and identifying the outliers in clustering. Fuzzy clustering was used to make one post thread belonging to many topics and the post thread whose membership degree being far less than the in-class average membership degree was treated as the outlier. Moreover, a kind of feature representation for BBS texts was given, and the formula to evaluate the hotness of topics based on fuzzy partition was also given in consideration of the membership degree. Experimental results verify the efficiency of the proposed algorithm.

关 键 词:网络论坛(BBS) 热点话题挖掘 模糊聚类 帖子线索 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象