检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:席耀一[1] 林琛[1] 李弼程[1] 周杰[1] 许旭阳[1]
出 处:《计算机应用》2011年第1期93-96,共4页journal of Computer Applications
基 金:国家863计划项目(2007AA01Z439)
摘 要:现有的话题追踪方法大多面向新闻数据,将其应用于论坛时效果不够理想。结合论坛的特点,提出一种基于语义相似度的论坛话题追踪方法。该方法首先通过构建话题和帖子的关键词表建立其文本表示模型,然后利用知网计算两个关键词表的语义相似度并以此作为帖子与话题的相关程度,最后根据相关程度实现论坛话题追踪。该方法较好地避免了向量空间模型的缺陷。实验表明,该方法能比较有效地解决面向论坛的话题追踪问题。To study the BBS topic tracking, the paper discovered that most of the traditional methods of topic tracking deal with news reports, and they are not suitable when they are applied to BBS. The paper utilized the characteristics of BBS and presented a topic tracking method for BBS data based on semantic similarity. This method firstly constructed keywords tables of topic and post as their representation models, and then computed the two tables' semantic similarity with the help of HowNet which is served as correlation degree between post and topic. Finally, this method used the correlation degree to realize BBS-oriented topic tracking. This method effectively avoids the disadvantage of Vector Space Model ( VSM). The experimental results show that this method can solve the problem of BBS-oriented topic tracking effectively.
关 键 词:话题追踪 论坛 关键词 语义相似度 向量空间模型
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.38