检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]河北大学数学与计算机学院,河北保定071002 [2]河北大学网络中心,河北保定071002
出 处:《小型微型计算机系统》2013年第3期471-474,共4页Journal of Chinese Computer Systems
基 金:国家自然科学基金项目(60873203)资助
摘 要:主流的热点追踪算法都采用文本聚类技术来实现,在处理海量网页时,很难精准聚合中心结果,离需要的热点差距太远.现有的网络舆情系统监控的范围受限于使用者给出的关键词,使系统无法监测使用者未知的突发事件.针对网络舆情发生和传播特点,改善舆情信息采集策略;网络舆情的相关页面标题文字主题鲜明,据此提出自动挖掘热点关键词并根据关键词进行话题聚类的方法;根据新闻、论坛和博客的不同特点分别设计网络舆情热点分析模型.在此基础上,设计并实现了一个网络舆情监测系统.系统实际运行表明,该方案可以及时发掘热点话题并对突发事件实时追踪监测.The main algorithms for hotspot tracking adopt the Text Clustering technology.When dealing with mass web pages,it is difficult to cluster the expected hotspot.Clustering causes huge central bias.The w orking range of current Internet public opinion monitoring and w arning system is limited by the keyw ords given by the user,thus causing the system not to detect those unexpected events.Based on the characteristics of Internet public opinion occurrence and its spreading,the information acquisition strategy is improved;according to the distinct themes of the titles of the related Internet public opinion's w eb texts,to pursue hotspot keyw ords automatically and to conduct topic clustering based on keyw ord is proposed;based on the different features of new s,forums and blogs,hotspot analysis models of Internet public opinion are designed respectively;On this basis,an internet public opinion monitoring system is designed and implemented.The running tests show that this scheme is capable of finding hot topics timely and tracking realtime emergent events.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.133.94.34