检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:滕婕 李硕 刘莉[1,2] 胡广伟 TENG Jie;LI Shuo;LIU Li;HU Guangwei(School of Information Management,Nanjing University,Nanjing 210023,China;Government Data Resources Institution of Nanjing University,Nanjing 210023,China)
机构地区:[1]南京大学信息管理学院,江苏南京210023 [2]南京大学政务数据资源研究所,江苏南京210023
出 处:《情报科学》2024年第9期178-191,共14页Information Science
基 金:国家社会科学基金重大项目“大数据驱动的城乡社区服务体系精准化构建研究”(20&ZD154);营销服务渠道效能及渠道协同效能评价体系研究(SGJSYF00YHJS2000144)
摘 要:【目的/意义】针对现有研究中关键词筛选的指标维度较少,且文献主题的演化和排序方法存在单一性问题,本文提出一种基于时间序列聚类和天际线算法的高价值热点主题挖掘方法。【方法/过程】首先,通过RFM模型对关键词进行价值分层,获取具有高价值层次的关键词。基于构建的语义关系网,利用社区发现算法获取初始文献研究主题。接着,对初始主题簇进行二次近邻传播聚类,以揭示不同主题间存在的具有相似发展特征的演化规律;同时,提取主题相对重要性的表征指标,借助天际线算法和主成分分析法实现主题的科学排序。最后,围绕“城乡社区供需服务”主题,检索1998—2022年相关的知网文献,采用本方法开展文本挖掘工作。【结果/结论】本文提出的新方法综合考虑了关键词的时间维度和价值属性,给出了一种综合主题识别、演化和排序的较为系统的主题挖掘方法。通过实验结果的对比分析发现,本方法能够有效地识别高价值热点主题,多维度全面地评估主题热度。【创新/局限】在衡量关键词价值和计算研究主题热度的方法主要针对评价标准和测量指标的优化,此方法未涵盖所有可能影响这些指标的元素。【Purpose/significance】In response to the existing research on literature topic mining,which has fewer index dimensions for keyword screening and the problem of uniqueness in the topic evolution and ranking methods of literature,this paper proposes a highvalue hot topic mining method based on time series clustering and skyline algorithm.【Method/process】Firstly,the paper stratifies the value of keywords through RFM model to obtain keywords with high value levels.Next,the initial thematic clusters are re-clustered to reveal the evolutionary phenomenon of similar developmental characteristics existing among different themes.At the same time,we extract the characterization indexes of the relative importance of topics,and realize the scientific ranking of topics with the help of skyline algorithm and principal component analysis.Finally,the paper processed and mined the data of journal literature related to"urban and rural community supply and demand services"from 1998 to 2022 on China Knowledge Network.【Result/Conclusion】The new method proposed in this paper integrates the temporal dimension and value attributes of keywords,and gives a more systematic topic mining method that integrates topic identification,evolution and ranking.The comparative analysis of the results shows that this method can accurately and quickly identify high-value hot topics and comprehensively evaluate the topic hotness in multiple dimensions.【Innovation/limitation】This research′s calculation methods for measuring keyword value and topic popularity mainly focus on the optimization of evaluation criteria and measurement indicators,and do not cover all possible elements that might affect these indicators.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49