基于密度聚类算法改进的语义主路径分析方法研究  

Research on Improving the Semantic Main Path Analysis Method by Leveraging the Density Peak Clustering Algorithm

在线阅读下载全文

作  者:陈亮[1] 余池 尚玮姣[2] 许海云 吕世炅[1] 陈利利 Chen Liang;Yu Chi;Shang Weijiao;Xu Haiyun;Lyu Shijiong;Chen Lili(Institute of Scientific and Technical Information of China,Beijing 100038;Research Institute of Forestry Policy and Information,Chinese Academy of Forestry,Beijing 100091;Business School,Shandong University of Technology,Zibo 255000)

机构地区:[1]中国科学技术信息研究所,北京100038 [2]中国林业科学研究院林业科技信息研究所,北京100091 [3]山东理工大学管理学院,淄博255000

出  处:《情报学报》2024年第3期287-301,共15页Journal of the China Society for Scientific and Technical Information

基  金:中央级公益性科研院所基本科研业务项目“开源科技情报智能分析系统与应用场景建设”(ZD2023-13);国家电网公司总部科技项目“全球煤油气电耦合下我国能源安全风险识别与战略路径优化技术研究”(1400-202357341A-1-1-ZN)。

摘  要:语义主路径分析方法在改进传统主路径分析法中主路径内容单一、主题一致性较差等不足的同时,留下了两个缺陷,即所选主路径在语义空间的位置可能偏离主题聚簇中心、不同主路径的主题区分度并不明显。本文在语义主路径分析方法的基础上,提出一种逐步优化的主路径选择方法,即将主题聚簇密度和路径遍历权重进行叠加形成复合密度,通过调节复合密度中两个要素的比重来优化主题聚簇中心的定位,当聚簇中心的位置变化收敛后,将位于不同主题聚簇中心的路径作为结果输出。将本文方法分别用于电动汽车锂离子电池专利引文网络和材料科学领域高影响力论文引文网络,实验结果显示,本文方法所产生的多条主路径不仅在主题聚簇中的布局更加合理,而且选取不当主路径的可能性也大大降低,从而验证了本文方法的有效性。The semantic main path analysis(sMPA)method overcomes the shortcomings of the traditional main path analysis(MPA)method,such as a single main path and low theme consistency.However,it also leaves two defects:the position of the selected main path in the semantic space may deviate from the cluster center,and the topic discrimination of different main paths is not obvious.To address this problem,this study proposes a gradually optimized main path selection method in which topic cluster density and path traversal weight are superimposed to form a composite density,and the location of the topic cluster center is optimized by adjusting the proportion of the two elements in the composite density.When the cluster center converges,the paths located in different topic cluster centers are outputted.This method is verified by applying it to the patent citation network of lithium-ion batteries for electric vehicles and the citation network of highimpact papers in the field of materials science.The experimental results show that not only is the layout of multiple main paths generated by the new method but the possibility of selecting improper main paths is also significantly reduced.

关 键 词:语义主路径分析 主题一致性 主题聚类 材料科学 电动汽车锂离子电池 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象