科技文档间非对称关系的双模态度量方法  被引量:1

A double mode measurement method of asymmetric relationship between scientific documents

在线阅读下载全文

作  者:徐建民 王鑫 XU Jianmin;WANG Xin(College of Cyberspace Security and Computer,Hebei University,Baoding 071002,China)

机构地区:[1]河北大学网络空间安全与计算机学院,河北保定071002

出  处:《河北大学学报(自然科学版)》2021年第5期587-598,共12页Journal of Hebei University(Natural Science Edition)

基  金:国家社会科学基金后期资助项目(17FTQ002);河北省自然科学基金资助项目(F2015201142)。

摘  要:针对科技文档间相似程度和影响力不同的问题,通过分析科技文档间非对称关系,提出一种新的度量方法.该方法将科技文档间非对称关系定义为文档覆盖度,并用公式覆盖度和文本覆盖度对其进行度量.公式覆盖度由改进的非对称因子计算,文本覆盖度通过利用文本的相对突出性调整特征向量的余弦夹角计算,公式覆盖度和文本覆盖度线性融合得到科技文档覆盖度.实验结果表明:与已有的2种科技文档关系度量方法相比,本文提出的非对称关系度量方法在聚类中的平均准确率分别提高了8%和4%.Aiming at the problem of different degree of similarity and influence between scientific documents,a new measurement method is proposed by analyzing the asymmetric relationship between scientific documents.This method defines the asymmetric relationship between scientific documents as document coverage,and uses formula coverage and text coverage to measure it.The formula coverage is calculated by an improved asymmetric factor,and the text coverage is calculated by using the relative prominence of the text to adjust the cosine angle of the feature vector,The formula coverage and text coverage are linearly fused to obtain the coverage of scientific documents.The experimental results show that compared with the existing two measurement methods for relationship between scientific documents,the average accuracy of the asymmetric relationship measurement method proposed in this paper is improved by about 8 percentage points and 4 percentage points.

关 键 词:科技文档 非对称性 覆盖度 关系度量 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象