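Authors: XU Jianmin, WANG Xin (College of Cyberspace Security and Computer, Hebei University, Baoding 071002, China)
Affiliation: [1] College of Cyberspace Security and Computer, Hebei University, Baoding 071002, Hebei, China
Source: Journal of Hebei University (Natural Science Edition), 2021, No. 5, pp. 587-598 (12 pages)
Funding: National Social Science Fund of China, Post-Funded Project (17FTQ002); Natural Science Foundation of Hebei Province (F2015201142).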
Abstract: Scientific documents differ in how similar they are to one another and in how much influence they carry. To address this, a new measurement method is proposed based on an analysis of the asymmetric relationship between scientific documents. The method defines this asymmetric relationship as document coverage and measures it with two components: formula coverage and text coverage. Formula coverage is calculated with an improved asymmetric factor. Text coverage is calculated by using the relative prominence of the text to adjust the cosine of the angle between feature vectors. The two components are then linearly fused to obtain the coverage of scientific documents. Experimental results show that, compared with two existing methods for measuring relationships between scientific documents, the proposed asymmetric measure improves average clustering accuracy by about 8 and 4 percentage points, respectively.
Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]
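The abstract describes the structure of the measure (an asymmetric formula coverage, an asymmetric text coverage built on cosine similarity, and a linear fusion of the two) but does not give the formulas. The sketch below only illustrates that structure. Everything specific in it is an assumption and not the authors' definition: the containment ratio used for `formula_coverage`, the shared-weight ratio used as the "relative prominence" adjustment in `text_coverage`, the fusion weight `alpha`, and all function names. Term weights are assumed to be non-negative.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def text_coverage(a, b):
    """How much text vector a covers text vector b (asymmetric).

    Assumption: the cosine is scaled by the share of b's total weight
    that falls on terms also present in a. This is a stand-in for the
    paper's relative-prominence adjustment, not its actual formula.
    """
    total = sum(b.values())
    if total == 0:
        return 0.0
    shared = sum(w for t, w in b.items() if t in a)
    return cosine(a, b) * shared / total


def formula_coverage(formulas_a, formulas_b):
    """Fraction of b's formulas that also appear in a (asymmetric).

    Assumption: plain set containment, standing in for the paper's
    improved asymmetric factor.
    """
    if not formulas_b:
        return 0.0
    return len(formulas_a & formulas_b) / len(formulas_b)


def document_coverage(text_a, text_b, formulas_a, formulas_b, alpha=0.5):
    """Linear fusion of formula and text coverage; alpha is hypothetical."""
    return (alpha * formula_coverage(formulas_a, formulas_b)
            + (1 - alpha) * text_coverage(text_a, text_b))
```

Because both components normalize by the covered document, swapping the arguments generally changes the result: a long survey can cover a short note far more than the note covers the survey, which is the kind of asymmetry the abstract refers to.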