检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:吴云 杨长春[1] 梅佳俊 顾寰 WU Yun;YANG Chang-chun;MEI Jia-jun;GU Huan(School of Information Science and Engineering,Changzhou University,Changzhou 213164,China)
出 处:《计算机工程与设计》2018年第9期2776-2779,2810,共5页Computer Engineering and Design
基 金:赛尔网络下一代互联网技术创新基金项目(NGII20160703)
摘 要:为提高自动文摘的质量,提出一种词句协同的自动摘要提取算法(F-CoRank)。在传统词频的基础上,提高与标题相似的特征词的词频,得出提高后的词频矩阵和句子之间的相似度后,构建无向网络图,根据词句协同算法,得到各个节点的权重,对得到的粗文摘进行冗余处理,根据相应的需求,选择权重较高的前几个句子作为摘要。在哈工大的单文本文档语料上进行实验,实验结果表明,提高词频权重在一定程度上改进了文摘的质量,相比词句协同算法(Co-Rank)在覆盖率上有了较大提高。To improve the quality of automatic summarization,a word-sentence co-ranking based on the word frequency(F-CoRank)was proposed.Based on the traditional word frequency,the word frequency of the characteristic word which was similar to the title was improved,and after obtaining the improved word frequency matrix and the similarity between the sentences,the undirected network graph was constructed,and the weights of nodes were obtained according to the word-sentence co-ranking algorithm.The rough abstracts were redundantly processed,and the first few sentences with higher weights were selected according to the corresponding requirements.Experimental results on the HIT’s single document show that improving the frequency of word frequency exactly improves the quality of the abstract to a certain degree,and compared with the word-sentence co-ranking(Co-Rank),it improves the coverage rate.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.129.92.14