基于知网与搜索引擎的词汇语义相似度计算被引量：6

Vocabulary Semantic Similarity Computation Based on How Net and Search Engine

出　　处：《计算机与现代化》2018年第4期90-94,共5页Computer and Modernization

摘　　要：提出一种基于知网与搜索引擎的词汇语义相似度计算方法。利用义原在层次体系树的深度、密度、信息量优化义原的相似性计算。将逐点共有信息(PMI)算法与归一化谷歌距离(NGD)算法结合优化基于搜索引擎的词汇语义相似度计算。将词汇的词性作为权重因子融合知网与搜索引擎的词汇相似度计算结果。实验结果表明,与基于知网和基于搜索引擎的语义相似度计算方法相比,所提出的方法在NLPCC测试集上的平均相似度更接近于测试集的评测标准,在汽车票务领域的词汇相似度计算中具有较好的应用效果。This paper proposes a method of computing lexical semantic similarity based on How Net and search engines. The similarity computation is optimized by using the depth,density and information of semantic primitive in the hierarchy tree. The search engine based lexical semantic similarity computation is optimized by combining the point by point common information(PMI) algorithm with the normalized Google distance(NGD) algorithm. The lexical part of speech is used as weighting factor to merge the word similarity computation between How Net and search engine. The experimental results show that,compared with the semantic similarity calculation method based on How Net and search engine,the average similarity of the proposed method on NLPCC test set is closer to the evaluation criteria of the test set,and lexical similarity in the car ticket calculation fields has a good application effect.

关键词：语义相似度知网义原搜索引擎

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于知网与搜索引擎的词汇语义相似度计算被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于知网与搜索引擎的词汇语义相似度计算 被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于知网与搜索引擎的词汇语义相似度计算被引量：6