基于知网的新词语相似度算法研究被引量：11

New Word Similarity Algorithm Research Based on HowNet

出　　处：《情报科学》2015年第2期67-71,共5页Information Science

基　　金：国家自然科学基金资助项目(61003311);安徽省高校省级自然科学基金资助项目(KJ2011A040)

摘　　要：基于"知网"提出了一种新的词语相似度计算方法。在概念层次上,引入义原类相似度的概念及计算规则,结合词语概念中主要义原类限制次要义原类和变系数法对各义原类加权计算,求得概念相似度;在词语层次上,引入词性相似度的概念,取不同词性的最大值作为词语相似度。实验结果表明,与已有方法相比,该方法有效提高了词语相似度的精确度和计算效率。A new word similarity algorithm based on How Net semantic lexicon is proposed in the paper.In the conceptual level, this paper introduces the concept and calculation rules of sememe class similarityto do weighted calculation on each sememe class, combining the idea of the main sememe class limitingthe secondary sememe class and variable coefficient method,with this method, the conception similaritycan be achieved; in the word level, the concept of the similarity of the parts of speech is introduced, andthe maximum value is taken as word similarity. The experiment results show that the new method effective-ly improves the accuracy and computational efficiency of word similarity, compared with the existing meth-ods.

关键词：知网词语相似度义原类相似度词性相似度语义距离

分类号：G250.2[文化科学—图书馆学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于知网的新词语相似度算法研究被引量：11

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于知网的新词语相似度算法研究 被引量：11

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于知网的新词语相似度算法研究被引量：11