检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]中央民族大学信息工程学院,北京100081 [2]国家语言资源监测与研究中心少数民族语言分中心,北京100081
出 处:《首都师范大学学报(自然科学版)》2016年第3期7-10,共4页Journal of Capital Normal University:Natural Science Edition
摘 要:本文提出了一种利用知网的实例库与知网关系进行词义消歧算法.该方法首先利用知网提供的实例库进行初步的匹配;若在实例库中没有完全匹配,则利用上下文搭配关键词与知网中的实例搭配词进行相似度计算,若相似度大于给定阈值,则消歧结束.否则,我们再判断歧义词的义原与关键词的义原是否具有某种关系,根据义原权值调节算法调整义原权值.调整后的义原权值大小不一,按照事先的约定,我们选取综合权值最大的义项.我们发现,该方法能够弥补仅依靠实例库的覆盖率低的问题,又能减少仅依靠统计方法产生的噪音,从而提高词义消歧的正确率.This paper proposes a WSD algorithm based on How Net. The method firstly uses case base that How Net provides for preliminary matching; if the case library do not exactly match,then we compute the similarity between the context matching keywords and How Net examples collocations. If the similarity is greater than a given threshold,then the end of disambiguation. Otherwise,we judge that if there has some relations between sememes of keywords and sememes of case library and adjust the original weight value according to the original weight adjustment algorithm. After the adjustment,the weights of sememes are not the same size. In accordance with the prior agreement,we selected the largest comprehensive weight items. We find that this method can make up for the problem that only depend on the low coverage of the instance library and can also reduce the noise generated by the statistical method,thus improving the correct rate of word sense disambiguation.
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15