Chinese Frame Disambiguation Based on Word Distributed Representations (基于词分布表征的汉语框架排歧研究)

Cited by: 4

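Affiliations: [1] School of Computer and Information Technology, Shanxi University, Taiyuan, Shanxi 030006; [2] Department of Computer Engineering, Taiyuan Institute of Technology, Taiyuan, Shanxi 030008; [3] School of Software, Shanxi University, Taiyuan, Shanxi 030006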

Source: Journal of North University of China (Natural Science Edition) (《中北大学学报（自然科学版）》), 2015, No. 3, pp. 328-332, 337 (6 pages)

Abstract: Frame disambiguation aims to automatically assign a suitable frame from the existing frame inventory (CFN) to a target word in a Chinese sentence, based on the context of that target word. The task is treated as a classification problem over frames. The authors report this as the first work to introduce low-dimensional distributed word representations as model features for Chinese frame disambiguation, with the goal of examining how different feature representations affect the disambiguation model when only word features are used. The dataset consists of 2,077 annotated example sentences covering 88 lexical units, and the distributed representations of the words surrounding the target word are fed into a maximum entropy model. Experimental results show that the model using distributed word representations reaches an accuracy of 58.11%, a substantial improvement over the 47.47% obtained with traditional word features alone. This indicates that distributed word representations play an important role in Chinese frame disambiguation.
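The setup described in the abstract (embeddings of the words around the target word, fed to a maximum entropy classifier) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the 2-dimensional toy vectors, the English placeholder tokens, the generic labels FRAME_A and FRAME_B, the window size of 1, and the choice to concatenate (rather than, say, average) the context vectors. With real-valued features, a maximum entropy model is equivalent to multinomial logistic regression, which is what the code trains by gradient ascent.

```python
import math

# Hypothetical 2-dimensional word vectors; real distributed representations
# would be learned from a large corpus and have many more dimensions.
EMBEDDINGS = {
    "money": [1.0, 0.1],
    "loan": [0.9, 0.0],
    "river": [0.0, 1.0],
    "water": [0.1, 0.9],
}
DIM = 2
LABELS = ["FRAME_A", "FRAME_B"]  # placeholder frame names, not real CFN frames


def context_features(tokens, target_index, window=1):
    """Concatenate the vectors of words within `window` positions of the target."""
    feats = []
    for offset in range(-window, window + 1):
        if offset == 0:
            continue  # the target word itself is skipped
        i = target_index + offset
        word = tokens[i] if 0 <= i < len(tokens) else None
        # Unknown words and out-of-sentence positions map to a zero vector.
        feats.extend(EMBEDDINGS.get(word, [0.0] * DIM))
    return feats


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scores_for(W, x):
    xb = x + [1.0]  # append a bias feature
    return [sum(w * v for w, v in zip(W[lab], xb)) for lab in LABELS]


def train_maxent(X, y, epochs=200, lr=0.5):
    """Fit a maximum entropy (multinomial logistic) model by stochastic gradient ascent."""
    W = {lab: [0.0] * (len(X[0]) + 1) for lab in LABELS}
    for _ in range(epochs):
        for x, gold in zip(X, y):
            probs = softmax(scores_for(W, x))
            xb = x + [1.0]
            for lab, p in zip(LABELS, probs):
                grad = (1.0 if lab == gold else 0.0) - p
                for j, v in enumerate(xb):
                    W[lab][j] += lr * grad * v
    return W


def predict(W, x):
    s = scores_for(W, x)
    return LABELS[s.index(max(s))]


# Toy training set: (tokens, index of the target word, frame label).
TRAIN = [
    (["money", "bank", "loan"], 1, "FRAME_A"),
    (["loan", "bank", "money"], 1, "FRAME_A"),
    (["river", "bank", "water"], 1, "FRAME_B"),
    (["water", "bank", "river"], 1, "FRAME_B"),
]
X = [context_features(toks, idx) for toks, idx, _ in TRAIN]
y = [lab for _, _, lab in TRAIN]
W = train_maxent(X, y)

if __name__ == "__main__":
    print(predict(W, context_features(["money", "bank", "money"], 1)))
```

Concatenation keeps positional information (left versus right neighbour) at the cost of a feature vector that grows with the window; averaging the context vectors would be a smaller alternative. Which variant the paper uses is not stated in this record.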

Keywords: frame disambiguation; maximum entropy model; word distributed representations; Chinese frame semantic knowledge base

Classification code: TP391.1 [Automation and Computer Technology: Computer Application Technology]
