基于框架表示学习的汉语框架排歧  被引量:2

Chinese frame disambiguation based on frame representation learning

在线阅读下载全文

作  者:侯运瑶 曹学飞 崔军[1] 王瑞波 李济洪 李茹[1,4,5] Hou Yunyao;Cao Xuefei;Cui Jun;Wang Ruibo;Li Jihong;Li Ru(School of Computer&Information Technology,Shanxi University,Taiyuan 030006,China;School of software,Shanxi University,Taiyuan 030006,China;School of Modern Educational Technology,Shanxi University,Taiyuan 030006,China;Key Laboratory of Computer Intelligence&Chinese Information Processing of Ministry of Education,Shanxi University,Taiyuan 030006,China;Collaborative Innovation Center of Big Data Mining&Intelligent Technology in Shanxi,Taiyuan 030006,China)

机构地区:[1]山西大学计算机与信息技术学院,太原030006 [2]山西大学软件学院,太原030006 [3]山西大学现代教育技术学院,太原030006 [4]山西大学计算智能与中文信息处理教育部重点实验室,太原030006 [5]山西省大数据挖掘与智能技术协同创新中心,太原030006

出  处:《计算机应用研究》2020年第12期3640-3644,共5页Application Research of Computers

基  金:国家自然科学基金青年基金项目(61806115);国家自然科学基金项目(61772324)。

摘  要:为了改善框架排歧模型的性能,区别于传统分类算法人工提取特征的做法,直接从语料中的例句出发,使用神经网络模型给出了一种框架表示学习的算法,并将学习到的框架表示向量用于框架排歧任务,显著提升了框架排歧的性能。该算法充分利用CFN中例句库、词元库,基于hinge-loss的神经网络,学习到能最大区别正确框架与错误框架的框架表示向量。此外,还使用WSABIE算法学习到目标词及其上下文的表示向量,排歧时以上下文表示向量与框架表示向量做余弦夹角来判决。在CFN中88个有歧义的词元上进行3组2折交叉验证(3×2 BCV)实验,框架排歧精度最好达到72.52%,t-检验结果表明该方法性能显著高于其他框架排歧方法。In order to improve the performance of frame disambiguation model,this paper used a neural network model to learn frame representation based on sentences in corpus different from the traditional classification algorithm extracting features manually,and employed the learned frame representation on the frame disambiguation task,which significantly improved the performance of the task.Making full use of the CFN example sentence database and being based on the hinge-loss neural network,the algorithm learnt the frame representation that could distinguish the correct frame from the error frame in the largest degree.This paper also used the WSABIE algorithm to learn the representation vector of the context of the target word,and finally used the cosine distance between the context representation vector and the frame representation vector to make a decision for the task.Experiment performed three sets of two-fold cross-validation(3×2 BCV)on 88 ambiguous words in CFN,and the best accuracy of frame disambiguation reach to 72.52%.The t-test results show that the performance of the proposed method is significantly higher than other frame disambiguation methods.

关 键 词:框架排歧 框架表示 表示学习 汉语框架语义知识库 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象