基于动态句法剪枝机制的中文语义角色标注被引量：2

Chinese Semantic Role Labeling Based on Dynamic Syntax Pruning

作　　者：费豪姬东鸿[1] 任亚峰 FEI Hao;JI Dong-Hong;REN Ya-Feng(School of Cyber Science and Engineering,Wuhan University,Wuhan 430072;School of Interpreting and Translation Studies,Guangdong University of Foreign Studies,Guangzhou 510420)

机构地区：[1]武汉大学国家网络安全学院,武汉430072 [2]广东外语外贸大学高级翻译学院,广州510420

出　　处：《计算机学报》2022年第8期1746-1764,共19页Chinese Journal of Computers

基　　金：国家重点研发计划项目(2017YFC1200500);国家自然科学基金项目(61702121,61772378);广州市科技计划项目(202102020607)资助.

摘　　要：语义角色标注(Semantic Role Labeling,SRL)旨在识别给定句子中所包含的谓词及对应的语义论元,从而为信息抽取、自动问答和阅读理解等任务的语义理解提供帮助.构建句法特征作为实现语义角色标注任务的关键步骤,在很大程度上影响着任务的性能.针对现有的神经网络模型未能有效构建句法特征,例如现有研究采取离线式的人工定式句法裁剪方案,不可避免地造成关键句法信息丢失或者裁剪效果减弱等问题,本文提出基于动态句法剪枝机制的端到端神经网络模型,并将其用于中文语义角色标注任务.具体地,我们提出两种创新的动态句法剪枝机制:基于递归神经网络模型的动态句法剪枝机制(Recur-DSP)和基于带句法标签的图卷积网络模型的句法剪枝机制(SGCN-DSP).Recur-DSP采用递归神经网络模型进行句法结构编码与融合,并对句法树的每一个连接处通过Gumbel-Softmax函数离散化实现动态句法裁剪.SGCN-DSP采用图卷积神经网络模型为句法依存树的依存弧结构以及对应的标签进行统一建模,并提出对应的动态句法裁剪机制.在基准数据集上的实验结果显示所提方法超过当前的最好模型,获得当前中文语义角色标注的最优性能.通过整合预训练语言模型BERT,基于CoNLL09数据集,提出的模型SGCN-DSP在角色论元识别上获得了90.4%的F1值,在谓词识别上获得90.8%的F1值.Semantic role labeling(SRL),as the shallow semantic parsing task,which has received extensive research attention in recent years and plays a core role in the natural language processing(NLP)community.The SRL task aims to identify the corresponding argument roles for the predicates of a given sentence,which can facilitate the downstream NLP tasks,such as information extraction,question answer system and reading comprehension,etc.A great number of methods have been proposed for the task,and the existing studies can be divided into two main categories:machine learning based methods with hand-crafted discrete features and deep learning methods with automatic distributed features.The early studies largely separate SRL into two individual subtasks,i.e.,predicate disambiguation and argument role labeling.More recently,great efforts have been paid for constructing various end-to-end SRL architectures,solving two pipeline steps in one shot via one unified model.Recent studies also show that integrating external syntactic features,such as syntactic dependency trees,are important for the SRL task highly.So designing a novel neural model,which can capture syntactic features effectively,has become a heated research topic.Recently,He et al.(2018)find that only a part of syntactic structure information can offer valuable information for the SRL task,which calls for pruning the syntactic structure features.However,the existing work adopts the offline syntactic pruning strategy,which can inevitably lead to either the loss of key syntactic information or the weakening of pruning effectiveness.Extracting syntactic features,as an important step of the SRL task,will largely affect the final performance of the task.However,the existing neural network methods fail to effectively model syntactic features.For example,the existing studies adopt the offline syntactic pruning strategy with fixed human labor,which inevitably leads to the loss of key syntactic information or the weakening of pruning effectiveness.To address the above issues,w

关键词：自然语言处理语义角色标注句法剪枝神经网络深度学习

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于动态句法剪枝机制的中文语义角色标注被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于动态句法剪枝机制的中文语义角色标注 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于动态句法剪枝机制的中文语义角色标注被引量：2