机构地区:[1]武汉大学国家网络安全学院,武汉430072 [2]广东外语外贸大学高级翻译学院,广州510420
出 处:《计算机学报》2022年第8期1746-1764,共19页Chinese Journal of Computers
基 金:国家重点研发计划项目(2017YFC1200500);国家自然科学基金项目(61702121,61772378);广州市科技计划项目(202102020607)资助.
摘 要:语义角色标注(Semantic Role Labeling,SRL)旨在识别给定句子中所包含的谓词及对应的语义论元,从而为信息抽取、自动问答和阅读理解等任务的语义理解提供帮助.构建句法特征作为实现语义角色标注任务的关键步骤,在很大程度上影响着任务的性能.针对现有的神经网络模型未能有效构建句法特征,例如现有研究采取离线式的人工定式句法裁剪方案,不可避免地造成关键句法信息丢失或者裁剪效果减弱等问题,本文提出基于动态句法剪枝机制的端到端神经网络模型,并将其用于中文语义角色标注任务.具体地,我们提出两种创新的动态句法剪枝机制:基于递归神经网络模型的动态句法剪枝机制(Recur-DSP)和基于带句法标签的图卷积网络模型的句法剪枝机制(SGCN-DSP).Recur-DSP采用递归神经网络模型进行句法结构编码与融合,并对句法树的每一个连接处通过Gumbel-Softmax函数离散化实现动态句法裁剪.SGCN-DSP采用图卷积神经网络模型为句法依存树的依存弧结构以及对应的标签进行统一建模,并提出对应的动态句法裁剪机制.在基准数据集上的实验结果显示所提方法超过当前的最好模型,获得当前中文语义角色标注的最优性能.通过整合预训练语言模型BERT,基于CoNLL09数据集,提出的模型SGCN-DSP在角色论元识别上获得了90.4%的F1值,在谓词识别上获得90.8%的F1值.Semantic role labeling(SRL),as the shallow semantic parsing task,which has received extensive research attention in recent years and plays a core role in the natural language processing(NLP)community.The SRL task aims to identify the corresponding argument roles for the predicates of a given sentence,which can facilitate the downstream NLP tasks,such as information extraction,question answer system and reading comprehension,etc.A great number of methods have been proposed for the task,and the existing studies can be divided into two main categories:machine learning based methods with hand-crafted discrete features and deep learning methods with automatic distributed features.The early studies largely separate SRL into two individual subtasks,i.e.,predicate disambiguation and argument role labeling.More recently,great efforts have been paid for constructing various end-to-end SRL architectures,solving two pipeline steps in one shot via one unified model.Recent studies also show that integrating external syntactic features,such as syntactic dependency trees,are important for the SRL task highly.So designing a novel neural model,which can capture syntactic features effectively,has become a heated research topic.Recently,He et al.(2018)find that only a part of syntactic structure information can offer valuable information for the SRL task,which calls for pruning the syntactic structure features.However,the existing work adopts the offline syntactic pruning strategy,which can inevitably lead to either the loss of key syntactic information or the weakening of pruning effectiveness.Extracting syntactic features,as an important step of the SRL task,will largely affect the final performance of the task.However,the existing neural network methods fail to effectively model syntactic features.For example,the existing studies adopt the offline syntactic pruning strategy with fixed human labor,which inevitably leads to the loss of key syntactic information or the weakening of pruning effectiveness.To address the above issues,w
关 键 词:自然语言处理 语义角色标注 句法剪枝 神经网络 深度学习
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...