检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王卓越 陈彦光 邢铁军[2] 孙媛媛[1] 杨亮[1] 林鸿飞[1] WANG Zhuoyue;CHEN Yanguang;XING Tiejun;SUN Yuanyuan;YANG Liang;LIN Hongfei(College of Computer Science and Technology,Dalian University of Technology,Dalian,Liaoning 116024,China;Neusoft Corporation,Shenyang 110179,China)
机构地区:[1]大连理工大学计算机科学与技术学院,辽宁大连116024 [2]东软集团股份有限公司,沈阳110179
出 处:《计算机工程与应用》2023年第2期178-184,共7页Computer Engineering and Applications
基 金:国家重点研发计划项目(2018YFC0830603)。
摘 要:面向法律文本的实体关系联合抽取技术对于案情关键信息的智能提取至关重要,是智慧司法领域应用中的重要环节。目前的联合抽取方法虽然已经在特定罪名案件的数据集上取得了较好的效果,但是由于模型在训练时只关注了特定罪名类型文本数据的特点,使得模型的泛化能力有限,在应用到多罪名案件的情况下常常使得模型的效果下降。因此引入多任务学习的方法对多罪名情形下的实体关系联合抽取进行了研究,以涉毒类案件和盗窃类案件两大类罪名的文书数据为基础,构建了一个罪名分类任务作为联合抽取的辅助任务,通过基于特征筛选的动态加权多任务模型同时对两个任务进行学习,在单任务模型的基础上整体F1值提升了2.4个百分点,在涉毒类案件和盗窃类案件上的F1值分别提升了1.6和3.2个百分点。Joint entity recognition and relation extraction on legal documents is important for automatic extraction of the crucial information of the legal cases.And it is a crucial part for legal intelligence application.The current triplet extraction methods have achieved good results on specific crime cases,while since these models only pay attention to the text features of specific crime type during training,the generalization ability of the model is limited,which usually leads to a decrease in the performance when applying to multi-crime legal documents.Therefore,it leverages the multi-task learning method for triplet extraction on multi-crime legal documents.The experiments are based on two categories of crimes involving drug-related cases and larceny-related cases.It constructs a crime classification task as auxiliary task and trains the two tasks simultaneously by the dynamic weight with feature filtering multi-task model.From the experimental results,compared with the single-task model,this model improves the F1 value by 2.4 percentage points on the whole,by 1.6 and 3.2 percentage points on drug-related cases and larceny-related cases respectively.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.175