检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:WANG Changjing ZENG Xianghui WANG Yuxin SUN Yuxin ZUO Zhengkang 王昌晶;曾翔辉;王钰鑫;孙钰昕;左正康(江西师范大学数字产业学院,江西上饶334000;江西师范大学计算机信息工程学院,江西南昌330022;江西师范大学高性能计算江西省重点实验室,江西南昌330022;江西师范大学网络化支撑软件国家级国际科技合作基地,江西南昌330022)
机构地区:[1]School of Digital Industry,Jiangxi Normal University,Shangrao 334000,Jiangxi,China [2]School of Computer Information Engineering,Jiangxi Normal University,Nanchang 330022,Jiangxi,China [3]Jiangxi Provincial Key Laboratory for High Performance Computing,Jiangxi Normal University,Nanchang 330022,Jiangxi,China [4]State International Science and Technology Cooperation Base of Networked Supporting Software,Jiangxi Normal University,Nanchang 330022,Jiangxi,China
出 处:《Wuhan University Journal of Natural Sciences》2025年第1期21-31,共11页武汉大学学报(自然科学英文版)
基 金:Supported by the National Nature Science Foundation of China(62462037,62462036);Project for Academic and Technical Leader in Major Disciplines in Jiangxi Province(20232BCJ22013);Jiangxi Provincial Natural Science Foundation(20242BAB26017,20232BAB202010);Jiangxi Province Graduate Innovation Fund Project(YC2023-S320)。
摘 要:Intent detection and slot filling are two important components of natural language understanding.Because their relevance,joint training is often performed to improve performance.Existing studies mostly use a joint model of multi-intent detection and slot-filling with unidirectional interaction,which improves the overall performance of the model by fusing the intent information in the slot-filling part.On this basis,in order to further improve the overall performance of the model by exploiting the correlation between the two,this paper proposes a joint multi-intent detection and slot-filling model based on a bidirectional interaction structure,which fuses the intent encoding information in the encoding part of slot filling and fuses the slot decoding information in the decoding part of intent detection.Experimental results on two public multi-intent joint training datasets,MixATIS and MixSNIPS,show that the bidirectional interaction structure proposed in this paper can effectively improve the performance of the joint model.In addition,in order to verify the generalization of the bidirectional interaction structure between intent and slot,a joint model for single-intent scenarios is proposed on the basis of the model in this paper.This model also achieves excellent performance on two public single-intent joint training datasets,CAIS and SNIPS.意图识别与槽位填充是自然语言理解的两个重要组成部分,由于两者的相关性,所以常进行联合训练提高性能。现有研究多是采用单向交互的多意图识别与槽位填充联合模型,通过在槽位填充部分融合意图信息提高了模型整体性能。为了进一步利用两者的相关性提高模型整体性能,本文提出了基于双向交互结构的多意图识别与槽位填充联合模型,在槽位填充编码部分融合了意图编码信息,在意图识别解码部分融合了槽位解码信息。在MixATIS和MixSNIPS两个公共多意图联合训练数据集上的实验结果表明,该双向交互结构能有效提高联合模型的性能。此外,为了验证意图与槽位双向交互结构的泛化性,在本文模型的基础上提出了针对单意图场景的联合模型,在CAIS和SNIPS两个公共单意图联合训练数据集上也取得了优异的性能。
关 键 词:natural language understanding multi-intent detection slot filling bidirectional interaction joint training
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.227.21.218