检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:吕海啸 李益红[1] 周晓谊[1] LYU Haixiao;LI Yihong;ZHOU Xiaoyi(School of Cyberspace Security,Hainan University,Haikou 570228,China)
出 处:《计算机科学与探索》2024年第8期2180-2189,共10页Journal of Frontiers of Computer Science and Technology
基 金:海南省重点计划项目(ZDYF2022GXJS224);国家自然科学基金(62163010,62162021,62362025)。
摘 要:少样本命名实体识别通常使用基于相似性的度量,为了能够充分利用模型参数中的知识转移,提出一种前缀调优的少样本命名实体识别方法(P-NER)。将输入文本的特征向量放入嵌入模块进行特征提取;把前缀提示的向量参数拼接到编码层模型的前端,并将编码层模型参数进行固定;对编码层得到的结果进行交叉熵模型的解码,并对每个训练样本采样两个子模型,通过最小化两个子模型之间相对熵的方式达到对模型预测进行正则化的目的;通过验证输出概率和真实标签概率来衡量模型对每个词的标签预测与实际标签的一致程度并输出分类结果。实验结果表明在CoNLL2003数据集上,该方法的域内少样本实体识别的平均F1得分为84.92%,在跨领域少样本实体识别的MIT Movie、MIT Restaurant和ATIS三个数据集中均领先其他基线方法的结果。因此,该方法可在只需要调节以往微调方法的2.9%参数的情况下,显著提高少样本命名实体识别的效果。The commonly adopted approach for few-shot named entity recognition(NER)typically involves the use of similarity-based metrics.In order to fully leverage knowledge transfer within the model parameters,this paper proposes a prefix-tuning method for few-shot NER(P-NER).This involves placing the input text’s feature vectors into an embedding module for feature extraction.The vector parameters of prefix prompts are concatenated to the front end of the encoding layer model,with the encoding layer model parameters being fixed.The results obtained from the encoding layer are decoded using a cross-entropy model.For each training sample,two sub-models are sampled,and regularization of the model prediction is achieved by minimizing the relative entropy between the two sub-models.The model’s consistency with actual labels is assessed by validating the output probability and the probability of true labels for each word,ultimately yielding the classification results.Experimental results demonstrate that on the CoNLL2003 dataset,this method achieves an average F1 score of 84.92%for in-domain few-shot entity recognition.In the cross-domain few-shot entity recognition tasks,it outperforms other baseline methods on three datasets:MIT Movie,MIT Restaurant and ATIS.Thus,this method significantly enhances the effectiveness of few-shot named entity recognition with a mere 2.9%adjustment to the parameters of previous fine-tuning methods.
关 键 词:命名实体识别(NER) 少样本学习 提示学习
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7