检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:周晓敏[1] 滕飞[1] 张艺[1] ZHOU Xiaomin;TENG Fei;ZHANG Yi(School of Computing and Artificial Intelligence,Southwest Jiaotong University,Chengdu Sichuan 611756,China)
机构地区:[1]西南交通大学计算机与人工智能学院,成都611756
出 处:《计算机应用》2023年第9期2721-2726,共6页journal of Computer Applications
基 金:四川省重点研发项目(2021YFG0136)。
摘 要:国际疾病分类(ICD)编码的频率分布呈现出长尾的情况,因此,对少样本编码进行多标签文本分类极具挑战性。针对少样本编码分类中训练数据不足的问题,提出了一种基于元网络的自动ICD编码模型(MNIC)。首先,将特征空间中的实例和语义空间中的特征拟合到同一个空间进行映射,并将频繁编码的特征表示映射到它的分类器权重上,从而通过元网络学习到元知识;然后将学习到的元知识从数据丰富的频繁编码转移到数据贫乏的少样本编码;最后,为元知识的可转移性和通用性提供了合理的解释。在MIMIC-Ⅲ数据集上的实验结果表明,与次优的AGM-HT(Adversarial Generative Model conditioned on code descriptions with Hierarchical Tree structure)模型相比,MNIC将少样本编码的Micro-F1与曲线下面积(Micro-AUC)分别提高了3.77和3.82个百分点,显著提高了少样本编码分类的性能。The frequency distribution of International Classification of Diseases(ICD)codes is long tail,resulting in it is challenging to perform multi-label text classification for few-shot code.An MNIC(Meta Network-based automatic ICD Coding model)was proposed to solve the problem of insufficient training data in few-shot code classification.Firstly,instances in the feature space and features in the semantic space were fitted to the same space for mapping,and the feature representations of many-shot codes were mapped to their classifier weights,thus learning meta-knowledge through metanetwork.Secondly,the learned meta-knowledge was transferred from data-abundant many-shot codes to data-poor few-shot codes.Finally,a reasonable explanation was provided for the transferability and generality of meta-knowledge.Experimental results on MIMIC-Ⅲdataset show that MNIC improves the Micro-F1 and Micro Area Under Curve(Micro-AUC)of few-shot codes by 3.77 and 3.82 percentage points respectively compared to the suboptimal AGM-HT(Adversarial Generative Model conditioned on code descriptions with Hierarchical Tree structure)model,indicating that the proposed model improves the performance of few-shot code classification significantly.
关 键 词:自动国际疾病分类编码 少样本学习 元学习 自然语言处理 可解释性
分 类 号:TP389.1[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7