基于领域知识图谱增强和Lattice-LSTM的中医药命名实体识别  

TRADITIONAL CHINESE MEDICINE NAMED ENTITY RECOGNITION BASED ON DOMAIN KNOWLEDGE GRAPH ENHANCEMENT AND LATTICE-LSTM

在线阅读下载全文

作  者:牛天星 郑小盈[2,3] 祝永新 汪辉 Niu Tianxing;Zheng Xiaoying;Zhu Yongxin;Wang Hui(School of Information Science and Technology,Shanghai Tech University,Shanghai 201210,China;Shanghai Advanced Research Institute,Chinese Academy of Science,Shanghai 201210,China;University of Chinese Academy of Sciences,Beijing 101408,China)

机构地区:[1]上海科技大学信息科学与技术学院,上海201210 [2]中国科学院上海高等研究院,上海201210 [3]中国科学院大学,北京101408

出  处:《计算机应用与软件》2025年第3期127-134,共8页Computer Applications and Software

基  金:国家重点研发计划项目(2020SKA0120202);国家自然科学基金项目(U2032125)。

摘  要:针对中医药领域命名实体识别任务中,现有的通过构造词典对实体识别模型进行增强的方法中存在的专业术语发现困难、构造词典效率低下和识别准确率不足等问题,提出一种基于领域知识图谱增强和Lattice-LSTM的领域命名实体识别模型。通过对已经构建完成的领域图谱使用嵌入算法,将其快速高效地转化为领域词典,并使用融合多粒度词汇信息的Lattice-LSTM将词典中的专业词汇编码到模型的输入中去,从而提高了模型在领域实体识别任务上的效果。采用中医药数据集进行实验,结果表明,所提模型的F1值高于传统实体识别模型,验证了模型的有效性。Aiming at the problems in existing methods for enhancing entity recognition models through lexicon construction in Traditional Chinese Medicine(TCM)named entity recognition tasks—including difficulties in discovering domain-specific terms,inefficient dictionary construction,and insufficient recognition accuracy—this study proposes a domain-specific named entity recognition model based on domain knowledge graph enhancement and Lattice-LSTM.By applying embedding algorithms to a pre-constructed domain knowledge graph,we efficiently convert it into a domain-specific lexicon and incorporate multi-granularity lexical information through Lattice-LSTM to encode professional vocabulary in the dictionary into the model's input,thereby improving the model's effectiveness in domain-specific entity recognition tasks.Experiments on TCM datasets show that the F1-score of the proposed model is higher than that of traditional entity recognition models,verifying the model's validity.

关 键 词:领域知识图谱 中医药 命名实体识别 知识图谱嵌入 

分 类 号:TP3[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象