Chinese Medical Named Entity Recognition Based on New Word Discovery and Lattice-LSTM (cited by: 8)

Authors: Zhao Yaoquan; Che Chao [1]; Zhang Qiang [1]

Affiliation: [1] National and Local Joint Engineering Laboratory of Computer Aided Design, Dalian University, Dalian 116622, Liaoning, China

Source: Computer Applications and Software, 2021, No. 1, pp. 161-165, 249 (6 pages)

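Funding: National Natural Science Foundation of China (61751203); Dalian Science and Technology Innovation Fund (2018J12GX036); Dalian High-Level Talent Innovation Support Program (2017RD11).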

Abstract: In medical named entity recognition, accuracy is limited by the large number of specialized medical terms and by non-standard language in the corpus. To recognize out-of-vocabulary medical terms and cope with non-standard language, we propose a multi-granularity named entity recognition model that combines N-grams new word discovery with Lattice-LSTM. The N-grams algorithm extracts new words from a medical conversation corpus, and these words are used to build a medical dictionary. The Lattice-LSTM model then encodes the input characters together with every word that matches the dictionary, and its gate structure lets the model select the most relevant characters and words. By exploiting the discovered new words, Lattice-LSTM can recognize out-of-vocabulary medical terms and thereby achieves better recognition results in the experiments.

Keywords: medical named entity recognition; N-grams; new word discovery; Lattice-LSTM

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]
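The two preprocessing steps named in the abstract, building a dictionary from frequent character N-grams and matching dictionary words against an input sentence to form the lattice, might be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the paper's filtering criteria, so the plain frequency threshold, the function names `discover_new_words` and `match_lattice_words`, the parameters `max_n`, `min_count`, and `max_len`, and the toy sentences are all hypothetical. The Lattice-LSTM network itself is not shown.

```python
from collections import Counter


def discover_new_words(sentences, known_words, max_n=4, min_count=2):
    """Collect character n-grams (length 2..max_n) that occur at least
    min_count times and are not already in known_words.

    Assumption: a raw frequency threshold stands in for whatever
    filtering the paper actually applies.
    """
    counts = Counter()
    for sentence in sentences:
        for n in range(2, max_n + 1):
            for i in range(len(sentence) - n + 1):
                counts[sentence[i:i + n]] += 1
    return {gram for gram, c in counts.items()
            if c >= min_count and gram not in known_words}


def match_lattice_words(sentence, lexicon, max_len=4):
    """Return (start, end, word) for every lexicon word of length
    2..max_len found in sentence; end is exclusive.

    These spans are the word-level paths a lattice model would see
    alongside the individual characters.
    """
    spans = []
    for i in range(len(sentence)):
        for j in range(i + 2, min(i + max_len, len(sentence)) + 1):
            word = sentence[i:j]
            if word in lexicon:
                spans.append((i, j, word))
    return spans


# Toy corpus (hypothetical, not from the paper's data).
corpus = ["患者服用阿司匹林", "阿司匹林可以止痛"]
new_words = discover_new_words(corpus, known_words={"患者"})
lexicon = {"服用", "阿司匹林"}
spans = match_lattice_words("服用阿司匹林", lexicon)
print(sorted(new_words))
print(spans)
```

A pure frequency threshold also keeps fragments of longer terms (substrings of a drug name, for example), so a practical pipeline would need some further filtering before the candidates are added to the dictionary.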
