电子病历命名实体识别技术研究综述  被引量:9

Review of Research on Named Entity Recognition Technologies for Electronic Medical Records

在线阅读下载全文

作  者:吴智妍 金卫[1] 岳路[1] 生慧[1] WU Zhiyan;JIN Wei;YUE Lu;SHENG Hui(College of Intelligence and Information Engineering,Shandong University of Traditional Chinese Medicine,Jinan 250355,China)

机构地区:[1]山东中医药大学智能与信息工程学院,济南250355

出  处:《计算机工程与应用》2022年第21期13-29,共17页Computer Engineering and Applications

基  金:山东中医药大学科学基金优秀青年科学基金自然科学类(2018zk22,2018zk23);中医药院校电子信息专业学位研究生培养模式研究与实践(XJJG2021006)。

摘  要:电子病历(EMR)是医疗信息快速发展的产物,目前以非结构化文本形式存储。通过使用自然语言处理(NLP)技术,在非结构化文本中提取出大量医学实体,将有助于提升医务人员查阅病历效率,同时识别的成果也将辅助于接下来的关系提取和知识图谱构建等研究。介绍常用的若干个数据集、语料标注标准和评价指标。从早期传统方法、深度学习方法、预训练模型、小样本问题处理四个方面详细阐述电子病历命名实体识别方法,对比分析各模型自身的优势及局限性。探讨了目前研究的不足,并对未来发展方向提出展望。Electronic medical records(EMR)are a product of the rapid development of medical information and are currently stored in the form of unstructured text.By using natural language processing(NLP)techniques to extract a large number of medical entities in unstructured text,it will help to improve the efficiency of medical personnel in accessing medical records,while the results of identification will also assist in the next research such as relationship extraction and knowledge graph construction.This paper introduces several commonly used datasets,corpus annotation criteria and evaluation metrics.This paper elaborates on the named entity recognition methods of electronic medical records from four aspects:early traditional methods,deep learning methods,pre-trained model,and small sample problem processing,and compares and analyzes the advantages and limitations of each model itself.The shortcomings of the current research are discussed,and the future development direction is proposed.

关 键 词:电子病历 自然语言处理 命名实体识别 深度学习 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象