检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:吴智妍 金卫[1] 岳路[1] 生慧[1] WU Zhiyan;JIN Wei;YUE Lu;SHENG Hui(College of Intelligence and Information Engineering,Shandong University of Traditional Chinese Medicine,Jinan 250355,China)
机构地区:[1]山东中医药大学智能与信息工程学院,济南250355
出 处:《计算机工程与应用》2022年第21期13-29,共17页Computer Engineering and Applications
基 金:山东中医药大学科学基金优秀青年科学基金自然科学类(2018zk22,2018zk23);中医药院校电子信息专业学位研究生培养模式研究与实践(XJJG2021006)。
摘 要:电子病历(EMR)是医疗信息快速发展的产物,目前以非结构化文本形式存储。通过使用自然语言处理(NLP)技术,在非结构化文本中提取出大量医学实体,将有助于提升医务人员查阅病历效率,同时识别的成果也将辅助于接下来的关系提取和知识图谱构建等研究。介绍常用的若干个数据集、语料标注标准和评价指标。从早期传统方法、深度学习方法、预训练模型、小样本问题处理四个方面详细阐述电子病历命名实体识别方法,对比分析各模型自身的优势及局限性。探讨了目前研究的不足,并对未来发展方向提出展望。Electronic medical records(EMR)are a product of the rapid development of medical information and are currently stored in the form of unstructured text.By using natural language processing(NLP)techniques to extract a large number of medical entities in unstructured text,it will help to improve the efficiency of medical personnel in accessing medical records,while the results of identification will also assist in the next research such as relationship extraction and knowledge graph construction.This paper introduces several commonly used datasets,corpus annotation criteria and evaluation metrics.This paper elaborates on the named entity recognition methods of electronic medical records from four aspects:early traditional methods,deep learning methods,pre-trained model,and small sample problem processing,and compares and analyzes the advantages and limitations of each model itself.The shortcomings of the current research are discussed,and the future development direction is proposed.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.21.34.100