检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:何涛[1,2] 陈剑 闻英友[1,2] HE Tao;CHEN Jian;WEN Yingyou(Neusoft Reserch,Northeastern University,Shenyang 110169;Research Center of Safety Engineering Technology in Industrial Control of Liaoning Province,Shenyang 110169)
机构地区:[1]东北大学东软研究院,沈阳110169 [2]辽宁省工业控制安全工程技术研究中心,沈阳110169
出 处:《计算机与数字工程》2022年第3期639-643,共5页Computer & Digital Engineering
基 金:国家重点研发计划(编号:2018YFC0830601);辽宁省重点研发计划(编号:2019JH2/10100027);教育部基本科研业务费项目(编号:N171802001);辽宁省“兴辽英才计划”项目(编号:XLYC1802100)资助。
摘 要:电子病历实体识别是智慧医疗服务中一项重要的基础任务,当前医院诊疗过程中采用人工分析病历文本的方法,容易产生关键信息遗漏且效率低下。为此,提出一种结合BERT与条件随机场的实体识别模型,使用基于双向训练Transformer的BERT中文预训练模型,在手工标注的符合BIOES标准的语料库上微调模型参数,通过BERT模型学习字符序列的状态特征,并将得到的序列状态分数输入到条件随机场层,条件随机场层对序列状态转移做出约束优化。BERT模型具有巨大的参数量、强大的特征提取能力和实体的多维语义表征等优势,可有效提升实体抽取的效果。实验结果表明,论文提出的模型能实现88%以上的实体识别F1分数,显著优于传统的循环神经网络和卷积神经网络模型。Electronic medical record entity recognition is an important basic task in intelligent medical services.At present,the method of manual analysis of medical record text is used in the process of diagnosis and treatment in hospitals,which is easy to produce key information omission and inefficient.Therefore,a kind of entity recognition model combining BERT and conditional random field is proposed.Using the BERT Chinese pre-training model based on bi-directional training transformers,the parameters of the model are fine-tuned on the manually marked corpus which conforms to the BIOES standard.Through the BERT model,the state characteristics of character sequences are learned,and the obtained sequence state scores are input into conditional random field layer,which makes a reduction to the sequence state transition bundle.BERT model has many advantages,such as huge parameters,powerful feature extraction ability and multi-dimensional semantic representation of entities,which can effectively improve the effect of entity extraction.The experimental results show that the BERT-CRF model obtained more than 88% of the entity recognition F1 score,which is significantly better than the traditional recurrent neural network and convolutional neural network model.
关 键 词:深度学习 BERT 条件随机场 命名实体识别 电子病历
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.236