检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:马银瑶[1] 毕文帅 毛锦江[3] 孟晨伟 吕翰林 王雷 MA Yinyao;BI Wenshuai;MAO Jinjiang;MENG Chenwei;LYU Hanlin;WANG Lei(Department of Obstetrics,Guangxi Zhuang Autonomous Region People's Hospital,Guangxi Zhuang Autonomous Region,Nanning530000,China;Institute of Biointelligence Technology,BGI Research-Shenzhen,Guangdong Province,Shenzhen518083,China;Department of Obstetrics,Guigang City People's Hospital,Guangxi Zhuang Autonomous Region,Guigang537000,China)
机构地区:[1]广西壮族自治区人民医院产科,广西南宁530000 [2]深圳华大生命科学研究院生物智能技术研究所,广东深圳518083 [3]广西壮族自治区贵港市人民医院产科,广西贵港537000
出 处:《中国当代医药》2023年第20期23-28,F0003,共7页China Modern Medicine
基 金:广西重点研发计划项目(桂科AB22035056)。
摘 要:目的产科的病案诊断文本,科研价值高但挖掘难度大。本文提出了一种组合算法方法,从文本中自动挖掘出满足科研要求的标准诊断术语,且可在不同医院产科应用。方法本文的组合算法先基于标注语料训练MC-BERT模型,训练后的模型进行术语标准化,再用Louvain算法归类冗余术语,自动输出科研诊断术语。结果组合算法的术语标准化在测试集上的F1达到0.9235,并可自动将1107个标准诊断术语聚类为106个科研诊断术语。组合算法在另一家医院的验证集上也得到了验证,术语标准化算法F1达到0.9094。结论该方法能从病案诊断文本中批量高效获取科研诊断术语,训练后的模型可在多家医院产科应用。Objective The medical record diagnostic texts of obstetrics are essentially important for scientific research but are difficult to extract.This paper presents a combinatorial algorithm to automatically extract standard diagnostic terms from the diagnostic texts and can be applied in different hospitals'obstetrics.Methods A combined algorithm was proposed as method.First,the MC-BERT model was trained based on the labeled corpus,and the trained model was used to standardize the terms.Then,the Louvain algorithm was used to classify redundant terms and automatically output scientific research diagnostic terms.Result The term normalization of the combined algorithm achieved an F1 of 0.9235 on the test set,and could automatically cluster 1107 standard diagnostic terms into 106 scientific research diagnostic terms.The combined algorithm was also validated on the validation set of another hospital,and the F1 of the term normalization algorithm reached 0.9094.Conclusion This method can efficiently obtain scientific research diagnostic terms in batches from the diagnostic texts of medical records,and the trained model can be applied in many hospitals'obstetrics.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.46