MaLDA: Medication Analysis Based on LDA  (Cited by: 2)


Authors: ZHOU Jing (周靖) [1,2]; SHE Yuxuan (佘玉轩); XIONG Yun (熊赟) [1,2,3]

Affiliations: [1] School of Computer Science, Fudan University, Shanghai 201203, China; [2] Shanghai Key Laboratory of Data Science, Shanghai 201203, China; [3] Shanghai Key Laboratory of Financial Information Technology (Shanghai University of Finance and Economics), Shanghai 200433, China

Source: Computer Engineering and Applications (《计算机工程与应用》), 2016, No. 18, pp. 8-13 (6 pages)

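Funding: National High Technology Research and Development Program of China (863 Program) (No. 2015AA020105); National Natural Science Foundation of China (No. 91546105; No. 71331005); Shanghai Science and Technology Commission Fund (No. 14511107302); Open Project of the Shanghai Key Laboratory of Data Science (No. 201509060001); Special Program for Applied Research on Super Computation of the NSFC-Guangdong Joint Fund (Phase II); supported by the National Supercomputer Center in Guangzhou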

Abstract: To support doctors and patients in using drugs safely, rationally, and efficiently, this paper proposes MaLDA (Medication Analysis based on LDA), a medication analysis method built on Latent Dirichlet Allocation (LDA). MaLDA combines medication records with diagnostic (visit) records and treats each drug as a document, each drug function as a topic, and each disease as a word. The LDA topic model then infers the latent drug functions, and these functions link related drugs to one another, related diseases to one another, and drugs to diseases. Drugs are clustered according to their distributions over functions, each cluster is described by its related diseases, and clinical medication is analyzed on the basis of the clustering results. MaLDA can identify drugs that are more effective against a given class of diseases and can also uncover implicit drug combinations, which lays a foundation for mining drug side effects and disease complications. The method is evaluated on the medication and diagnostic records of 137,510 patients from a hospital in Shanghai. The results confirm the effectiveness of MaLDA relative to baseline methods for medication analysis on electronic medical records.
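The drug-as-document, function-as-topic, disease-as-word mapping described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy records, all function names, the hyperparameters, the choice of a collapsed Gibbs sampler for inference, and the grouping of drugs by their dominant function are assumptions made purely for illustration, since the abstract does not specify these details.

```python
import random
from collections import defaultdict

# Hypothetical toy records (invented for illustration, not from the paper).
medication_records = [            # (patient_id, drug)
    ("p1", "drug_A"), ("p2", "drug_A"), ("p3", "drug_B"),
    ("p4", "drug_C"), ("p5", "drug_C"), ("p6", "drug_D"),
]
visit_records = [                 # (patient_id, diagnosed disease)
    ("p1", "diabetes"), ("p2", "diabetes"), ("p2", "obesity"),
    ("p3", "diabetes"), ("p4", "hypertension"), ("p5", "hypertension"),
    ("p6", "hypertension"), ("p6", "heart failure"),
]

def build_documents(medications, visits):
    """Join both record sets on patient: each drug becomes a 'document'
    whose 'words' are the diseases of the patients who received it."""
    diseases_by_patient = defaultdict(list)
    for patient, disease in visits:
        diseases_by_patient[patient].append(disease)
    docs = defaultdict(list)
    for patient, drug in medications:
        docs[drug].extend(diseases_by_patient[patient])
    return dict(docs)

def gibbs_lda(docs, num_topics, iters=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA (assumed inference method).
    Returns theta, where theta[drug][k] estimates P(function k | drug)."""
    rng = random.Random(seed)
    vocab_size = len({w for words in docs.values() for w in words})
    n_dk = {d: [0] * num_topics for d in docs}             # function counts per drug
    n_kw = [defaultdict(int) for _ in range(num_topics)]   # disease counts per function
    n_k = [0] * num_topics                                 # total tokens per function
    z = {}                                                 # function assignment per token
    for d, words in docs.items():
        z[d] = []
        for w in words:
            k = rng.randrange(num_topics)
            z[d].append(k)
            n_dk[d][k] += 1
            n_kw[k][w] += 1
            n_k[k] += 1
    for _ in range(iters):
        for d, words in docs.items():
            for i, w in enumerate(words):
                k = z[d][i]                                # remove current assignment
                n_dk[d][k] -= 1
                n_kw[k][w] -= 1
                n_k[k] -= 1
                weights = [
                    (n_dk[d][t] + alpha) * (n_kw[t][w] + beta)
                    / (n_k[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                z[d][i] = k                                # record the resampled function
                n_dk[d][k] += 1
                n_kw[k][w] += 1
                n_k[k] += 1
    theta = {}
    for d, words in docs.items():
        denom = len(words) + num_topics * alpha
        theta[d] = [(n_dk[d][t] + alpha) / denom for t in range(num_topics)]
    return theta

def cluster_by_dominant_function(theta):
    """Stand-in clustering: group drugs by their most probable function."""
    clusters = defaultdict(list)
    for drug, dist in theta.items():
        clusters[max(range(len(dist)), key=dist.__getitem__)].append(drug)
    return dict(clusters)

docs = build_documents(medication_records, visit_records)
theta = gibbs_lda(docs, num_topics=2)
clusters = cluster_by_dominant_function(theta)
for function_id, drugs in sorted(clusters.items()):
    print(function_id, sorted(drugs))
```

Grouping by the single most probable function is the simplest possible stand-in; a distance-based clustering over the full `theta` vectors would be closer in spirit to "clustering drugs according to their distributions over functions", but the abstract does not say which algorithm the paper uses.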

Keywords: data mining; medication analysis; topic model; Latent Dirichlet Allocation

Classification number: TP39 [Automation and Computer Technology: Computer Application Technology]

 
