检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:杨健斌 李咏 夏淑东 齐鹏嘉 戴燕云[1] 童基均[1] YANG Jianbin;LI Yong;XIA Shudong;QI Pengjia;DAI Yanyun;TONG Jijun(School of Information Science and Technology,Zhejiang Sci-Tech University,Hangzhou 310018,China;The Fourth Affiliated Hospital,Zhejiang University School of Medicine,Yiwu 322000,China)
机构地区:[1]浙江理工大学信息科学与工程学院,杭州310018 [2]浙江大学医学院附属第四医院,浙江义乌322000
出 处:《浙江理工大学学报(自然科学版)》2024年第3期378-388,共11页Journal of Zhejiang Sci-Tech University(Natural Sciences)
基 金:浙江省自然科学基金项目(LQ22F010006,LTGY23H170004)。
摘 要:针对心血管介入患者全周期病理数据普遍存在缺失、不连续、非结构化等问题,建立了心血管介入专病数据库,并采用基于Relief-F算法的预测方法,对心血管介入患者术后死亡风险进行预测。首先参照HL7、CDISC等国际心血管疾病统一标准对各数据源进行标准化处理,建立研究数据集,并对数据进行清洗和预处理;其次采用Relief-F算法对特征进行选择,最终保留30个特征变量;再次选择逻辑回归、支持向量机、随机森林等3种机器学习方法进行建模分析,并采用10折交叉验证方法对分类器进行训练;最后引入准确率等模型评价指标来评估各算法在数据集上的分类预测效果。实验结果表明:随机森林的分类效果在该研究数据集上的表现最佳,准确率达到81.97%,精确率为86.90%,召回率为82.14%,F1值为0.8441。该研究提出的方法能够客观反映患者术后死亡风险,为心血管介入患者术后死亡风险预测提供了一种有效的解决方案。In view of the common problems such as missing,discontinuous and unstructured pathological data of patients with cardiovascular intervention throughout the whole cycle,a cardiovascular interventional disease database was established,and the prediction method based on the Relief-F algorithm was adopted to effectively predict the risk of postoperative death of patients with cardiovascular intervention.Firstly,all data sources were standardized according to HL7,CDISC and other international cardiovascular disease standards to obtain research data sets,and the data sets were cleaned and preprocessed.Secondly,the Relief-F algorithm was used to select the features,and 30 feature variables were retained in the end.Thirdly,logistic regression,support vector machine and random forest were selected for modeling and analysis,and the 10-fold cross-validation method was used to train the classifier.Finally,model evaluation indexes such as accuracy rate were introduced to evaluate the classification prediction effect of each algorithm on the data set.The experimental results show that the classification effect of random forest has the best performance on the research data set,its accuracy rate is 81.97%,the accuracy rate is 86.90%,the recall rate is 82.14%,and the F_1 value is 0.8441.This study can objectively reflect the postoperative death risk of patients,and provides an effective solution for predicting the postoperative death risk of patients with cardiovascular intervention.
关 键 词:心血管介入 术后死亡风险预测 Relief-F算法 特征提取 机器学习 随机森林
分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.145