基于Null Importance与GS-LGBM的糖尿病视网膜病变因素分析与风险预测  

Risk factors analysis and prediction of diabetic retinopathy based on Null Importance and GSLGBM

在线阅读下载全文

作  者:曹佳悦 罗冬梅[1] CAO Jiayue;LUO Dongmei(School of Microelectronics and Data Science,Anhui University of Technology,Ma'anshan 243002,China)

机构地区:[1]安徽工业大学微电子与数据科学学院,安徽马鞍山243002

出  处:《中国医学物理学杂志》2023年第8期1033-1038,共6页Chinese Journal of Medical Physics

基  金:国家级创新创业训练项目(202110360094,202210360089,202210360086);安徽省高校自然科学基金重点研究项目(2022AH050328);安徽省教育教学研究项目(2020jyxm0238)。

摘  要:目的:通过机器学习算法分析糖尿病视网膜病变(DR)关键因素,构建DR风险预测模型,为DR的预防和诊断提供参考。方法:采用国家人口健康科学数据中心的《糖尿病并发症预警数据集》,基于Null Importance方法去除噪声特征,筛选出与DR有关的关键因素;基于GridSearch优化LGBM模型参数,构建GS-LGBM DR风险预测模型。以准确率、精确率、召回率、F1分数、AUC值作为评价标准,与XGBoost、随机森林、Logistic以及未调优的LGBM模型进行比较。结果:Null Importance方法筛选出30个关键因素;与XGBoost、随机森林、Logistic以及未调优的LGBM模型相比,本研究所构建的GS-LGBM DR风险预测模型各评价指标均最优,其在测试数据上的AUC值高达0.897。结论:相较传统的DR预测模型,经过超参数优化后的模型具有更好的DR风险预测能力,更有助于DR的临床诊断。Objective To analyze the risk factors of diabetic retinopathy(DR)and construct a DR risk prediction model through machine learning algorithms,thereby providing reference for DR prevention and diagnosis.Methods The study adopted the Diabetic Complication Early-Warning Data Set of the National Population Health Data Center.Null Importance method was used to remove noise features and screen out the key factors related to DR.LGBM model parameters were optimized with GridSearch to construct the GS-LGBM DR risk prediction model.The proposed method was compared with XGBoost,random forest,Logistic,and LGBM models in terms of accuracy,precision,recall,F1 score,and AUC values.Results Thirty key factors were screened out using the Null Importance method.Compared with XGBoost,random forest,Logistic and LGBM models,the GS-LGBM DR risk prediction model had the best evaluation performances,and its AUC value on the test data was as high as 0.897.Conclusion The hyperparameter optimized model is superior to the traditional DR prediction model,and it is more conducive to the clinical diagnosis of DR.

关 键 词:糖尿病视网膜病变 Null Importance 风险预测 GS-LGBM 

分 类 号:R318[医药卫生—生物医学工程] R587.1[医药卫生—基础医学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象