Heart Disease Prediction Method Based on Bayesian Hyperparameter Optimization Gradient Boosting Trees


Authors: WANG Haiyan; JIAO Zengchen; ZHAO Jian[1]; AN Tianbo; JU Yi

Affiliation: [1] Key Laboratory of Intelligent Rehabilitation and Accessibility for People with Disabilities of Ministry of Education, College of Computer Science and Technology, Changchun University, Changchun 130022, China

Source: Journal of Jilin University: Science Edition, 2025, No. 2, pp. 472-478 (7 pages)

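Funding: Science and Technology Research Project of the Education Department of Jilin Province (Grant No. JJKH20220597KJ); Science and Technology Development Plan Project of Jilin Province (Grant No. YDZJ202201ZYTS549).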

Abstract: To address the low prediction accuracy of traditional machine learning algorithms on the Cleveland and Hungarian heart disease datasets, we propose a heart disease prediction method based on gradient boosting trees with Bayesian hyperparameter optimization. First, the K-nearest neighbor algorithm is used to fill in missing values in the datasets, the data are processed with Min-Max normalization and One-Hot encoding, and a gradient boosting tree algorithm is used to predict heart disease. Second, Bayesian optimization combined with ten-fold cross-validation is used to search for the best combination of hyperparameters. Experimental results show that the optimized gradient boosting tree algorithm reaches a prediction accuracy of up to 90.2% on the Cleveland dataset and up to 81.4% on the Hungarian dataset, outperforming traditional machine learning methods such as decision trees, support vector machines, and K-nearest neighbors. The method can assist doctors in diagnosing heart disease.
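The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it runs on synthetic stand-in data rather than the Cleveland or Hungarian records, and it assumes scikit-learn, NumPy, and SciPy are available. The abstract does not state which surrogate model, acquisition function, or search ranges were used, so the Gaussian-process surrogate with expected improvement, the three tuned hyperparameters, their ranges, and the helper names `build_pipeline`, `objective`, and `expected_improvement` are all assumptions made for this example. It also assumes that missing values occur only in numeric columns.

```python
import numpy as np
from scipy.stats import norm
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern
from sklearn.impute import KNNImputer
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler, OneHotEncoder

rng = np.random.default_rng(0)

# Synthetic stand-in data (NOT the real heart disease records):
# columns 0-2 are numeric, column 3 is a categorical code in {0, 1, 2}.
n = 150
X_num = rng.normal(size=(n, 3))
X_cat = rng.integers(0, 3, size=(n, 1)).astype(float)
y = (X_num[:, 0] + 0.5 * X_cat[:, 0]
     + rng.normal(scale=0.5, size=n) > 0.5).astype(int)
X_num[rng.random(X_num.shape) < 0.05] = np.nan  # inject missing values
X = np.hstack([X_num, X_cat])


def build_pipeline(learning_rate, n_estimators, max_depth):
    """KNN imputation + Min-Max scaling + One-Hot encoding + gradient boosting."""
    numeric = Pipeline([
        ("impute", KNNImputer(n_neighbors=5)),
        ("scale", MinMaxScaler()),
    ])
    pre = ColumnTransformer([
        ("num", numeric, [0, 1, 2]),
        ("cat", OneHotEncoder(handle_unknown="ignore"), [3]),
    ])
    model = GradientBoostingClassifier(
        learning_rate=learning_rate, n_estimators=n_estimators,
        max_depth=max_depth, random_state=0)
    return Pipeline([("pre", pre), ("gbt", model)])


cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)


def objective(u):
    """Map u in [0, 1]^3 to hyperparameters; return (mean 10-fold accuracy, params)."""
    lr = 10 ** (-2 + 1.5 * u[0])         # learning_rate in [0.01, ~0.32]
    n_est = int(round(20 + 80 * u[1]))   # n_estimators in [20, 100]
    depth = int(round(1 + 3 * u[2]))     # max_depth in [1, 4]
    pipe = build_pipeline(lr, n_est, depth)
    acc = cross_val_score(pipe, X, y, cv=cv, scoring="accuracy").mean()
    return acc, (lr, n_est, depth)


def expected_improvement(mu, sigma, best):
    """Expected improvement acquisition for a maximization problem."""
    sigma = np.maximum(sigma, 1e-9)
    z = (mu - best) / sigma
    return (mu - best) * norm.cdf(z) + sigma * norm.pdf(z)


# Initial random design, then a few surrogate-guided evaluations.
U = rng.random((5, 3))
results = [objective(u) for u in U]
scores = [r[0] for r in results]
params = [r[1] for r in results]

for _ in range(5):
    y_obs = np.array(scores)
    centered = y_obs - y_obs.mean()      # center targets for the zero-mean GP
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), alpha=1e-3,
                                  random_state=0)
    gp.fit(U, centered)
    cand = rng.random((500, 3))          # random candidate points
    mu, sigma = gp.predict(cand, return_std=True)
    u_next = cand[np.argmax(expected_improvement(mu, sigma, centered.max()))]
    acc, p = objective(u_next)
    U = np.vstack([U, u_next])
    scores.append(acc)
    params.append(p)

best_idx = int(np.argmax(scores))
best_score, best_params = scores[best_idx], params[best_idx]
print("best mean 10-fold accuracy:", best_score)
print("best (learning_rate, n_estimators, max_depth):", best_params)
```

Placing the imputer and scaler inside the pipeline means they are refit on the training folds only during each cross-validation split, which avoids leaking information from the held-out fold into preprocessing. Accuracies obtained on this synthetic data say nothing about the figures reported in the abstract.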

Keywords: heart disease prediction; K-nearest neighbor algorithm; gradient boosting tree; Bayesian optimization

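Classification: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]; TP301.6 [Automation and Computer Technology: Control Science and Engineering]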

 
