基于双随机森林的发热待查智能诊断方法  

An Intelligent Diagnosis Method for FUO Based on Bi-random Forest

在线阅读下载全文

作  者:杜建超[1] 丁俊瑶 赵梦楠 连建奇 陈天艳[3] WU Yuan 周云 石磊[3] DU Jianchao;DING Junyao;ZHAO Mengnan;LIAN Jianqi;CHEN Tianyan;WU Yuan;ZHOU Yun;SHI Lei(School of Telecommunications Engineering,Xidian University,Xi’an,Shaanxi 710071,China;The Second Affiliated Hospital of Air Force Medical University,Xi’an,Shaanxi 710038,China;The First Affiliated Hospital of Xi’an Jiaotong University,Xi’an,Shaanxi 710061,China;Duke University Health System,Durham NC 27710,U.S.A.)

机构地区:[1]西安电子科技大学通信工程学院,陕西西安710071 [2]空军军医大学第二附属医院,陕西西安710038 [3]西安交通大学第一附属医院,陕西西安710061 [4]Duke University Health System,Durham NC 27710

出  处:《生物医学工程学进展》2024年第3期197-205,共9页Progress in Biomedical Engineering

基  金:空军军医大学第二附属医院前沿交叉研究项目(2021QYJC-005)。

摘  要:在机器学习预测模型中,不平衡数据集会降低少数类的预测准确性。针对发热待查数据集的不平衡特性,该文提出了一种基于K-Means聚类欠采样的双随机森林病因预测方法。首先通过K-Means聚类欠采样构建一个平衡数据集,并在此基础上创建一个基于CART投票机制的随机森林预测模型。然后对初始数据集用同样的方法创建一个随机森林预测模型。最后将两个随机森林预测模型联合,使用两者的CART一起投票预测。该文提出的方法增加了CART的数量,在保持原有数据集特性的同时,提高了少数类的投票权重。在发热待查数据集上的实验表明,该文所提方法不仅改善了少数类的预测性能,对其他类别的预测性能也有一定程度的提升。In machine learning prediction models,imbalanced datasets reduce the accuracy of minority class predictions.A bi-random forest etiology prediction method based on K-Means clustering undersampling is proposed to address the imbalanced characteristics of the fever of unknown origin(FUO)dataset.Firstly,a balanced dataset is constructed through K-Means clustering undersampling,and a random forest prediction model based on the CART voting mechanism is created on this basis.Then,a random forest prediction model is also created using the same method for the initial dataset.Finally,two random forest prediction models are combined and their CART are used to vote together for prediction.The proposed method increases the number of CART,and enhances the voting weights of minority class while maintaining the characteristics of the original dataset.Experiments on FUO dataset show that the proposed method not only improves the prediction performance for minority class,but also improves the prediction performance for the other classes to a certain extent.

关 键 词:智能诊断 机器学习 发热待查 随机森林 不平衡数据集 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象