基于随机森林算法构建胆囊肿瘤性息肉发生的预测模型  

Prediction model for neoplastic gallbladder polyps based on random forest algorithm

作  者:李起 豆明辉 贾蓬勃 王新团 雷达[3] 李军辉[4] 杨瑞[5] 杨成林[6] 张小弟 郝琪伟[8] 耿西林[9] 张煜[9] 刘益民[10] 郭智华[10] 姚春和[11] 王天翊 蔡志强[12] 司书宾[12] 杨文彬[4] 耿智敏[1] 张东[1] Qi Li;Minghui Dou;Pengbo Jia;Xintuan Wang;Da Lei;Junhui Li;Rui Yang;Chenglin Yang;Xiaodi Zhang;Qiwei Hao;Xilin Geng;Yu Zhang;Yimin Liu;Zhihua Guo;Chunhe Yao;Tianyi Wang;Zhiqiang Cai;Shubin Si;Wenbin Yang;Zhimin Geng;Dong Zhang(Department of Hepatobiliary Surgery,the First Affiliated Hospital of Xi'an Jiaotong University,Xi'an 710061,China;Department of Hepatobiliary Surgery,the First People's Hospital of Xianyang,Xianyang 712000,China;Department of Hepatobiliary Surgery,Baoji Central Hospital,Baoji 721000,China;Department of General Surgery,the Second Affiliated Hospital of Xi'an Jiaotong University,Xi'an 710004,China;Department of Hepatobiliary Surgery,Hanzhong Central Hospital,Hanzhong 723000,China;Department of General Surgery,Ankang Central Hospital,Ankang 725000,China;Department of Hepatobiliary Surgery,No.215 Hospital of Shaanxi Nuclear Industry,Xianyang 712000,China;Department of Hepatobiliary Surgery,Yulin No.2 Hospital,Yulin 719000,China;Department of Hepatobiliary Surgery,Shaanxi Provincial People's Hospital,Xi'an 710068,China;Department of Hepatobiliary Surgery,Baoji People's Hospital,Baoji 721000,China;Department of General Surgery,Xianyang Hospital of Yan'an University,Xianyang 712000,China;Department of Industrial Engineering,School of Mechanical Engineering,Northwestern Polytechnical University,Xi'an 710072,China)

机构地区:[1]西安交通大学第一附属医院肝胆外科,710061 [2]陕西省咸阳市第一人民医院肝胆外科,712000 [3]陕西省宝鸡市中心医院肝胆外科,721000 [4]西安交通大学第二附属医院普通外科,710004 [5]陕西省汉中市中心医院肝胆外科,723000 [6]陕西省安康市中心医院普通外科,725000 [7]陕西省核工业215医院肝胆外科,陕西省咸阳市712000 [8]陕西省榆林市第二医院肝胆外科,719000 [9]陕西省人民医院肝胆外科,西安710068 [10]陕西省宝鸡市人民医院肝胆外科,721000 [11]陕西省延安大学附属咸阳医院普通外科,712000 [12]西北工业大学机电学院工业工程系,西安710072

出  处:《中华肝脏外科手术学电子杂志》2025年第1期46-52,共7页Chinese Journal of Hepatic Surgery(Electronic Edition)

基  金:国家自然科学基金(62076194);陕西省重点研发计划(2021SF-016,2022-SF-606);西安交通大学第一附属医院院基金(2024-QN-015)。

摘  要:目的探讨胆囊肿瘤性息肉发生相关因素,并基于随机森林算法构建胆囊肿瘤性息肉预测模型。方法收集2015年1月至2023年8月在11家医疗中心行胆囊切除术的745例胆囊息肉患者临床病理资料。患者均签署知情同意书,符合医学伦理学规定。其中男286例,女459例;年龄18~80岁,中位年龄46岁。胆囊息肉长径为10~15 mm,中位直径11 mm。胆囊肿瘤性息肉发生相关因素的单因素分析采用χ2或Mann-Whitney U检验。根据患者入院时间不同分为训练集(588例)和测试集(157例),训练集用于随机森林预测模型的构建,测试集用于预测模型验证。采用ROC曲线下面积(AUC)及混淆矩阵评估模型的预测能力。结果本研究中非肿瘤性息肉者占87.2%(650/745),其中胆固醇息肉518例,炎性息肉55例,腺瘤样增生47例;肿瘤性息肉占12.8%(95/745),其中胆囊腺瘤83例,T1期胆囊癌12例。单因素分析显示,息肉数量、息肉长径、息肉短径、基底情况、息肉部位、回声强度与胆囊肿瘤性息肉发生有关(χ^(2)=20.675,Z=-4.694,Z=-2.595,χ^(2)=6.692,Z=3.935,Z=-2.690;P<0.05)。基于胆囊肿瘤性息肉发生的危险因素及重要度排序结果构建随机森林预测模型,模型训练集和测试集AUC分别为0.79、0.69,敏感度分别为0.74、0.63,特异度分别为0.75、0.68。基于胆囊肿瘤性息肉的随机森林预测模型混淆矩阵分析,模型训练集和测试集准确率分别为75%、68%。结论胆囊肿瘤性息肉发生与息肉个数、息肉大小、息肉基底情况、息肉部位、回声强度等具有明显相关性,基于随机森林算法构建的预测模型有助于胆囊肿瘤性息肉的识别,为胆囊息肉患者的外科诊疗及随访策略提供决策支持。Objective To explore the risk factors of neoplastic gallbladder polyps,and construct a prediction model for neoplastic gallbladder polyps based on random forest algorithm.Methods Clinicopathological data of 745 patients with gallbladder polyps who underwent cholecystectomy in 11 medical centers from January 2015 to August 2023 were collected.The informed consents of all patients were obtained and the local ethical committee approval was received.Among them,286 patients were male and 459 female,aged from 18 to 80 years,with a median age of 46 years.The maximum diameter of gallbladder polyps was ranged from 10 to 15 mm,and the median diameter was 11 mm.Univariate analysis of the risk factors of neoplastic gallbladder polyps was conducted by Chi-square test or Mann Whitney U test.According to the admission date,they were divided into the training set(n=588)and test set(n=157).The training set was used to construct the random forest prediction model,and the test set was utilized to validate the prediction model.The prediction performance of this model was assessed by the area under the ROC curve(AUC)and confusion matrix.Results In this study,non-neoplastic gallbladder polyps patients accounted for 87.2%(650/745),including 518 cases of cholesterol polyps,55 cases of inflammatory polyps and 47 cases of adenomatous hyperplasia.The proportion of neoplastic gallbladder polyps was 12.8%(95/745),including 83 cases of gallbladder adenomas and 12 cases of T1 gallbladder carcinomas.Univariate analysis showed that the number of polyp,maximum and minimum diameter of polyp,polyp short diameter,polyp basal status,polyp location and echo intensity were correlated with the incidence of neoplastic gallbladder polyps(χ^(2)=20.675,Z=-4.694,Z=-2.595,χ^(2)=6.692,Z=3.935,Z=-2.690;P<0.05).Based on the risk factors of neoplastic gallbladder polyps and the ranking of importance,a random forest prediction model was constructed.The AUC of the training and test sets was 0.79 and 0.69,with a sensitivity of 0.74 and 0.63 and a specificity of 0.

关 键 词:胆道良性疾病 胆囊息肉 肿瘤性息肉 胆囊切除术 随机森林算法 

分 类 号:R73[医药卫生—肿瘤]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象