检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:马建军 张铁娟[2] 赵庆龙[2] 于世晖 梅扬 Ma Jianjun;Zhang Tiejuan;Zhao Qinglong;Yu Shihui;Mei Yang(Clinical Quality Evaluation Institute,Jilin Provincial Tuberculosis Prevention and Treatment Institute,Changchun 130062,China;Jilin Provincial Center of Disease Control and Prevention,Changchun 130062,China;Chinese Center for Disease Control and Prevention,Beijing 102206,China)
机构地区:[1]吉林省结核病防治科学研究院诊疗质量评价所,长春130062 [2]吉林省疾病预防控制中心(吉林省公共卫生研究院),长春130062 [3]中国疾病预防控制中心,北京102206
出 处:《结核与肺部疾病杂志》2023年第5期364-369,共6页Journal of Tuberculosis and Lung Disease
基 金:吉林省卫生与健康管理模式革新项目(2020G007)。
摘 要:目的:应用机器学习算法随机森林建立吉林省老年流动人口肺结核发病风险模型并分析发病风险因素,为制定结核病重点人群防治策略提供参考。方法:采用1∶1匹配设计的病例对照研究,选择2021年吉林省登记的年龄≥60岁的流动人口肺结核患者(281例)为病例组,281例性别匹配的非本地户籍健康人群为对照组,随机抽取70%(393例/名)和30%(169例/名)的数据作为训练集和测试集,使用R Software Version 4.2.1软件建立随机森林算法的发病风险模型。结果:发病风险因素前5位分别为有结核病患者接触史、工作经常变动、个人防护差、吸烟、较少摄入肉蛋奶,其基尼平均减少值分别为44.344、29.007、21.859、19.703、15.242;随机森林模型最优树数量为281,袋外数据误差率为6.44%;ROC曲线下面积为0.967;使用Caret包10折交叉验证随机森林算法,正确率为93.5%,Kappa值为0.870。结论:有结核病患者接触史的老年流动人口被感染的风险最大,常态化的结核病防控要重视隔离具有传染性的肺结核患者,加强个人防护和营养摄入。Objective:To use the machine learning algorithm-random forest to establish a risk model of tuberculosis incidence among elderly mobile population in Jilin Province,so as to provide a reference for the development of prevention and treatment strategies for key populations of tuberculosis.Methods:Using a case-control study with a 1∶1 matching design,281 tuberculosis patients≥60 years from the migrant population registered in Jilin Province in 2021 were selected as the case group,and 281 gender-matched healthy non-local household members were selected as the control group,70%(393 cases)and 30%(169 cases)of the data were randomly selected as the training and test sets,and random forest algorithm was used to model the incidence risk of tuberculosis using R Software Version 4.2.1.Results:The top 5 risk factors for morbidity were history of exposure to tuberculosis patients,change of job,poor personal protection,smoking,and low intake of meat,eggs and milk,the average decline of Gini were 44.344,29.007,21.859,19.703 and 15.242,respectively;the optimal number of trees in the model was 281,and the error rate of out-of-bag data was 6.44%;area under the ROC curve was 0.967;the random forest algorithm was cross-validated using the Caret package 10-fold with a 93.5%correct rate and a Kappa value of 0.870.Conclusion:Elderly mobile population with a history of contact with tuberculosis patients were at highest risk of infection,thus normalized tuberculosis prevention and control should emphasize on isolation of infectious tuberculosis patients and strengthening personal protection and nutritional intake.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222