Logistic回归中误调整工具变量对因果效应估计的影响  被引量:2

The effect of mis-adjusting instrumental variables on the estimation of causal effect in Logistic regression analysis model

在线阅读下载全文

作  者:苏萍 王停停 于媛媛 孙晓茹 李洪凯 薛付忠 SU Ping;WANG Ting-ting;YU Yuan-yuan;SUN Xiao-ru;LI Hong-kai;XUE Fu-zhong(Institute of Basic Medicine,The Second Hospital of Shandong University,Jinan 250033,China;Department of Biostatistics,School of Public Health,Cheeloo College of Medicine,Shandong University,Institute of Health and Medical Research,Shandong University,Jinan 250012,China)

机构地区:[1]山东大学第二医院基础医学研究所,济南250033 [2]山东大学齐鲁医学院公共卫生学院生物统计系,山东大学健康医疗大数据研究院,济南250012

出  处:《中华疾病控制杂志》2021年第6期656-662,共7页Chinese Journal of Disease Control & Prevention

基  金:国家重点研发计划(2020YFC2003500);国家自然科学基金(81773547,82003557);山东省自然科学基金(ZR2019ZD02,ZR2019PH041)

摘  要:目的通过统计模拟和实例数据分析,探索当存在不可观测的混杂因素时,Logistic回归分析模型中调整工具变量(instrumental variable,IV)对估计因果效应的影响。方法设定变量均服从二项分布,在Logistic回归分析模型中依次使用不同的参数进行统计模拟,以因果效应估计值的偏倚和标准误作为评价指标;实例数据分析是基于山东省多家医院健康体检中心的体检随访数据,以高血压为目标结局,构建纵向观察队列,筛选单核苷酸多态性(single nucleotide polymorphism,SNP)位点rs12149832作为IV,在Logistic回归分析模型中,采用不同策略(纳入/不纳入rs12149832协变量)来分析BMI与患高血压风险之间的关系。结果统计模拟结果显示在以Logistic回归分析模型估计暴露与结局间的效应时,协变量集中纳入IV会增大效应估计的偏倚和标准误,但增大程度较小;实例分析中,高血压队列共纳入1240名女性,基线年龄为(37.7±10.5)岁,BMI为(22.1±3.1)kg/m^(2)。纳入IV的模型所得的效应估计值为0.225(P<0.001),略小于不包含IV的回归模型所得的效应估计值(0.228,P<0.001),基本验证了关于纳入IV进行调整的统计模拟结果。结论观察性流行病学研究中,Logistic回归分析模型误纳入IV对效应估计值的偏倚和标准误均有影响。Objective To explore the effects of adjusting for instrumental variables(IVs)in a Logistic regression model through statistical simulation and real data analysis while there were unmeasured confounding factors.Methods Simulations were carried out by traversing the value of parameters in the Logistic regression model,and variables were all binomial distribution.Bias and standard error were used to evaluate the performance of estimators.As for the real data analysis,a longitudinal hypertension cohort was constructed based on the multi-center health management cohort of Shandong Province,and single nucleotide polymorphism(SNP)rs12149832 was selected as the IV.Logistic regression models with and without adjusting IV(rs12149832)were used to estimate the effect of body mass index(BMI)on hypertension.Results The statistical simulation results showed that adjusting for IVs in a Logistic regression model would increase the confounding bias and the standard error of effect estimation,but these increases were generally small.As for the real data analysis,a total of 1240 women were included in the Hypertension cohort.The baseline age was(37.7±10.5)years and the BMI was(22.1±3.1)kg/m^(2).The estimated value with adjusting for IV(0.225,P<0.001)was slightly less than the estimated value without adjusting for the IV(0.228,P<0.001),which basically verified the statistical simulation results about adjusting IVs.Conclusion In observational epidemiological studies,the mistaken inclusion of IVs in the Logistic regression model has an impact on both the bias and standard error of the effect estimates.

关 键 词:工具变量 因果推断 混杂因素 Logistic回归分析模型 

分 类 号:R181.2[医药卫生—流行病学] R195.1[医药卫生—公共卫生与预防医学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象