Spatio-Temporal Forecasting and Uncertainty Quantification of COVID-19 Cases in Shanghai via a Bayesian Deep Learning Approach


Authors: ZHOU Shirong; TANG Yincai [1]; WANG Pingping; ZHUANG Liangliang; XU Jiawei (KLATASDS-MOE, School of Statistics, East China Normal University, Shanghai, 200062, China; College of Mathematics and Physics, Wenzhou University, Wenzhou, 325035, China; School of Economics, Nanjing University of Finance and Economics, Nanjing, 210023, China; School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou, 310018, China)

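Affiliations: [1] Key Laboratory of Advanced Theory and Application in Statistics and Data Science (Ministry of Education), School of Statistics, East China Normal University, Shanghai 200062; [2] College of Mathematics and Physics, Wenzhou University, Wenzhou 325035; [3] School of Economics, Nanjing University of Finance and Economics, Nanjing 210023; [4] School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou 310018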

Source: Chinese Journal of Applied Probability and Statistics, 2024, No. 2, pp. 298-322 (25 pages)

Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 12171432, 11671303, 12271168) and the 111 Project of China (Grant No. B14019).

Abstract: The COVID-19 outbreak in Shanghai in the spring of 2022 had a serious impact on society, the economy, and the daily life of residents. The spread of COVID-19 often exhibits complex nonlinear dynamics influenced by environment, demographics, medical conditions, frequency of nucleic acid or antigen testing, epidemic control strategies, and other factors. Long short-term memory (LSTM) models with complex network structures and extensive training are widely adopted to learn and predict the spread of epidemics. However, such models neither account for the uncertainty in the data nor take the influence of covariates and heterogeneities into account. Therefore, this paper proposes a two-stage LSTM nested generalized Poisson regression (LNGPR) model to analyze COVID-19 infection data from the outbreak in Shanghai in the spring of 2022. In the first stage, a multi-layer LSTM network is trained to learn district-specific infection data; the trained LSTM is then used to fit and predict the number of symptomatic COVID-19 infections. In the second stage, the predicted number of cases is modeled by a generalized Poisson regression model under a hierarchical Bayesian framework, in which the logarithm of the relative risk is modeled as a linear function of covariates and random effects with spatio-temporal heterogeneity. Facilitated by the deep learning approach, the spatio-temporal generalized Poisson regression model can forecast the number of daily new symptomatic infections and quantify its uncertainty. Furthermore, the predictions based on the proposed Bayesian deep learning approach perform better than those based on the LSTM method alone, by virtue of borrowing strength from covariates and from spatial and temporal heterogeneity.
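
The second stage described in the abstract can be illustrated with a minimal sketch of a generalized Poisson likelihood with a log-linear relative risk. The abstract does not give the parametrization, so this sketch assumes Consul's form, P(Y=y) = theta (theta + lam y)^(y-1) exp(-theta - lam y) / y!, with mean theta / (1 - lam). It also assumes that the LSTM prediction enters as an offset multiplying the relative risk. All function and parameter names here are hypothetical and are not taken from the paper.

```python
import math


def generalized_poisson_logpmf(y, theta, lam):
    """Log-pmf of Consul's generalized Poisson (assumed parametrization).

    theta > 0 is the rate parameter; 0 <= lam < 1 controls overdispersion.
    With lam = 0 this reduces to the ordinary Poisson log-pmf.
    """
    if y < 0 or theta <= 0 or not (0 <= lam < 1):
        raise ValueError("need y >= 0, theta > 0 and 0 <= lam < 1")
    return (math.log(theta)
            + (y - 1) * math.log(theta + lam * y)
            - theta - lam * y
            - math.lgamma(y + 1))


def expected_cases(offset, beta, x, random_effect):
    """Mean count = offset * relative risk, with a log-linear relative risk.

    offset stands in for the stage-one LSTM prediction (an assumption);
    random_effect stands in for the spatio-temporal random effects.
    """
    log_rr = sum(b * xi for b, xi in zip(beta, x)) + random_effect
    return offset * math.exp(log_rr)


def theta_from_mean(mu, lam):
    """Convert a mean mu into theta, using mean = theta / (1 - lam)."""
    return mu * (1.0 - lam)


if __name__ == "__main__":
    # Made-up inputs for illustration only; these are not data from the paper.
    mu = expected_cases(offset=20.0, beta=[0.1, -0.2], x=[1.0, 0.5], random_effect=0.05)
    theta = theta_from_mean(mu, lam=0.3)
    print(generalized_poisson_logpmf(18, theta, 0.3))
```

In a hierarchical Bayesian fit, priors would be placed on the coefficients, the random effects, and the dispersion parameter, and the posterior would be approximated, for example with INLA as the keywords suggest. This sketch only evaluates the likelihood.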

Keywords: COVID-19; LSTM; Poisson regression model; integrated nested Laplace approximation (INLA)

Classification number: O212.8 [Science: Probability Theory and Mathematical Statistics]

 
