Competing Risks Data Analysis with High-dimensional Covariates:An Application in Bladder Cancer  被引量:3

Competing Risks Data Analysis with High-dimensional Covariates:An Application in Bladder Cancer

在线阅读下载全文

作  者:Leili Tapak Massoud Saidijam Majid Sadeghifar Jalal Poorolajal Hossein Mahjub 

机构地区:[1]Department of Biostatistics and Epidemiology,School of Public Health,Hamadan University of Medical Sciences [2]Research Center for Molecular Medicine,Department of Molecular Medicine and Genetics,School of Medicine,Hamadan University of Medical Sciences [3]Department of Statistics,Bu-Ali Sina University [4]Modeling of Noncommunicable Diseases Research Center,School of Public Health,Hamadan University of Medical Sciences [5]Research Center for Health Sciences,School of Public Health,Hamadan University of Medical Sciences

出  处:《Genomics, Proteomics & Bioinformatics》2015年第3期169-176,共8页基因组蛋白质组与生物信息学报(英文版)

基  金:funded by the Vice Chancellor for Research and Technology of Hamadan University of Medical Sciences (grant No.9210173382)

摘  要:Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high- dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typical variable selection techniques including Lasso, elastic net, and likelihood-based boosting for high-dimensional time-to-event data with competing risks. The per- formance of these methods was evaluated via a simulation study by analyzing a real dataset related to bladder cancer patients using time-dependent receiver operator characteristic (ROC) curve and bootstrap .632 + prediction error curves. The elastic net penalization method was shown to outper- form Lasso and boosting. Based on the elastic net, 33 genes out of 1381 genes related to bladder cancer were selected. By fitting to the Fine and Gray model, eight genes were highly significant(P 〈 0.001). Among them, expression of RTN4, SON, IGF1R, SNRPE, PTGR1, PLEK, and ETFDHwas associated with a decrease in survival time, whereas SMARCAD1 expression was asso- ciated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting 'for the prediction of survival time in bladder cancer patients. Moreover, genes selected by all methods improved the predictive power of the model based on only clinical variables, indicating the value of information contained in the mieroarray features.Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high- dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typical variable selection techniques including Lasso, elastic net, and likelihood-based boosting for high-dimensional time-to-event data with competing risks. The per- formance of these methods was evaluated via a simulation study by analyzing a real dataset related to bladder cancer patients using time-dependent receiver operator characteristic (ROC) curve and bootstrap .632 + prediction error curves. The elastic net penalization method was shown to outper- form Lasso and boosting. Based on the elastic net, 33 genes out of 1381 genes related to bladder cancer were selected. By fitting to the Fine and Gray model, eight genes were highly significant(P 〈 0.001). Among them, expression of RTN4, SON, IGF1R, SNRPE, PTGR1, PLEK, and ETFDHwas associated with a decrease in survival time, whereas SMARCAD1 expression was asso- ciated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting 'for the prediction of survival time in bladder cancer patients. Moreover, genes selected by all methods improved the predictive power of the model based on only clinical variables, indicating the value of information contained in the mieroarray features.

关 键 词:MICROARRAY Elastic net Lasso Competing risks Subdistribution hazard Cause-specific hazard 

分 类 号:R737.14[医药卫生—肿瘤]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象