基于XGBoost的中国上市公司违约风险预测模型  被引量:4

Default Risk Prediction Model for Chinese Listed Companies Based on XGBoost

在线阅读下载全文

作  者:迟国泰[1] 王珊珊[1] CHI Guotai;WANG Shanshan(School of Economics and Management,Dalian University of Technology,Dalian 116024,Liaoning,China)

机构地区:[1]大连理工大学经济管理学院,辽宁大连116024

出  处:《系统管理学报》2024年第3期735-754,共20页Journal of Systems & Management

基  金:国家自然科学基金重点项目(71731003);国家自然科学基金面上项目(72071026,72173096,71971051,71971034,71873103);国家自然科学基金青年科学基金资助项目(71901055,71903019);国家自然科学基金地区科学基金资助项目(72161033);国家社会科学基金重大项目(18ZDA095)。

摘  要:准确预测上市公司的违约风险,是企业信用风险评价的关键,也是金融机构信贷决策的重要依据。通过线性回归模型的信息量AIC遴选违约判别能力最大的指标组合,采用粒子群优化算法构建基于XGBoost的违约预测模型。选取中国A股3425家上市公司不同时间窗口的数据为样本进行违约预测,将所构建的PSO-XGBoost模型与逻辑回归、支持向量机等13种预测模型对比,验证所建模型的有效性。通过UCI数据库中的3个公开信用数据集,利用Friedman检验,验证所建模型的稳健性。研究表明:使用上市公司数据与13种模型对比,PSO-XGBoost模型提高了预测精度G-mean;使用3个公开信用数据集,在多个评价指标上,PSO-XGBoost模型的平均预测性能显著优于对比模型;通过指标对预测结果的贡献获得指标重要性得分,增强了预测模型的可解释性。研究发现:“资产负债率”“流动比率”“长期资本负债率”等财务指标对违约预测的影响最大,“行业景气指数”“社会消费品零售总额增长率”“流通中现金(M0)供应量同比增长率”等指标是影响违约预测的重要指标。本研究可以为提高违约风险预测的准确性提供有效的方法和实证证据,有助于加强上市公司违约风险的预警和防范,降低违约风险监管成本,为企业管理者、债权人及投资者提供良好的决策支持。Accurate prediction of default risk of listed companies is essential to credit risk evaluation and an important basis for financial institutions to make credit decisions.This paper,by selecting the optimal feature subset with a strong default discriminative ability using the linear regression model based on the Akaike information criterion(AIC)measure,and utilizing particle swarm optimization(PSO)algorithm,builds an extreme gradient boosting(XGBoost)default prediction model based on selected feature subset.Based on the dataset covering 3425 A-share listed companies in China for different time windows,it empirically compares the proposed model(PSO-XGBoost)with thirteen well-known benchmark models,including logistic regression and support vector machine,to check the effectiveness of the model.Moreover,it uses Friedman test to further examine the significant difference between the proposed model and the benchmark models using three credit datasets from UCI machine learning repository.The empirical results on listed companies dataset show that the proposed model has a good prediction performance and outperforms other benchmark models in terms of geometric mean(G-mean).The majority of performance measures on three credit datasets show that the average prediction performance of the proposed model surpasses that of other benchmark models.This paper obtains the feature importance measured by the relative contribution of each feature to the prediction results and increases the interpretability of the model.The findings reveal that financial indicators containing asset liability ratio,current ratio,and long-term debt to asset ratio have the greatest effects on default prediction.Macro factors including industry prosperity index,gross retail sales growth rate of consumer goods,and growth rate of cash in circulation(M0)supply,are important features affecting default prediction.This paper provides effective methods and empirical evidence for improving the prediction accuracy of default risk,which helps strengthen the early wa

关 键 词:违约预测 指标组合遴选 决策树参数 

分 类 号:F830.56[经济管理—金融学] TP18[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象