Improved hybrid resampling and ensemble model for imbalance learning and credit evaluation  被引量:1

在线阅读下载全文

作  者:Gang Kou Hao Chen Mohammed A.Hefni 

机构地区:[1]School of Business Administration,Southwestern University of Finance and Economics,Chengdu,610074,China [2]Department of Mining Engineering,Faculty of Engineering,Jeddah,21589,Saudi Arabia

出  处:《Journal of Management Science and Engineering》2022年第4期511-529,共19页管理科学学报(英文版)

摘  要:A clustering-based undersampling (CUS) and distance-based near-miss method are widely used in current imbalanced learning algorithms, but this method has certain drawbacks. In particular, the CUS does not consider the influence of the distance factor on the majority of instances, and the near-miss method omits the inter-class(es) within the majority of samples. To overcome these drawbacks, this study proposes an undersampling method combining distance measurement and majority class clustering. Resampling methods are used to develop an ensemble-based imbalanced-learning algorithm called the clustering and distance-based imbalance learning model (CDEILM). This algorithm combines distance-based undersampling, feature selection, and ensemble learning. In addition, a cluster size-based resampling (CSBR) method is proposed for preserving the original distribution of the majority class, and a hybrid imbalanced learning framework is constructed by fusing various types of resampling methods. The combination of CDEILM and CSBR can be considered as a specific case of this hybrid framework. The experimental results show that the CDEILM and CSBR methods can achieve better performance than the benchmark methods, and that the hybrid model provides the best results under most circumstances. Therefore, the proposed model can be used as an alternative imbalanced learning method under specific circumstances, e.g., for providing a solution to credit evaluation problems in financial applications.

关 键 词:Imbalanced learning Clustering-based under-sampling Ensemble methods Hybrid methods Credit risk evaluation 

分 类 号:F426.1[经济管理—产业经济] F724.5F764

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象