Research on the Classification of Imbalanced Data Based on the GBMTS Algorithm

Cited by: 6

Authors: Gu Yuping[1], Cheng Longsheng[1], Chen Xianglai

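Affiliations: [1] School of Economics and Management, Nanjing University of Science and Technology, Nanjing, Jiangsu 210094; [2] General Management Department, Nanjing Kangni Group, Nanjing, Jiangsu 210038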

Source: Journal of Applied Statistics and Management (《数理统计与管理》), 2016, No. 6, pp. 1016-1027 (12 pages)

Funding: National Natural Science Foundation of China (71271114)

Abstract: Solving the classification problem for imbalanced data is of great practical significance. The Mahalanobis-Taguchi system (MTS) uses a single normal class to construct the reference space and the measurement scale, and builds a data classification model on that basis, which makes it well suited to imbalanced data classification. Building on the traditional MTS method, this paper combines the signal-to-noise ratio with classification accuracy indicators such as F-value and G-mean to establish a genetic-algorithm-based optimization model for the reference space, and applies the Bagging ensemble algorithm to construct an improved MTS algorithm, GBMTS. Experiments comparing different classification methods on the relevant data sets show that GBMTS handles imbalanced data classification more effectively than the other classification algorithms.
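The MTS building blocks named in the abstract can be illustrated with a minimal sketch. It is not the authors' GBMTS implementation: it omits the genetic-algorithm feature selection and the Bagging ensemble, and it assumes NumPy, the common MTS convention of dividing the Mahalanobis distance by the number of features, and the standard confusion-matrix definitions of F-value and G-mean. All function names and the threshold are hypothetical.

```python
import numpy as np

def fit_reference_space(normal):
    """Estimate the reference space from normal-class samples only:
    per-feature mean and std, plus the inverse correlation matrix."""
    mean = normal.mean(axis=0)
    std = normal.std(axis=0, ddof=1)
    corr = np.corrcoef(normal, rowvar=False)
    return mean, std, np.linalg.inv(corr)

def mahalanobis_distance(x, mean, std, corr_inv):
    """Scaled squared Mahalanobis distance of each row of x to the
    reference space (divided by the number of features k)."""
    z = (x - mean) / std
    k = x.shape[1]
    return np.einsum("ij,jk,ik->i", z, corr_inv, z) / k

def f_value_and_g_mean(y_true, y_pred):
    """F-value and G-mean, treating label 1 as the minority (abnormal) class."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = np.sum((y_true == 1) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    tn = np.sum((y_true == 0) & (y_pred == 0))
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    denom = precision + recall
    f_value = 2 * precision * recall / denom if denom else 0.0
    g_mean = float(np.sqrt(recall * specificity))
    return float(f_value), g_mean

# Synthetic illustration: 200 normal samples, 10 shifted abnormal samples.
rng = np.random.default_rng(0)
normal = rng.normal(size=(200, 4))
abnormal = rng.normal(loc=3.0, size=(10, 4))
mean, std, corr_inv = fit_reference_space(normal)

x = np.vstack([normal, abnormal])
y_true = np.r_[np.zeros(200, dtype=int), np.ones(10, dtype=int)]
threshold = 3.0  # hypothetical cut-off; the paper optimizes its model instead
y_pred = (mahalanobis_distance(x, mean, std, corr_inv) > threshold).astype(int)
print(f_value_and_g_mean(y_true, y_pred))
```

Because the reference space is fitted on the normal class alone, the scarcity of abnormal samples does not affect the distance scale, which is the property the abstract cites as making MTS suitable for imbalanced data.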

Keywords: Mahalanobis-Taguchi system; imbalanced data; classification; genetic algorithm; Bagging algorithm

Classification codes: O212 [Science: Probability Theory and Mathematical Statistics]; TP181 [Science: Mathematics]

 
