检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:鲁淑霞[1] 张振莲 翟俊海[1] LU Shuxia;ZHANG Zhenlian;ZHAI Junhai(College of Mathematics and Information Science,Hebei Province Key Laboratory of Machine Learning and Computational Intelligence,Hebei University,Baoding 071002,China)
机构地区:[1]河北大学数学与信息科学学院,河北省机器学习与计算智能重点实验室,保定071002
出 处:《南京航空航天大学学报》2023年第2期339-346,共8页Journal of Nanjing University of Aeronautics & Astronautics
基 金:河北省科技计划重点研发项目(19210310D);河北省自然科学基金(F2021201020)。
摘 要:针对非平衡数据分类问题,提出了一种基于代价敏感的惩罚AdaBoost算法。在惩罚Adaboost算法中,引入一种新的自适应代价敏感函数,赋予少数类样本及分错的少数类样本更高的代价值,并通过引入惩罚机制增大了样本的平均间隔。选择加权支持向量机(Support vector machine,SVM)优化模型作为基分类器,采用带有方差减小的随机梯度下降方法(Stochastic variance reduced gradient,SVRG)对优化模型进行求解。对比实验表明,本文提出的算法不但在几何均值(G-mean)和ROC曲线下的面积(Area under ROC curve,AUC)上明显优于其他算法,而且获得了较大的平均间隔,显示了本文算法在处理非平衡数据分类问题上的有效性。How to improve the classification accuracy of minority instances is one of the hot topics in machine learning research.In order to solve the problem of imbalanced data classification,a penalized AdaBoost algorithm based on cost sensitivity is proposed.In the penalized Adaboost algorithm,a new adaptive cost sensitive function is introduced,which gives higher cost value to the minority instances and the misclassified minority instances.It can obtain a larger average margin by introducing penalty mechanism.The weighted support vector machine(SVM)optimization model is used as the base classifier.The stochastic variance reduced gradient(SVRG)with variance reduction method is used to solve the optimization model.The comparative experiments show that the proposed algorithm is not only superior to other algorithms in terms of geometric⁃mean(G-mean)and area under ROC curve(AUC),but also can obtain a larger average margin by introducing penalty mechanism,which fully demonstrates the effectiveness of the proposed algorithm in handling imbalanced data classification problems.
关 键 词:非平衡数据 惩罚AdaBoost 自适应代价敏感函数 平均间隔 随机梯度下降
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.62