基于模型决策树的AdaBoost算法  被引量:5

AdaBoost algorithm based on model decision tree

在线阅读下载全文

作  者:梁云 门昌骞[1] 王文剑[2] LIANG Yun;MEN Chang-qian;WANG Wen-jian(School of Computer and Information Technology,Shanxi University,Taiyuan 030006,Shanxi,China;Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education,Shanxi University,Taiyuan 030006,Shanxi,China)

机构地区:[1]山西大学计算机与信息技术学院,山西太原030006 [2]山西大学计算智能与中文信息处理教育部重点实验室,山西太原030006

出  处:《山东大学学报(理学版)》2023年第1期67-75,共9页Journal of Shandong University(Natural Science)

基  金:国家自然科学基金资助项目(62076154,U21A20513,U1805263);中央引导地方科技发展资金资助项目(YDZX20201400001224);山西省自然科学基金资助项目(201901D111030);山西省国际科技合作重点研发计划项目(201903D421050)。

摘  要:AdaBoost算法是一种将多个基学习器通过合理策略结合生成强学习器的集成算法,其性能取决于基学习器的准确率和多样性。但弱学习器分类精度不高往往也导致了最终强分类器性能较差,因此进一步为了提高算法的分类精确率,本文提出一种MDTAda模型,首先利用基尼指数迭代构造一棵不完全决策树,然后在决策树的非纯伪叶结点上添加简单分类器,生成MDT(模型决策树),将MDT作为AdaBoost算法的基分类器,加权平均生成强分类器。在标准数据集上的实验表明,相比传统的AdaBoost算法,本文提出的算法拥有更好的泛化性能和更优的间隔分布,且在与AdaBoost算法达到相同精度时所需迭代次数更少。The AdaBoost algorithm is an ensemble algorithm that combines multiple base learners through reasonable strategies to generate a strong learner. Its performance depends on the accuracy and diversity of the base learners. However, the poor classification accuracy of weak learners often leads to poor performance of the final strong classifier. Therefore, in order to further improve the classification accuracy of the algorithm, this paper proposes an MDTAda model, which first uses Gini index to iteratively construct an incomplete decision tree. Then add a simple classifier to the non-pure pseudo-leaf nodes of the decision tree to generate MDT(model decision tree), use MDT as the base classifier of AdaBoost algorithm, and weighted average to generate a strong classifier. Experiments on standard data sets show that compared with the traditional AdaBoost algorithm, the algorithm proposed in this paper has better generalization performance and better margin distribution, and requires fewer iterations to achieve the same accuracy as AdaBoost algorithm.

关 键 词:基尼指数 决策树 集成学习 ADABOOST算法 间隔分析 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象