决策树C4.5连续属性分割阈值算法改进及其应用  被引量:41

Improvement of C4.5 decision tree continuous attributes segmentation threshold algorithm and its application

在线阅读下载全文

作  者:姚亚夫[1] 邢留涛[1] 

机构地区:[1]中南大学机电工程学院,湖南长沙410083

出  处:《中南大学学报(自然科学版)》2011年第12期3772-3776,共5页Journal of Central South University:Science and Technology

基  金:国家高技术研究发展计划("863"计划)(2009AA11Z 206)

摘  要:结合Fayyad边界点原理提出一种新的连续值属性最佳分割阈值的选择算法。根据Fayyad连续值属性的最佳分割点总在边界点处的原理,只在连续属性分界点处的少数几个分割点中选择最佳分割阈值。构造并训练了改进C4.5分类器,将其应用于视频序列中的人车目标识别。实验结果表明:改进C4.5算法的计算量减少近20%,大大提高了决策树的生成效率,分类准确率也略有提高。In order to reduce the computational complexity of this algorithm,combined with the Fayyad boundary point principle,a new algorithm,which selects the best segmentation threshold of the continuous attribute values,was proposed.According to the principle that there always exist the boundary points at the optimal segmentation point of the continuous attribute values,the improved algorithm only selected the best segmentation threshold from the few points of boundary.The improved C4.5 classifier was established and trained,and then it was applied in the recognition of people and vehicle targets in video sequences.The results show that the computation of the improved C4.5 algorithm is reduced by nearly 20% and also greatly improves the efficiency of generating a decision tree,and at the same time,the classification accuracy is slightly increased.

关 键 词:决策树C4.5算法 连续属性 分割阈值 信息增益率 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象