检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]东华大学计算机科学与技术学院,上海201620
出 处:《计算机与现代化》2008年第9期47-50,共4页Computer and Modernization
基 金:上海市科委资助项目(05DZ11C06)
摘 要:ID3算法作为一种流行的决策树算法,因为其算法简单、易实现而被广泛使用。但其生成的树结构往往过于庞大,复杂,也影响了算法效率。为了优化树的结构,提高树生成的效率,避免"过拟合"效应,本文将每个分类属性分类后的效果也考虑在内,即,若分类效果达到某个预定的标准则终止那条分支继续分类,并引入了最大支持度的概念,采用了前剪枝策略,对ID3算法进行了改进。实验结果显示,改进算法的确能够使生成的决策树在保证精度的基础上更加精简。As a popular algorithm of decision tree, ID3 is widely used because of its simple idea and facile realization. However, the structure of the tree produced by this algorithm is usually too large and complex, thus the performance of the algorithm is restricted. In order to enhance the efficiency of the tree-producing process and avoid "overfitting", we take the classification effect of each classifying attribute into account, that is, if the classification effect reaches a certain level, the process of classification of that branch will be terminated, and propose an improved algorithm by using the maximum support and adopting pre-pruning strategy. The experiment results show that the improved algorithm can make decision tree simpler without reducing precise.
分 类 号:TP301.6[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222