检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李建[1] 付小斌 吴媛媛 LI Jian;FU Xiaobin;WU Yuanyuan(School of Computer Science,Southwest Petroleum University,Chengdu 610500,China)
机构地区:[1]西南石油大学计算机科学学院,成都610500
出 处:《计算机工程》2019年第2期290-295,共6页Computer Engineering
基 金:国家科技重大专项(2016ZX05020-006)
摘 要:决策树算法用于井漏分类时,由于井漏数据离散化后多值属性占比较大,且具有多值偏向的缺点,分类效果不理想。为此,提出一种基于改进ID3的AFIV-ID3算法。在ID3的基础上引入属性重要度计算新的信息熵,属性重要度大小由决策者依靠先验或领域知识决定。在信息增益计算中加入关联度函数比,对信息增益值做出修正。AFIV-ID3算法克服了ID3多值偏向的缺点,提高了数据中重要属性的权重,从而提升井漏类型分类精度。4组UCI数据集和真实井漏数据测试结果表明,该算法的分类精度优于ID3和C4. 5算法,并能够将人工经验法不稳定的分类精度提高至约72. 23%。When the decision tree algorithm is used in well leakage classification,the classification effect is not satisfactory because of the large proportion of multi-valued attributes after the well leakage data is discretized,and because the algorithm has the shortcoming of multi-value bias.Therefore,an improved AFIV-ID3 algorithm based on ID3 is proposed.On the basis of ID3,attribute importance is introduced to calculate new information entropy.Attribute importance is determined by the decision maker depending on prior knowledge or domain knowledge.The association function ratio is added to the information gain calculation to modify the information gain value.The AFIV-ID3 algorithm overcomes the shortcoming of ID3 multi-value bias,improves the weight of important attributes in the data,and effectively improves the classification accuracy of well leakage type.The test results of four UCI data sets and real well leakage data show that the classification accuracy of this algorithm is better than that of ID3 and C4.5 algorithm,and the unstable classification accuracy of artificial experience method can be improved to about 72.23 %.
关 键 词:井漏类型 ID3算法 关联度函数比 属性重要度 多值偏向
分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222