Authors: JIANG Wenxuan (姜文煊); DUAN Youxiang (段友祥) [1]; SUN Qifeng (孙歧峰) [1]
Affiliation: [1] School of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, Shandong, China
Source: Journal of Applied Sciences (《应用科学学报》), 2021, No. 4, pp. 545-558 (14 pages)
Funding: Supported by the National Science and Technology Major Project (No. 2017ZX05009-001, No. 2016ZX05011-002) and the Fundamental Research Funds for the Central Universities (No. 18CX02020A).
Abstract: Traditional feature selection algorithms focus only on relevance and redundancy among features and ignore the interactions between features. To address this problem, the paper proposes a hybrid feature selection algorithm based on mutual information (MIHFS). The algorithm uses the classification accuracy of the K-nearest neighbor (KNN) algorithm as the metric for evaluating the classification performance of the selected features; it effectively removes redundant and irrelevant features while retaining features that interact. To evaluate its performance, MIHFS is compared with seven feature selection algorithms, including minimal redundancy maximal relevance (mRMR) and joint mutual information (JMI), on eight datasets in terms of classification accuracy, number of selected features, and algorithm stability. The experimental results show that MIHFS is highly stable, effectively reduces the dimensionality of the feature space, and selects features whose classification performance is clearly better than that of the other algorithms. Finally, MIHFS is combined with grey relation analysis (GRA) and the technique for order preference by similarity to ideal solution (TOPSIS) and applied to the geological evaluation of the first member of the Dainan Formation in the Yong'an area of the Gaoyou Sag. The evaluation reaches an accuracy of 80%, which is basically consistent with actual drilling results; this indicates high reliability and suggests that the method can effectively guide oil and gas geological evaluation.
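The abstract's central point, that features can be informative jointly even when each looks irrelevant on its own, can be illustrated with a small sketch. This is not the MIHFS algorithm, whose steps the abstract does not specify; it is only a toy example under assumed data. The helper `mutual_information` and the features `f1`, `f2`, `f3` are hypothetical names introduced for illustration.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information I(X;Y) in bits for two discrete sequences."""
    n = len(xs)
    px = Counter(xs)            # marginal counts of X
    py = Counter(ys)            # marginal counts of Y
    pxy = Counter(zip(xs, ys))  # joint counts of (X, Y)
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )


# Assumed toy data: the label is the XOR of f1 and f2; f3 is a redundant copy of f1.
f1 = [0, 0, 1, 1]
f2 = [0, 1, 0, 1]
f3 = list(f1)
label = [a ^ b for a, b in zip(f1, f2)]

# Each feature alone carries no information about the label ...
mi_f1 = mutual_information(f1, label)
mi_f2 = mutual_information(f2, label)

# ... but the pair (f1, f2), treated as one joint variable, determines it.
mi_joint = mutual_information(list(zip(f1, f2)), label)

# Adding the redundant copy f3 to f1 contributes nothing new.
mi_redundant = mutual_information(list(zip(f1, f3)), label)

print(mi_f1, mi_f2, mi_joint, mi_redundant)
```

A selector that ranks features only by their individual relevance to the label would discard both `f1` and `f2` here, even though together they predict the label perfectly; this is the kind of interaction the abstract says purely relevance- and redundancy-based methods miss.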
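Keywords: feature selection; mutual information; hybrid feature selection; K-nearest neighbor; grey relation analysis; technique for order preference by similarity to ideal solution
Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]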