重要度集成的属性约简方法研究  被引量:7

Research on ensemble significance based attribute reduction approach

在线阅读下载全文

作  者:李京政 杨习贝[1,2] 窦慧莉[1] 王平心[3] 陈向坚[1] LI Jingzheng;YANG Xibei;DOU Huili;WANG Pingxin;CHEN Xiangjian(School of Computer,Jiangsu University of Science and Technology,Zhenjiang 212003,China;School of Economics and Management,Nanjing University of Science and Technology,Nanjing 210094,China;School of Mathematics and Physics,Jiangsu University of Science and Technology,Zhenjiang 212003,China)

机构地区:[1]江苏科技大学计算机学院,江苏镇江212003 [2]南京理工大学经济管理学院,江苏南京210094 [3]江苏科技大学数理学院,江苏镇江212003

出  处:《智能系统学报》2018年第3期414-421,共8页CAAI Transactions on Intelligent Systems

基  金:国家自然科学基金项目(61572242;61503160;61502211);江苏省高校哲学社会科学基金项目(2015SJD769);中国博士后科学基金项目(2014M550293)

摘  要:启发式算法在求解约简的过程中逐步加入重要度最高的属性,但其忽视了数据扰动将会直接引起重要度计算的波动问题,从而造成约简结果的不稳定。鉴于此,提出了一种基于集成属性重要度的启发式算法框架。首先,在原始数据上进行多重采样;然后,在每次循环过程中分别计算各个采样结果上的属性重要度并对这些重要度进行集成;最后,将集成重要度最大的属性加入到约简中去。利用邻域粗糙集方法进行的实验结果表明,基于集成重要度的属性约简算法不仅能够获取更加稳定的约简,而且利用所生成的约简能够得到一致性较高的分类结果。In the process of computing reduct using a heuristic algorithm,the attribute with the highest importance is gradually added in.However,this approach neglects the fluctuation of important calculations which is directly caused by data perturbation.Notably,such fluctuation may lead to an unstable reduct result.To eliminate such an anomaly,a framework consisting of a heuristic algorithm based on the importance of the ensemble attribute was proposed.In this approach,firstly,multiple sampling is executed for raw data;secondly,in each cycle,the importance of each attribute is computed on the basis of each sampling and the importance indices are integrated;finally,the attribute with the highest importance is added into the reduct.The experimental results obtained by utilizing the neighborhood rough set method show that the new approach not only obtains a more stable reduct,but also attains the classification results with high uniformity.

关 键 词:属性约简 分类 聚类 数据扰动 集成 启发式算法 邻域粗糙集 稳定性 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象