连续型数据的辨识矩阵属性约简方法  被引量:2

A discernibility matrix-based attribute reduction for continuous data

在线阅读下载全文

作  者:冯丹[1,2] 黄洋[2] 石云鹏[2] 王长忠[2] 

机构地区:[1]国网葫芦岛供电公司信息通信分公司,辽宁葫芦岛125000 [2]渤海大学数理学院,辽宁锦州121000

出  处:《智能系统学报》2017年第3期371-376,共6页CAAI Transactions on Intelligent Systems

基  金:国家自然科学基金项目(61572082;61673396;61473111;61363056);辽宁省教育厅项目(LZ2016003);辽宁省自然科学基金项目(2014020142);辽宁省高校创新团队计划项目(LT2014024)

摘  要:属性约简是粗糙集理论在数据处理方面的重要应用,已有的针对连续型数据的属性约简算法主要集中在基于正域的贪心算法,该方法只考虑了一致样本和其他样本的可辨识性,而忽略了边界样本点间可区分性。为了克服基于正域算法的缺点,提出了连续型数据的辨识矩阵属性约简模型,该模型不但考虑了正域样本的一致性,同时考虑了边界样本的可分性。基于该模型,分析了属性约简结构,定义了辨识矩阵来刻画特征子集的分类能力,构造了实值型数据的属性约简启发式算法,并利用UCI标准数据集进行了验证。理论分析和实验结果表明,提出的算法能够有效地处理连续型数据,提高了数据的分类精度。In data processing,attribute reduction is an important application of rough set theory. The existing methods for continuous data mainly concentrate on the greedy algorithms based on the positive region. These methods take account of only the identifiability between consistent samples and other samples while ignoring distinguishability among the boundary samples. To overcome the disadvantage based on the positive domain algorithm,this paper proposed a new method for attribute reduction using a discernibility matrix. The model considers not only the consistency of samples in the positive region but also the reparability of boundary samples. On this basis,this paper analyzes the structure of attribute reduction and defines a discernibility matrix to characterize the discernibility ability of a subset of attributes. Next,an attribute reduction algorithm was designed based on the discernibility matrix. The validity of the proposed algorithm was verified using UCI standard data sets and theoretical analysis.

关 键 词:邻域关系 粗糙集 属性约简 辨识矩阵 启发式算法 

分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象