检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李懿恒 杜晨曦 杨燕燕 李翔宇 LI Yiheng;DU Chenxi;YANG Yanyan;LI Xiangyu(School of Software Engineering,Beijing Jiaotong University,Beijing 100044,China)
出 处:《计算机应用》2022年第2期475-484,共10页journal of Computer Applications
基 金:国家自然科学基金资助项目(61806108,62003028);中央高校基本科研业务费专项资金资助项目(2019RC055,2019RC045);北京市级大学生创新创业训练计划项目(202110004107)。
摘 要:针对大多数粒计算特征选择算法未考虑数据的类别不平衡性的问题,提出一种融合伪标签策略的类别不平衡数据特征选择算法。首先,为了便于研究类别不平衡数据特征选择算法,重新定义样本和数据集一致度的概念,并设计了相应特征选择的贪婪前向搜索算法;其次,引入伪标签策略以平衡数据的类别分布,并将所学样本的伪标签融入一致性测度中,以构造伪标签一致度来估计类别不平衡数据集的特征;最后,通过保持类别不平衡数据集的伪标签一致度不变,设计一种面向类别不平衡数据的基于伪标签一致性的特征选择算法(PLCFS)。实验结果表明,所提PLCFS的性能仅次于最大相关最小冗余(mRMR)算法,而优于Relief算法和基于一致性的特征选择算法(CFS)。Aiming at the problem that most algorithms of granular computing ignore the class-imbalance of data,a feature selection algorithm integrating pseudo-label strategy was proposed to deal with class-imbalanced data.Firstly,to investigate feature selection from class-imbalanced data conveniently,the sample consistency and dataset consistency were re-defined,and the corresponding greedy forward search algorithm for feature selection was designed.Then,the pseudo-label strategy was introduced to balance the class distribution of the data.By integrating the learned pseudo-label of a sample into consistency measure,the pseudo-label consistency was defined to estimate the features of the class-imbalanced dataset.Finally,an algorithm for Pseudo-Label Consistency based Feature Selection(PLCFS)for class-imbalanced data was developed based on the preservation of the pseudo-label consistency measure for the class-imbalanced dataset.Experimental results indicate that the proposed PLCFS has the performance only lower than max-Relevancy and Min-Redundancy(mRMR)algorithm,but outperforms Relief algorithm and algorithm for Consistency-based Feature Selection(CFS).
关 键 词:粒计算 伪标签 类别不平衡数据 特征选择 一致性测度
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.191.89.23