基于隶属度的模糊加权k近质心近邻算法  被引量:2

Fuzzy Weighted k-Nearest Centroid Neighbor Algorithm Based on Membership

在线阅读下载全文

作  者:刘利 张德生[1] 肖燕婷[1] LIU Li;ZHANG Desheng;XIAO Yanting(School of Sciences,Xi’an University of Technology,Xi’an 710054,China)

机构地区:[1]西安理工大学理学院,西安710054

出  处:《计算机工程》2022年第7期122-129,共8页Computer Engineering

基  金:国家自然科学基金青年科学基金项目(11801438)。

摘  要:模糊k近质心近邻算法(FKNCN)的分类结果易受噪声点和离群点影响,并且算法对所有样本特征同等对待,不能体现样本特征的差异性。针对这两个问题,提出基于隶属度的模糊加权k近质心近邻算法MRFKNCN。利用密度聚类思想构造新的隶属度函数计算训练样本的隶属度,以减小噪声或离群样本对分类结果的影响。在此基础上,设计基于冗余分析的Relief-F算法计算每个特征的权重,删去较小权重所对应的特征和冗余特征,并通过加权欧氏距离选取有代表性的k个近质心近邻,提高分类性能。最终,根据最大隶属度原则确定待分类样本的类别。利用UCI和KEEL中的多个数据集对MRFKNCN算法进行测试,并与KNN、KNCN、LMKNCN、FKNN、FKNCN2和BMFKNCN算法进行比较。实验结果表明,MRFKNCN算法的分类性能明显优于其他6个对比算法,平均准确率最高可提升4.68个百分点。The classification results of Fuzzy K-Nearest Centroid Neighbor(FKNCN)algorithm is susceptible to noise points,outliers,at the same time,the algorithm treats all sample features equally and cannot reflect the difference of sample features. To solve these two problems,fuzzy weighted k-nearest centroid neighbor algorithm(MRFKNCN) based on membership was proposed. Firstly,a new membership function is constructed by the idea of density clustering and the membership degree of training samples is calculated,which can avoid the influence of noise or outlier samples on the classification results. Then,the weight of each feature was calculated by the Relief-F algorithm of redundancy analysis,the features and redundant features corresponding to smaller weights were deleted,and k representative nearest centroid neighbors were selected by weighted Euclidean distance to improve the performance of classification. Finally,the classification of samples to be classified is determined by the maximum membership principle. The MRFKNCN algorithm is tested using multiple datasets in UCI and KEEL,and compared with KNN,KNCN,LMKNCN,FKNN,FKNCN2 and BMFKNCN.The experimental results show that the classification performance of MRFKNCN algorithm is significantly better than the other six comparison algorithms,the average accuracy can be improved by up to 4.68 percentage points.

关 键 词:k近质心近邻算法 隶属度 冗余分析 特征选择 数据分类 

分 类 号:TP301[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象