Agglomerative hierarchical clustering algorithm based on hesitant fuzzy set  (Cited by: 1)


Authors: LI Wenquan (李文全); MAO Yimin (毛伊敏); PENG Xindong (彭新东) (School of Information Engineering, Shaoguan University, Shaoguan, Guangdong 512005, China)

Affiliation: [1] School of Information Engineering, Shaoguan University, Shaoguan, Guangdong 512005, China

Source: Journal of Computer Applications (《计算机应用》), 2023, No. 12, pp. 3755-3763 (9 pages)

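Funding: National Natural Science Foundation of China (62006155); Scientific Research Project of the Department of Education of Guangdong Province (2022ZDJS048); Characteristic Innovation Project of Guangdong Provincial Universities (2023KTSCX137).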

Abstract: To address information distortion, poor objectivity of attribute weights, and high time complexity in hesitant fuzzy clustering analysis, an Agglomerative Hierarchical Clustering algorithm based on Hesitant Fuzzy set (AHCHF) was proposed. Firstly, the average value of each hesitant fuzzy element was used to extend the data objects with a smaller hesitancy degree. Secondly, the weights of the data objects before and after extension were calculated using the original information entropy and the internal maximum difference, and the comprehensive attribute weights were determined according to the minimum discrimination information between the two weight vectors. Finally, with the goal of reducing the sum of weighted distances, a center point construction method with a constant hesitancy degree was given. Experimental results on specific examples and synthetic datasets show that, compared with the classic Hesitant Fuzzy Hierarchical Clustering algorithm (HFHC) and the more recent Fuzzy Hierarchical Clustering Algorithm (FHCA), AHCHF increases the mean Silhouette Coefficient (SC) by 23.99% and 9.28% respectively, and shortens the running time by 27.18% and 6.40% on average respectively. These results verify that the proposed algorithm can effectively solve the problems of information distortion and poor objectivity of attribute weights, and that it improves clustering effect and clustering performance.
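The first step described in the abstract (extending a data object with a smaller hesitancy degree by the average value of its hesitant fuzzy element) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names `extend_hfe` and `weighted_hamming` are hypothetical, the distance is the commonly used hesitant normalized Hamming distance (an assumption, since the abstract does not state which distance the paper uses), and the attribute weights are simply taken as given rather than derived from entropy and minimum discrimination information.

```python
from statistics import fmean


def extend_hfe(hfe, target_len):
    """Pad a hesitant fuzzy element (list of membership degrees in [0, 1])
    with its own mean until it holds target_len values; return it sorted."""
    if not hfe:
        raise ValueError("a hesitant fuzzy element needs at least one value")
    if len(hfe) > target_len:
        raise ValueError("target_len is shorter than the element itself")
    mean = fmean(hfe)
    # Appending the mean leaves the element's average unchanged.
    padded = list(hfe) + [mean] * (target_len - len(hfe))
    return sorted(padded)


def weighted_hamming(obj_a, obj_b, weights):
    """Weighted hesitant normalized Hamming distance between two objects.

    obj_a, obj_b: one hesitant fuzzy element per attribute.
    weights: attribute weights, assumed non-negative and summing to 1.
    """
    total = 0.0
    for h_a, h_b, w in zip(obj_a, obj_b, weights):
        n = max(len(h_a), len(h_b))
        e_a, e_b = extend_hfe(h_a, n), extend_hfe(h_b, n)
        total += w * sum(abs(x - y) for x, y in zip(e_a, e_b)) / n
    return total
```

For example, `extend_hfe([0.2, 0.4], 4)` pads the element with two copies of its mean (about 0.3), and `weighted_hamming([[0.2, 0.4]], [[0.3]], [1.0])` evaluates to approximately 0.1.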

Keywords: hesitant fuzzy set; cluster analysis; hesitancy degree; data mining; fuzzy entropy

Classification number: TP391.7 [Automation and Computer Technology: Computer Application Technology]

 
