Source: Journal of Shandong University (Engineering Science), 2012, No. 2, pp. 18-22, 44 (6 pages)
Funding: National Natural Science Foundation of China (61170224); Natural Science Foundation of Shandong Province (2009ZRB019CE)
Abstract: The affinity propagation (AP) clustering algorithm is sensitive to its preference parameter, and the preference value that yields the optimal clustering is hard to determine. To address this, the authors design 2SAP, a two-stage semi-supervised clustering algorithm based on affinity propagation. In the first stage, semi-supervised clustering based on affinity propagation (SAP) is run on the whole dataset to obtain a set of exemplars; in the second stage, SAP is run again on that exemplar set to produce the final clusters. Experiments on real datasets show that, compared with SAP and with PSAP (parallel computation of semi-supervised clustering algorithm based on affinity propagation), 2SAP achieves higher CRI and FCRI values with smaller coefficients of dispersion, indicating that 2SAP is less sensitive to the preference value.
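The two-stage structure described in the abstract can be sketched as follows. This is only an illustration of the pipeline shape (cluster everything, then re-cluster the exemplars and map labels back), not the authors' implementation: the record does not give SAP's details, so `leader_clusterer` is a hypothetical stand-in (greedy leader clustering with a fixed radius) rather than affinity propagation, and the names `two_stage`, `Clusterer`, and the sample points are assumptions made for the example.

```python
from typing import Callable, List, Sequence

Point = Sequence[float]
# A clusterer maps points to labels, where labels[i] is the index of the
# exemplar (itself one of the input points) that represents point i.
Clusterer = Callable[[List[Point]], List[int]]


def two_stage(points: List[Point], stage1: Clusterer, stage2: Clusterer) -> List[int]:
    """Cluster all points, re-cluster the exemplars, and map labels back."""
    first = stage1(points)                           # exemplar index per point
    exemplars = sorted(set(first))                   # exemplar set from stage 1
    second = stage2([points[e] for e in exemplars])  # indices into `exemplars`
    position = {e: j for j, e in enumerate(exemplars)}
    # Follow each point to its stage-1 exemplar, then to that exemplar's
    # stage-2 exemplar, expressed as an index into the original points.
    return [exemplars[second[position[e]]] for e in first]


def leader_clusterer(radius: float) -> Clusterer:
    """Hypothetical stand-in for SAP: greedy leader clustering, NOT affinity propagation."""
    def run(pts: List[Point]) -> List[int]:
        leaders: List[int] = []
        labels: List[int] = []
        for i, p in enumerate(pts):
            for k in leaders:
                dist = sum((a - b) ** 2 for a, b in zip(p, pts[k])) ** 0.5
                if dist <= radius:
                    labels.append(k)   # join the first leader within the radius
                    break
            else:
                leaders.append(i)      # no leader close enough: become one
                labels.append(i)
        return labels
    return run


# Made-up sample data: three tight pairs, two of which lie close together.
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 0.0), (1.1, 0.0), (9.0, 9.0), (9.2, 9.0)]
labels = two_stage(points, leader_clusterer(0.5), leader_clusterer(2.0))
```

In this toy run the first stage yields three exemplars and the second stage merges the two nearby ones, so the final labeling has two clusters. Swapping in a real semi-supervised AP implementation for both stages would follow the same call pattern.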
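Keywords: affinity propagation; preference; semi-supervised clustering; prior information; pairwise constraints
Classification: TP301.6 [Automation and Computer Technology: Computer Architecture]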