检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:周治平[1] 张道文[1] 王杰锋[1] 孙子文[1]
出 处:《南京大学学报(自然科学版)》2015年第4期741-748,共8页Journal of Nanjing University(Natural Science)
基 金:江苏省产学研联合创新资金―前瞻性联合研究项目(BY2013015-33);江苏省自然科学基金(BK20131107)
摘 要:近邻传播算法是一种快速有效的聚类方法.针对近邻传播算法在无先验知识条件下偏向参数选择的问题,使用Silhouette聚类有效性指标确定偏向参数.针对近邻传播算法在处理结构复杂或高维数据时,存在数据信息重叠的问题,提出将局部保持投影方法与近邻传播算法相结合的方法,在有效保留数据内部非线性结构的前提下,有效删除数据空间中的冗余信息.仿真结果验证了提出的算法优于传统的近邻传播算法.Affinity propagation(AP)algorithm is a fast and effective clustering method.Compared with other traditional clustering algorithms,the AP algorithm treats each data point as the candidate of the representative point to avoid the clustering results limiting in the choice of initial representative point.At the same time,the algorithm does not need the symmetry of the similarity matrix generated in the dataset with high operation speed in dealing with large-scale multi class data.Hence,AP algorithm can effectively solve the problem of non Euclidean space and large sparse matrix calculation.Due to the great advantage of the AP algorithm in clustering,it is widely applied in pattern recognition,web mining,biomedical and multi target detection,and is becoming a necessary method of data analysis.In order to well determine bias parameter of AP algorithm without prior knowledge,a novel method called silhouette clustering validity index is utilized to determine the parameter in this paper.The problem of information overlap is the main drawback of AP algorithm in dealing with complex structure or high dimensional data for clustering.In order to resolve the above problem,we propose an approaching algorithm which combines the locality preserving projections(LPP)method and the AP algorithm.It deletes the redundant information in the data space under the condition of effectively keeping the data inner nonlinear structure.The experiment results verify its accuracy and effectiveness and shows that the performance of the proposed algorithm is better than the traditional AP algorithm.
关 键 词:近邻传播算法 局部保持投影 Silhouette指标 邻域选择 流形距离
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.13