一种基于似然极大的动态聚类方法及其应用被引量：2

A Maximum Likelihood-Based Dynamic Clustering Method and Its Application

机构地区：[1]扬州大学江苏省作物遗传生理重点实验室,江苏扬州225009

出　　处：《作物学报》2007年第1期70-76,共7页Acta Agronomica Sinica

基　　金：国家自然科学基金项目(30270724;30370758);教育部新世纪优秀人才支持计划项目

摘　　要：将传统的动态聚类分析和判别分析相结合,引出一种基于似然极大的动态聚类方法,该方法以EM算法实现的极大似然估计进行类参数估计,以相应的贝叶斯后验概率判别个体的归类。模拟研究表明,该方法通常既可无偏估计类参数,又可判别最佳分类个数。与重心法动态聚类和最小组内平方和法动态聚类相比,稳健性较高。同时通过提高判别标准,可以降低误判率。用Fisher的Iris试验数据验证了方法的可行性,并将之成功应用于一个水稻F2群体的个体的主基因基因型鉴别。Clustering analysis is to determine the intrinsic grouping in a set of unlabeled data. A cluster is a collection of objects which are similar between them and are dissimilar to the objects belonging to other clusters. However, the current clustering techniques have not addressed all the requirements adequately. For instance, dealing with large number of dimensions and large number of data can be problematic because of time complexity. The effectiveness of the distance-based clustering methods depends on the definition of distance ; if an obvious distance measure doesn＇ t exist we must define it, which is not always easy, especially in multi-dimensional spaces. In addition, the choice of the optimal number of clusters in practice is impossible. Thus, choosing the correct number of clusters and the best clustering method is still a question open to discussion, in order to solve these problems, in this paper, we introduced a maximum likelihood-based dynamic clustering method, which combined the conventional dynamic clustering and discrimination analysis. The parameters of different clusters were estimated by the maximum likelihood method implemented via expectation-maximization （EM） algorithm and the objects were classified by the Bayesian posterior probability. This classified idea could increase the posterior confidence of classified individuals. The results of simulation studies showed that the proposed method not only unbiasedly estimated the corresponding cluster parameters but also differentiated the optimum clustering numbers by Bayesian information criterion （BIC）. Compared with the K-means method and the minimum square sum within groups （MinSSw） method, the proposed method was more robustness and had almost the same clustering accuracy as K-means and MinSSw methods. Moreover, the miselassified rate （MR） could be reduced by enhancing the discrimination criterion. However, the unclassified rate （UR） would be increased by enhancing the discrimination criterion. Thus, an eclectic discrimin

关键词：聚类分析后验概率贝叶斯信息准则判别分析

分类号：S11[农业科学—农业基础科学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种基于似然极大的动态聚类方法及其应用被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种基于似然极大的动态聚类方法及其应用 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

一种基于似然极大的动态聚类方法及其应用被引量：2