一种新的不确定树模式聚类算法被引量：1

A novel clustering algorithm for uncertain tree

出　　处：《计算机工程与科学》2013年第7期156-163,共8页Computer Engineering & Science

基　　金：湖南省工业支撑计划项目(2012GK2006);湖南省教育厅科学研究资助项目(12C0291);吉首大学校级科研资助项目(11JD051)

摘　　要：不确定树模式聚类是数据挖掘领域中的一个重要问题,提出了一种新的不确定树模式聚类算法,有效地解决了因数据的不确定性而导致的无法聚类的问题。为了更加准确地度量树模式之间的相似性,提出了一种语义相似度计算方法与结构相似度计算方法。设计了一个动态聚类过程,自适应获取聚类阈值,较大程度上减少了人为干扰导致聚类结果不准确的影响,使得具有相似结构的子树聚集在同一个相似分组中,不同分组之间的子树相似度达到最小化。通过模拟数据和真实环境两部分实验表明,算法有效可行,聚类结果较准确且具有较好的运行效率。Uncertain tree clustering is an important problem in data mining domain. In this paper, a new uncertain tree clustering algorithm is proposed. The algorithm effectively resolves the clustering problems for uncertain data. In order to improve accurate measurement on the similarities among trees, the method of semantic similarity and structural similarity are presented. A dynamic clustering process is designed in which self-adaptive threshold be applied so as to greatly reduce the jamming impact on the result accuracy. This process can cluster subtrees of similar structure within similar groups , minimizing the similarity of subtree groups. Both simulation and real experiments show that the algorithm is effec- tive and efficient and the clustering result is accurate.

关键词：数据挖掘有序树频繁子树相似度不确定树聚类

分类号：TP311[自动化与计算机技术—计算机软件与理论]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种新的不确定树模式聚类算法被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种新的不确定树模式聚类算法 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

一种新的不确定树模式聚类算法被引量：1