Parallel hierarchical clustering algorithm based on preprocessed data

Cited by: 4

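Affiliations: [1] Hunan Institute of Humanities, Science and Technology, Loudi, Hunan 417000; [2] School of Computer and Communication, Hunan University, Changsha 410082; [3] Hunan Institute of Engineering, Xiangtan, Hunan 411101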

Source: Application Research of Computers, 2010, No. 1, pp. 71-73 (3 pages)

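Funding: National Natural Science Foundation of China (90715029); Hunan Provincial Natural Science Foundation (07JJ6116); Hunan Provincial Key Construction Discipline Project; Hunan Provincial Department of Education Project (09C546)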

Abstract: Hierarchical clustering plays an important role in image processing, intrusion detection, and bioinformatics, and is one of the most actively studied topics in data mining. Existing parallel hierarchical clustering algorithms based on the SIMD model perform poorly on massive data sets. To address this shortcoming, the paper proposes an adaptive parallel hierarchical clustering algorithm based on data preprocessing that clusters n input data points in O((λn)^2/p) time using O(p) processors, where 1 ≤ p ≤ n/log n and 0.1 ≤ λ ≤ 0.3. A performance comparison with results in the existing literature indicates, according to the authors, that it is the first parallel hierarchical clustering algorithm without memory conflicts and that it improves on previously published results.

Keywords: hierarchical clustering; parallel algorithm; data preprocessing

Classification: TP301 [Automation and Computer Technology: Computer System Architecture]
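
The time bound stated in the abstract can be made concrete with a little arithmetic. The sketch below is not the authors' algorithm; it only evaluates the expression (λn)^2/p under the stated constraints 1 ≤ p ≤ n/log n and 0.1 ≤ λ ≤ 0.3. The function names are hypothetical, the logarithm is assumed to be base 2 (the abstract does not say), and the comparison against n^2/p is an assumed reference point for a method that works on all n points, not a figure taken from the paper.

```python
import math


def max_processors(n: int) -> int:
    """Largest p allowed by the constraint 1 <= p <= n / log n.

    Assumption: log is base 2; the abstract does not specify the base.
    """
    if n < 2:
        raise ValueError("n must be at least 2")
    return max(1, int(n / math.log2(n)))


def bound_value(n: int, p: int, lam: float) -> float:
    """Evaluate (lam * n)^2 / p, ignoring the constant hidden by big-O."""
    if not 0.1 <= lam <= 0.3:
        raise ValueError("lambda must lie in [0.1, 0.3]")
    if not 1 <= p <= max_processors(n):
        raise ValueError("p must satisfy 1 <= p <= n / log n")
    return (lam * n) ** 2 / p


def reduction_factor(lam: float) -> float:
    """Ratio of n^2 / p to (lam * n)^2 / p, which simplifies to 1 / lam^2.

    Assumption: n^2 / p stands in for a method that processes all n points.
    """
    return 1.0 / lam ** 2


if __name__ == "__main__":
    n = 1024
    p = max_processors(n)
    for lam in (0.1, 0.2, 0.3):
        print(lam, p, bound_value(n, p, lam), reduction_factor(lam))
```

Because the ratio depends only on λ, the bound is smaller than n^2/p by a factor between roughly 11 (λ = 0.3) and roughly 100 (λ = 0.1), independent of n and p.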
