Study of Massive Data Mining Based on Cloud Computing

Cited by: 97

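Affiliations: [1] School of Management Engineering, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi 710061; [2] School of Automation, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi 710061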

Source: Computer Technology and Development (《计算机技术与发展》), 2013, No. 2, pp. 69-72 (4 pages)

Funding: National Natural Science Foundation of China (61100165/F020508); Natural Science Foundation of Shaanxi Province (2007F18)

Abstract: To achieve efficient, low-cost mining of massive data and to provide a reference for enterprise decision-making, a massive data mining model based on cloud computing is proposed. In this model, both the processing and the storage of massive data take place in the cloud computing environment. The massive data are first preprocessed into a structurally consistent form. The MapReduce model on the cloud computing platform is then applied to process the data efficiently in parallel, and the required data mining results are finally obtained. Massive data mining based on cloud computing is markedly more efficient than traditional data mining, the accuracy of the mining results improves to some extent, and the advantage of the model becomes more pronounced as the data volume grows.
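The three stages named in the abstract (preprocessing into a consistent structure, MapReduce-style parallel processing, and collecting the result) can be illustrated with a minimal single-process sketch. This is not the authors' implementation, which the record does not show. The sample data, the item-counting task, and every function name below (`preprocess`, `map_phase`, `shuffle`, `reduce_phase`, `run_job`) are assumptions chosen only for illustration. A real deployment would run the map and reduce steps on a distributed framework rather than in one Python process.

```python
from __future__ import annotations

from collections import defaultdict
from typing import Iterable, Iterator

# Hypothetical raw records with inconsistent structure (illustrative data only).
RAW_RECORDS = [
    " Milk, Bread ",
    "bread;eggs",
    "",
    "MILK,eggs,bread",
]


def preprocess(raw: str) -> list[str]:
    """Normalize one raw record into a sorted list of unique lowercase items."""
    items = raw.replace(";", ",").split(",")
    return sorted({item.strip().lower() for item in items if item.strip()})


def map_phase(record: list[str]) -> Iterator[tuple[str, int]]:
    """Emit an (item, 1) pair for every item in a preprocessed record."""
    for item in record:
        yield item, 1


def shuffle(pairs: Iterable[tuple[str, int]]) -> dict[str, list[int]]:
    """Group emitted values by key, as the framework would between map and reduce."""
    groups: dict[str, list[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: list[int]) -> tuple[str, int]:
    """Sum the counts collected for one key."""
    return key, sum(values)


def run_job(raw_records: Iterable[str]) -> dict[str, int]:
    """Run preprocessing, map, shuffle, and reduce in sequence; empty records are dropped."""
    cleaned = [rec for rec in (preprocess(r) for r in raw_records) if rec]
    pairs = (pair for rec in cleaned for pair in map_phase(rec))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(run_job(RAW_RECORDS))
```

Item counting stands in here for the mining task because it is the simplest computation that uses all three stages. Each `map_phase` call depends only on its own record, which is the property that lets a MapReduce platform distribute the work across machines.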

Keywords: cloud computing; data mining; massive data; MapReduce; data preprocessing

Classification: TP31 [Automation and Computer Technology: Computer Software and Theory]
