基于Hadoop的时态信息存储与时态关系演算问题研究  被引量:1

Research on problem of temporal information storage and temporal relational calculus based on Hadoop

在线阅读下载全文

作  者:左亚尧[1] 封朝永 陈磊[1] 

机构地区:[1]广东工业大学计算机学院,广州510006

出  处:《计算机应用研究》2014年第5期1390-1395,共6页Application Research of Computers

基  金:国家自然科学基金资助项目(60970044;61272067;60736020);广东省自然科学基金资助项目(S2011040004281;S2013010014457)

摘  要:面对海量的非结构化时态信息,构建了在分布式环境下的数据存储模型,并在此基础上提出一种基本的时态数据处理方法。使用Hadoop平台下的分布式、非结构化数据库HBase对海量时态数据进行存储,构造以时态集合为时态存储单元的时态数据存储模型;针对分布式处理特征和时态集合数据类型,提出一种在Map/Reduce编程计算模式下进行海量时态信息关系演算的实现方法;通过扩展时态区间关系运算,实现以时态集合为基本时态数据操作对象的交、并等关系运算。以医疗时态数据作为研究实例,表明了所提出的时态数据存储模型和关系演算方案在分布式应用系统下的适用性。When facing the large amounts of unstructured temporal information, this paper established a data storage model under the distributed environment, and put forward a basic method about temporal data processing. It used the distributed and unstructured databases HBase which was under the Hadoop platform to store temporal data, then built the temporal storage data model by temporal storage unit which was based on temporal set. And for the characteristics of distributed processing and data types of temporal set, it proposed an implementation method about the relational calculus of massive temporal information in the model of Map/Reduce. Through extending relation calculation of temporal interval, it achieved relational calculus such as intersect operation, union operation which using temporal set as basement temporal data processing object. It gave the research example of medical temporal data show the applicability of the proposed data storage model and relational calculus scheme un- der the distributed application system.

关 键 词:时态信息 HADOOP 数据存储模型 时态关系演算 医疗时态数据 

分 类 号:TP301[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象