检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《计算机应用研究》2014年第5期1390-1395,共6页Application Research of Computers
基 金:国家自然科学基金资助项目(60970044;61272067;60736020);广东省自然科学基金资助项目(S2011040004281;S2013010014457)
摘 要:面对海量的非结构化时态信息,构建了在分布式环境下的数据存储模型,并在此基础上提出一种基本的时态数据处理方法。使用Hadoop平台下的分布式、非结构化数据库HBase对海量时态数据进行存储,构造以时态集合为时态存储单元的时态数据存储模型;针对分布式处理特征和时态集合数据类型,提出一种在Map/Reduce编程计算模式下进行海量时态信息关系演算的实现方法;通过扩展时态区间关系运算,实现以时态集合为基本时态数据操作对象的交、并等关系运算。以医疗时态数据作为研究实例,表明了所提出的时态数据存储模型和关系演算方案在分布式应用系统下的适用性。When facing the large amounts of unstructured temporal information, this paper established a data storage model under the distributed environment, and put forward a basic method about temporal data processing. It used the distributed and unstructured databases HBase which was under the Hadoop platform to store temporal data, then built the temporal storage data model by temporal storage unit which was based on temporal set. And for the characteristics of distributed processing and data types of temporal set, it proposed an implementation method about the relational calculus of massive temporal information in the model of Map/Reduce. Through extending relation calculation of temporal interval, it achieved relational calculus such as intersect operation, union operation which using temporal set as basement temporal data processing object. It gave the research example of medical temporal data show the applicability of the proposed data storage model and relational calculus scheme un- der the distributed application system.
关 键 词:时态信息 HADOOP 数据存储模型 时态关系演算 医疗时态数据
分 类 号:TP301[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.62