检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李韬睿 徐超 胡龙舟 朱彤 白海 LI Taorui;XU Chao;HU Longzhou;ZHU Tong;BAI Hai(State Grid Hubei Electric Power Co.,Ltd.Hubei EHV Transmission&Substation Company,Wuhan 430050,China)
机构地区:[1]国网湖北省电力有限公司超高压公司,湖北武汉430050
出 处:《微型电脑应用》2022年第10期90-93,共4页Microcomputer Applications
摘 要:面对海量信息的有效存储,为了保证存储信息的抽取和查询的效率,研究基于云计算技术的海量信息分布式的存储方法。采用GFS作为分布式文件系统和HDFS管理节点/存储节点架构作为分布式存储技术的依据,形成极大存储容量的计算机群,对信息实行并行处理;生成事实表,分析和处理不同维度和粒度的情况下的信息后,对其实行数据聚集;采用基于云计算技术改进ETL处理算法实行海量信息抽取,存储在数据库中,用户即可根据需求实行数据库信息查询。实验结果表明,该方法的存储性能较好,物理节点的增加会提高信息的插入效率,并且抽取后的信息信噪比较高,信息查询速度较快。In the face of the effective storage of massive information,in order to ensure the efficiency of the extraction and query of stored information,the distributed storage method of massive information based on cloud computing technology is studied.Using GFS as a distributed file system and HDFS management node/storage node architecture as the basis of distributed storage technology,a computer group is formed with a large storage capacity and implementing parallel processing of information.And the fact table is generated to analyze and process the information in different dimensions and granularity,and implement the data aggregation.The ETL processing algorithm based on cloud computing technology is improved to extract massive information and store it in the database,so that users can query the database information according to their needs.The experimental results show that the storage performance of this method is good,the increase of physical nodes will improve the insertion efficiency of information,and the SNR of the extracted information is high,and the information query speed is fast.
关 键 词:云计算技术 海量信息 分布式存储 数据聚集 信息查询
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.13