检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《计算机工程与科学》2013年第10期58-64,共7页Computer Engineering & Science
基 金:国家科技部支撑计划课题基金(2012BAH04F01);科技创新平台(PXM2013_014212_000011)
摘 要:小文件作为信息传输、存储的重要方式,使用相当广泛,用户对其可靠性和速度的要求也在不断提高。针对目前小文件存储效率较低的问题,首先结合分布式存储系统HDFS的大文件存储优势和Redis缓存技术,提出快速合并小文件的存储方案。把小文件合并为Sequence File存储到HDFS上,采用多元线性回归分析确定负载系数进行负载均衡调节,并在获取文件时使用缓存保证效率。在实验上,搭建相应的文件平台,分别对上传、获取、删除以及内存占用和传统直接上传的方式进行对比分析。可以看出,与传统的直接上传文件到HDFS的方式相比,经过改进的小文件处理方式可以在保证文件可靠性的同时,更快速地处理小文件。As an important way of information transmission and storage,small file has been widely used in many fields.Meanwhile,its reliability and speed requirements need to be improved.For the inefficiency of small file storage,combining the advantage of big file storage of distributed storage system HDFS and the Redis cache technology,we propose a fast small file merging scheme.Small files are merged to Sequence File,which is then stored in HDFS.Loads are balanced by load coefficients that are determined by multiple linear regression analysis,and the efficiency of file access is guaranteed by cache.In experiments,the corresponding file platform is constructed to analyze and compare upload,access,delete,and memory footprint with the traditional direct upload.We can see that,compared with the traditional way of uploading files to HDFS,the improved small files treatment can ensure the reliability of files and enables users operations on small files faster.
分 类 号:TP393[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.149.27.125