机构地区:[1]School of Computer Science and Technology, National University of Defence Technology
出 处:《Science China(Information Sciences)》2015年第6期167-177,共11页中国科学(信息科学)(英文版)
基 金:supported by the National Natural Science Foundation of China(Grant Nos.61379146,61272483,and 61272056);Fund of National University of Defence Technology(Grant No.13-06-03)
摘 要:Erasure codes are widely used in storage systems for providing data reliability due to the advantage of high storage efficiency, while the access efficiency becomes the main shortcoming because of the extra data retrieve and decoding for accessing unavailable data. Most existing work designs erasure codes based on the ideal failure pattern where all storage nodes fail at the same rate. However in practice, the physical storage nodes fail at different rates due to the heterogeneous hardware, topologies and application behaviors. In this paper, we consider the heterogeneous failure pattern and analyze how the failure pattern impacts the overall access efficiency and reliability of erasure-coded storage systems. We propose HeMatch, a redundancy layout placement scheme in practical heterogeneous failure pattern for erasure-coded storage access efficiency. Specifically, we first study how the heterogeneous failure pattern impacts the access efficiency and propose a general model based on the Tanner graph to evaluate and predict the access efficiency in specific failure pattern and redundancy layout. Then, we propose the redundancy layout placement scheme, which matches the redundancy layout with the physical storage nodes in practical heterogeneous failure patterns based on the evaluation and prediction from our model. The experimental results demonstrate that the model we propose accurately evaluates the access efficiency, and HeMatch saves unavailable data access cost by up to 20% and improves the system reliability as well.Erasure codes are widely used in storage systems for providing data reliability due to the advantage of high storage efficiency, while the access efficiency becomes the main shortcoming because of the extra data retrieve and decoding for accessing unavailable data. Most existing work designs erasure codes based on the ideal failure pattern where all storage nodes fail at the same rate. However in practice, the physical storage nodes fail at different rates due to the heterogeneous hardware, topologies and application behaviors. In this paper, we consider the heterogeneous failure pattern and analyze how the failure pattern impacts the overall access efficiency and reliability of erasure-coded storage systems. We propose HeMatch, a redundancy layout placement scheme in practical heterogeneous failure pattern for erasure-coded storage access efficiency. Specifically, we first study how the heterogeneous failure pattern impacts the access efficiency and propose a general model based on the Tanner graph to evaluate and predict the access efficiency in specific failure pattern and redundancy layout. Then, we propose the redundancy layout placement scheme, which matches the redundancy layout with the physical storage nodes in practical heterogeneous failure patterns based on the evaluation and prediction from our model. The experimental results demonstrate that the model we propose accurately evaluates the access efficiency, and HeMatch saves unavailable data access cost by up to 20% and improves the system reliability as well.
关 键 词:network storage erasure-coded storage access efficiency redundancy placement heterogeneous failure pattern
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...