英语原文共 13 页,剩余内容已隐藏,支付完成后下载完整资料
武汉理工大学毕业设计(论文)
英文文献翻译
目 录
Tree-Structured Parallel Regeneration for Multiple Data Losses in Distributed Storage Systems Based on Erasure Codes
Tree-Structured Parallel Regeneration for Multiple Data Losses in Distributed Storage Systems Based on Erasure Codes
SUN Weidong, WANG Yijie, PEI Xiaoqiang
National Key Laboratory for Parallel and Distributed Processing, School of Computer, National University of Defense Technology Changsha 410073, China
Abstract: To reduce the time required to complete the regeneration process of erasure codes, we propose a Tree-structured Parallel Regeneration (TPR) scheme for multiple data losses in distributed storage systems. Under the scheme, two algorithms are proposed for the construction of multiple regeneration trees, namely the edge-disjoint algorithm and edge-sharing algorithm. The edge-disjoint algorithm constructs multiple independent trees, and is simple and appropriate for environments where newcomers and their providers are distributed over a large area and have few intersections. The edge-sharing algorithm constructs multiple trees that compete to utilize the bandwidth, and make a better utilization of the bandwidth, although it needs to measure the available band-width and deal with the bandwidth changes; it is therefore difficult to implement in practical systems. The parallel regeneration for multiple data losses of TPR primarily includes two optimizations: firstly, transferring the data through the bandwidth optimized-paths in a pipeline manner; secondly, executing data regeneration over multiple trees in parallel. To evaluate the proposal, we implement an event-based simulator and make a detailed comparison with some popular regeneration methods. The quantitative comparison results show that the use of TPR employing either the edge-disjoint algorithm or edge-sharing algorithm reduces the regeneration time significantly.
Key words: distributed storage system; erasure code; replication; regeneration tree
I. INTRODUCTION
Cloud service providers gather a large number of commodity nodes to build up broad-scale data centers. In such a large system, data losses brought by frequent node departures and hardware failures should be treated as a rule and not an exception, which dramatically degrades the reliability of data in the system. Therefore, replication and erasure codes are widely used as redundant techniques to ensure the data reliability.
However, regeneration is indispensable to redundant techniques as failures may damage redundancy as well and eventually harm the reliability of stored data. During the regeneration process, the system chooses some active nodes (named as providers) to provide data furan idle node (named as newcomer) to reconstruct and store the lost data.
Replication is one of the most common redundant techniques, where m (the redundant factor equals m) identical copies of data object are kept in the system. Any of the replicas can provide services to data users. Another common redundant technique is erasure codes, in which each data object is firstly divided into blocks, and then these blocks are encoded into n (the redundant factor equals m/n)blocks and stored separately in n different storage nodes, where ngt;m. Compared with replication, erasure codes require less storage space. In Refs. [I-2], it is shown that systems employing erasure codes have Mean-Time-to-Failure (MTTF) many orders of magnitude higher than that of the replication. However, when the failure occurs, in the system employing replication, the newcomer only needs to download a replica from one provider; different from replication, newcomer of the system employing erasure codes has to download from at least k providers to regenerate the lost data. Therefore, the regeneration of erasure codes consumes more bandwidth and takes longer time.
In order to reduce the bandwidth consumption of erasure codes, Demakis et al. Propose Regeneration Code, which reduces the total amount of transferred data for regeneration process by accessing more than k providers[3-4]. In the traditional star-structured regeneration, the providers transfer data directly tithe newcomer, and the regeneration time depends on the bottleneck bandwidth between newcomer and providers. Li et al. in Ref. [5]prove that by adopting random linear codes, the bottleneck link can be bypassed by constructing a regeneration tree structure instead of the traditional star structure. Their experiment results further show that tree-structured construction can greatly reduce the regeneration time compared with the star-structured regeneration.
In modern large-scale distributed systems, frequent failures make multiple nodes failed often at the same time. Moreover, the service providers prefer to perform the regeneration with lazy policy so as to reduce the management cost, which means that the regenerations triggered only when the total amount of losses reaches a given threshold. Suppose there are n storage nodes initially, and r nodes failed at a certain time. It is necessary to construct r new nodes to maintain the original redundancy. The most common scheme for constructing multiple newcomers is to regenerate multiple newcomers one by one, such as sequential Star-Structured Regeneration (referred to as and sequential Tree-Structured Regeneration (TSR). To deal with the multiple losses, Hub et al. [6] propose a Mutually Cooperative Recovery (MCR) mechanism, in which all the newcomers repair the lost data cooperatively and simultaneously. But MCR is still based on the star structure, which cannot maximize the utilization of the network bandwidth capacity, and results in slower regeneration process, lower probability of successful regeneration and lower da
剩余内容已隐藏,支付完成后下载完整资料
资料编号:[141416],资料为PDF文档或Word文档,PDF文档可免费转换为Word
课题毕业论文、开题报告、任务书、外文翻译、程序设计、图纸设计等资料可联系客服协助查找。
您可能感兴趣的文章
- 饮用水微生物群:一个全面的时空研究,以监测巴黎供水系统的水质外文翻译资料
- 步进电机控制和摩擦模型对复杂机械系统精确定位的影响外文翻译资料
- 具有温湿度控制的开式阴极PEM燃料电池性能的提升外文翻译资料
- 警报定时系统对驾驶员行为的影响:调查驾驶员信任的差异以及根据警报定时对警报的响应外文翻译资料
- 门禁系统的零知识认证解决方案外文翻译资料
- 车辆废气及室外环境中悬浮微粒中有机磷的含量—-个案研究外文翻译资料
- ZigBee协议对城市风力涡轮机的无线监控: 支持应用软件和传感器模块外文翻译资料
- ZigBee系统在医疗保健中提供位置信息和传感器数据传输的方案外文翻译资料
- 基于PLC的模糊控制器在污水处理系统中的应用外文翻译资料
- 光伏并联最大功率点跟踪系统独立应用程序外文翻译资料