ACM Transactions on Modeling and Performance Evaluation of Computing Systems, volume 5, issue 3, pages 1-27

Toward Efficient Block Replication Management in Distributed Storage

Jianwei Liao 1
Zhibing Sha 1
Zhigang Cai 1
Zhiming Liu 1
Kenli Li 2
Wei-keng Liao 3
Alok N. Choudhary 3
Yutaka Ishikawa 4
Publication type: Journal Article
Publication date: 2020-09-30
scimago Q2
SJR: 0.525
CiteScore: 2.1
Impact factor: 0.7
ISSN: 2376-3639, 2376-3647
Computer Science (miscellaneous)
Hardware and Architecture
Information Systems
Computer Networks and Communications
Software
Safety, Risk, Reliability and Quality
Media Technology
Abstract

Distributed/parallel file systems commonly suffer from load imbalance and resource contention due to the bursty access characteristics exhibited by scientific applications. This article presents an adaptive scheme supporting dynamic block data replication, together with an efficient replica placement policy, to improve the I/O performance of a distributed file system. Our goal is to achieve not only balanced data replication among storage servers but also a high degree of data access parallelism for the applications. We first present mathematical cost models that formulate the cost of block replication by considering both the replication overhead and the reduced access time to the replicated data. To verify the validity and feasibility of the proposed cost model, we implement our proposal in a prototype distributed file system and evaluate it using a set of representative database-relevant application benchmarks. Our results demonstrate that the proposed approach can boost the usage efficiency of data replicas with acceptable replication management overhead. Consequently, the overall data throughput of the storage system can be noticeably improved. In summary, the proposed replication management scheme works well, especially for database-relevant applications that exhibit uneven access frequencies and patterns across different parts of files.
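The replication decision described in the abstract can be illustrated with a small sketch. The following is a hedged, simplified cost/benefit comparison, not the paper's actual cost model; all parameter names (access_rate, t_contended, copy_cost, and so on) are illustrative assumptions.

```python
# Hedged sketch (not the paper's exact model): decide whether creating an
# extra replica of a block pays off, by comparing the expected reduction in
# access time against the one-time replication overhead.

def replication_gain(access_rate: float,     # expected accesses per second to the block
                     t_contended: float,     # avg access latency on the current (hot) server
                     t_replica: float,       # avg access latency once a replica shares the load
                     window: float,          # time window the replica is expected to be useful (s)
                     copy_cost: float,       # time to copy the block to the new server (s)
                     mgmt_cost: float        # bookkeeping/consistency overhead (s)
                     ) -> float:
    """Return the estimated net benefit in seconds saved; replicate if positive."""
    saved_per_access = max(t_contended - t_replica, 0.0)
    benefit = access_rate * window * saved_per_access
    overhead = copy_cost + mgmt_cost
    return benefit - overhead


def should_replicate(**kwargs) -> bool:
    return replication_gain(**kwargs) > 0.0


if __name__ == "__main__":
    # Example: a hot block accessed 50 times/s; an extra replica halves its latency.
    print(should_replicate(access_rate=50, t_contended=0.004, t_replica=0.002,
                           window=60, copy_cost=1.5, mgmt_cost=0.5))
```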

Shwe T., Aritsugi M.
2018-11-29 citations by CoLab: 4 Abstract  
With the expansion of storage components in cloud data centers, component failures have become prevalent. Although data replication can protect against data loss, each time storage components fail, the burden incurred by the data block restoration process is not negligible. Re-replication should be performed carefully to avoid creating a load imbalance on the remaining storage datanodes while maintaining the reliability level. In this paper, we propose PRTuner, which forecasts resource utilization for the whole cluster and tunes the re-replication rate dynamically and proactively in order to minimize the performance impact on regular cluster jobs while ensuring system reliability. PRTuner also augments proactive re-replication with a reactive feature that minimizes performance degradation in the case of inaccurate prediction. Simulation results demonstrate that PRTuner minimizes the performance impact on regular cluster jobs for both highly and lightly utilized clusters while maintaining the system's reliability.
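As a rough illustration of the throttling idea described above (forecast cluster utilization, then set the re-replication rate proactively with a reactive fallback), the sketch below uses a simple moving-average forecaster and illustrative thresholds; it is not PRTuner's actual algorithm.

```python
# Hedged sketch of proactive re-replication throttling. The moving-average
# forecaster, the 0.2 mismatch threshold, and the rate cap are assumptions.

from collections import deque

class ReReplicationTuner:
    def __init__(self, max_rate_mbps: float = 200.0, history: int = 12):
        self.max_rate_mbps = max_rate_mbps
        self.samples = deque(maxlen=history)   # recent cluster utilization samples in [0, 1]

    def observe(self, utilization: float) -> None:
        self.samples.append(min(max(utilization, 0.0), 1.0))

    def forecast_utilization(self) -> float:
        # Simple moving average as a stand-in for a real forecaster.
        return sum(self.samples) / len(self.samples) if self.samples else 0.5

    def rate(self, current_utilization: float) -> float:
        """Re-replication bandwidth (MB/s) to allow in the next interval."""
        predicted = self.forecast_utilization()
        proactive = self.max_rate_mbps * max(1.0 - predicted, 0.0)
        # Reactive guard: if the cluster is much busier than predicted, throttle harder.
        if current_utilization > predicted + 0.2:
            proactive *= 0.5
        return proactive
```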
Mansouri N., Javidi M.M.
Journal of Systems and Software scimago Q1 wos Q1
2018-10-01 citations by CoLab: 41 Abstract  
Data replication is an effective technique that decreases retrieval time and thus reduces energy consumption in the cloud. When required files are not locally available, they must be fetched from remote locations, which is a highly time-consuming process. It is therefore preferable to pre-replicate popular files. Although a few previous works have considered prediction-based replication strategies, their predictions are imprecise in many situations and occupy additional storage. To address these challenges, a new dynamic replication strategy called Prefetching-aware Data Replication (PDR) is proposed, which determines the correlation among data files from the file access history and pre-fetches the most popular files, so that the next time a site requires a file, it is already locally available. In addition, because storage space is limited, the replica replacement strategy plays a vital role. PDR determines the importance of replicas using a fuzzy inference system with four input parameters (number of accesses, cost of the replica, time of last access, and data availability). Extensive experiments with CloudSim show that PDR achieves high data availability, a high hit ratio, and low storage and bandwidth consumption. On average, PDR reduces response time by over 35% compared with the other algorithms.
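PDR ranks replicas with a fuzzy inference system over the four inputs named above. The sketch below substitutes a much simpler crisp weighted score over the same four inputs, purely to illustrate how such a replacement value might be combined; the weights and normalizations are illustrative assumptions, not PDR's rules.

```python
# Hedged stand-in for a replica-value function: higher scores mean the replica
# is more worth keeping when storage is reclaimed.

import time

def replica_value(num_accesses: int, replica_cost: float,
                  last_access_ts: float, availability: float,
                  now: float = None) -> float:
    now = time.time() if now is None else now
    recency = 1.0 / (1.0 + (now - last_access_ts) / 3600.0)   # decays with hours since last access
    popularity = min(num_accesses / 100.0, 1.0)               # saturate at 100 accesses
    scarcity = 1.0 - min(availability, 1.0)                   # fewer copies -> more valuable
    cost = min(replica_cost, 1.0)                             # normalized re-creation cost
    return 0.35 * popularity + 0.25 * recency + 0.25 * scarcity + 0.15 * cost

# Evict the lowest-valued replica when space is needed, e.g.:
# victim = min(replicas, key=lambda r: replica_value(**r))
```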
He S., Sun X.
IEEE Transactions on Computers scimago Q1 wos Q2
2018-10-01 citations by CoLab: 11 Abstract  
As the data volumes of high-performance computing applications continuously increase, low I/O performance becomes a critical bottleneck for these data-intensive applications. Data replication is a promising approach to improve parallel I/O performance. However, most existing strategies are designed on the assumption that contiguous requests are served more efficiently than non-contiguous requests, which is not necessarily true in a parallel I/O system: the multiple-server data distribution makes it indeterminate whether contiguous or non-contiguous requests are favored. In this study, we propose CEDA, a cost-effective distribution-aware data replication scheme to better support parallel I/O systems. Because logical file access information is insufficient for making replication decisions in a parallel environment, CEDA considers physical data accesses on servers in both data selection and data placement during the parallel replication process. Specifically, CEDA first proposes a distribution-aware cost model to evaluate file request time under a given data layout, and then carries out cost-effective data replication based on a replication benefit analysis. We have implemented CEDA as part of the MPI I/O library, for high portability, on top of the OrangeFS file system. By replaying representative benchmarks and a real application, we collected comprehensive experimental results on both HDD- and SSD-based servers and conclude that CEDA can significantly improve parallel I/O system performance.
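The core of a distribution-aware cost model is that a striped parallel request completes only when its slowest server completes. The sketch below illustrates that idea under assumed linear per-server cost parameters; it is not CEDA's actual model.

```python
# Hedged sketch: estimate a parallel request's completion time as the maximum
# per-server service time under a given data layout. Cost parameters are
# illustrative (e.g., HDD vs. SSD servers get different per-byte costs).

def request_time(bytes_per_server: dict,     # server_id -> bytes this request reads from it
                 startup_cost: float,        # per-request fixed cost on a server (s)
                 per_byte_cost: dict         # server_id -> seconds per byte
                 ) -> float:
    return max(startup_cost + n * per_byte_cost[s]
               for s, n in bytes_per_server.items())

# Example: a 4 MB request split unevenly over an HDD server and an SSD server.
layout = {"s0": 3 * 2**20, "s1": 1 * 2**20}
print(request_time(layout, startup_cost=0.001,
                   per_byte_cost={"s0": 2e-8, "s1": 5e-9}))
```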
Cui L., Zhang J., Yue L., Shi Y., Li H., Yuan D.
2018-07-01 citations by CoLab: 63 Abstract  
Cloud computing is a promising distributed computing platform for big data applications, e.g., scientific applications, since abundant resources can be obtained from cloud services for processing and storing both existing and generated application datasets. However, when tasks process big data stored in distributed data centers, the inevitable data movements cause substantial bandwidth cost and execution delay. In this paper, we construct a tripartite-graph-based model to formulate the data replica placement problem and propose a genetic-algorithm-based data replica placement strategy for scientific applications to reduce data transmissions in the cloud. Our approach can reduce 1) the size of moved data, 2) the time of data movement, and 3) the number of movements. We conduct experiments comparing the proposed strategy with the random placement strategy used in the Hadoop Distributed File System (HDFS), which demonstrate that our strategy performs better for scientific applications in clouds.
Liao J., Cai Z., Trahay F., Zhou J., Xiao G.
2018-06-30 citations by CoLab: 1 Abstract  
Many problems in science and engineering are modeled as a set of mutually interacting models, resulting in a coupled or multiphysics application. These component models present challenges originating from their interdisciplinary nature and from their computational and algorithmic complexity. In general, the models are independently developed and maintained, so they commonly exchange their data through the global file system in the coupled application. To effectively use the local file cache on the compute node for exchanging data among the processes of such applications, and consequently boost I/O performance, this article presents a novel mechanism for migrating a process from one compute node to another on the basis of block I/O dependency. In the proposed mechanism, the block I/O dependency between two processes running on different nodes is profiled as block access similarity using Cohen's kappa statistic. A process is then dynamically migrated from its source node to a destination node hosting another process with which it has heavy block I/O dependency. As a result, both processes can exchange their data through the local file cache instead of the global file system, reducing I/O time. The experimental results demonstrate that I/O performance can be significantly improved and that the application execution time decreases accordingly.
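Cohen's kappa is a standard agreement statistic, so the block-access-similarity profiling can be sketched directly. The encoding below (binary accessed/not-accessed indicators over a shared block universe) is an illustrative assumption rather than the paper's exact profiling procedure.

```python
# Hedged sketch: measure block I/O dependency between two processes as the
# agreement of their per-block access indicators, using Cohen's kappa.

def cohens_kappa(blocks_a: set, blocks_b: set, universe: set) -> float:
    """Kappa over binary 'accessed / not accessed' labels for each block."""
    n = len(universe)
    if n == 0:
        return 0.0
    both = len(blocks_a & blocks_b)
    neither = len(universe - blocks_a - blocks_b)
    p_o = (both + neither) / n                      # observed agreement
    pa, pb = len(blocks_a) / n, len(blocks_b) / n
    p_e = pa * pb + (1 - pa) * (1 - pb)             # agreement expected by chance
    return 0.0 if p_e == 1.0 else (p_o - p_e) / (1 - p_e)

# A migration target could then be the node whose resident process yields the
# highest kappa with the candidate process, e.g.:
# target = max(candidates, key=lambda proc: cohens_kappa(my_blocks, proc.blocks, universe))
```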
Liao J., Cai Z., Trahay F., Peng X.
IEEE Access scimago Q1 wos Q2 Open Access
2018-06-29 citations by CoLab: 10 Abstract  
This paper proposes a new data placement policy for allocating data blocks across the storage servers of distributed/parallel file systems to yield an even distribution of block access workload. To this end, we first analyze the history of the block access sequence of a specific application and then introduce a k-partition algorithm that divides the data blocks into multiple groups according to their access frequency. After partitioning, each group has almost the same access workload, and we can distribute these block groups onto the storage servers of the distributed file system to achieve the goal of uniformly assigning data blocks when running the application. In summary, this newly proposed data placement policy yields not only an even data distribution but also balanced block access. The experimental results show that the proposed scheme can greatly reduce I/O time and improve the utilization of storage servers when running database-relevant applications, compared with the commonly used block data placement strategy, i.e., the round-robin placement policy.
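The balancing goal of the k-partition step can be illustrated with the classic greedy longest-processing-time heuristic: assign blocks, in decreasing order of access frequency, to the currently least-loaded group. This is a hedged stand-in, not necessarily the paper's exact algorithm.

```python
# Hedged sketch: split blocks into k groups with approximately equal total
# access frequency; each group is then placed on one storage server.

import heapq

def k_partition(block_freq: dict, k: int) -> list:
    """block_freq: block_id -> observed access frequency. Returns k lists of block ids."""
    groups = [[] for _ in range(k)]
    heap = [(0.0, i) for i in range(k)]             # (current group load, group index)
    heapq.heapify(heap)
    for block, freq in sorted(block_freq.items(), key=lambda kv: -kv[1]):
        load, i = heapq.heappop(heap)               # least-loaded group so far
        groups[i].append(block)
        heapq.heappush(heap, (load + freq, i))
    return groups

# Example: two servers end up with loads 120 and 120.
print(k_partition({"b0": 90, "b1": 60, "b2": 40, "b3": 30, "b4": 20}, k=2))
```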
Aral A., Ovatman T.
2018-06-01 citations by CoLab: 99 Abstract  
As the devices that make up the Internet become more powerful, algorithms that orchestrate cloud systems are on the verge of putting more responsibility for computation and storage on these devices. In our current age of Big Data, the dissemination and storage of data across end cloud devices is becoming a prominent problem under this expansion. In this paper, we propose a distributed data dissemination approach that relies on the dynamic creation, replacement, and removal of replicas, guided by continuous monitoring of data requests coming from edge nodes of the underlying network. Our algorithm exploits the geographical locality of data during the dissemination process, owing to the abundance of common data requests that stem from clients in close proximity. Our results using both real-world and synthetic data demonstrate that a decentralized replica placement approach provides significant cost benefits compared to the client-side caching that is widely used in traditional distributed systems.
Huang D., Han D., Wang J., Yin J., Chen X., Zhang X., Zhou J., Ye M.
IEEE Transactions on Computers scimago Q1 wos Q2
2018-03-01 citations by CoLab: 27 Abstract  
The distributed file system HDFS is widely deployed as the bedrock for many parallel big data analyses. However, when multiple parallel applications run over the shared file system, the data requests from different processes/executors are unfortunately served in a surprisingly imbalanced fashion on the distributed storage servers. These imbalanced access patterns among storage nodes arise because (a) unlike conventional parallel file systems, which use striping policies to evenly distribute data among storage nodes, data-intensive file systems such as HDFS store each data unit, referred to as a chunk file, as several copies placed by a relatively random policy, which can result in an uneven data distribution among storage nodes; and (b) under the data retrieval policy in HDFS, the more data a storage node contains, the higher the probability that the node is selected to serve the data. Therefore, on nodes serving multiple chunk files, the data requests from different processes/executors compete for shared resources such as the hard disk head and network bandwidth, resulting in degraded I/O performance. In this paper, we first conduct a complete analysis of how remote and imbalanced read/write patterns occur and how they are affected by the size of the cluster. We then propose novel methods, referred to as Opass, to optimize parallel data reads and to reduce the imbalance of parallel writes on distributed file systems. Our proposed methods can benefit parallel data-intensive analysis with various parallel data access strategies. Opass adopts new matching-based algorithms to match processes to data so as to achieve the maximum degree of data locality and balanced data access. Furthermore, to reduce the imbalance of parallel writes, Opass employs a heatmap for monitoring the I/O status of storage nodes and applies an HM-LRU policy to select a locally optimal storage node for serving write requests. Experiments conducted on PRObE's Marmot 128-node cluster testbed, with both benchmarks and well-known parallel applications, show the performance benefits and scalability of Opass.
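The write-balancing idea (a heatmap of per-node I/O load used to pick a locally optimal write target) can be sketched as follows. The decay factor and byte counters are illustrative assumptions, and this is not Opass's actual HM-LRU implementation.

```python
# Hedged sketch: track recent I/O load per storage node and direct each new
# write to the coolest node among the locally reachable candidates.

class IOHeatmap:
    def __init__(self, nodes, decay: float = 0.9):
        self.load = {n: 0.0 for n in nodes}
        self.decay = decay

    def record(self, node: str, bytes_served: int) -> None:
        self.load[node] += bytes_served

    def tick(self) -> None:
        # Periodically age old observations so the heatmap tracks recent load.
        for n in self.load:
            self.load[n] *= self.decay

    def pick_write_target(self, candidates) -> str:
        # Choose the locally optimal (coolest) node for the next write.
        return min(candidates, key=lambda n: self.load[n])
```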
Guerrero C., Lera I., Juiz C.
Journal of Grid Computing scimago Q1 wos Q1
2018-02-14 citations by CoLab: 26 Abstract  
This work addresses the optimization of file locality, file availability, and replica migration cost in a Hadoop architecture. Our optimization algorithm is based on the Non-dominated Sorting Genetic Algorithm-II and simultaneously determines file block placement, with a variable replication factor, and MapReduce job scheduling. Our proposal has been tested with experiments that considered three data center sizes (8, 16, and 32 nodes) with the same workload and number of files (150 files and 3519 file blocks). In general terms, the use of a placement policy with a variable replication factor yields greater improvements in our three optimization objectives. In contrast, the use of a job scheduling policy improves these objectives only when it is used along with a variable replication factor. The results also show that the migration cost is a suitable optimization objective, as significant improvements of up to 34% were observed across the experiments.
Rao Chandakanna V.
2018-02-01 citations by CoLab: 11 Abstract  
The Hadoop Distributed File System (HDFS) (Shvachko et al., 2010) is a highly scalable and fault-tolerant distributed file system that can be deployed on low-cost hardware. The content stored in HDFS is partitioned into blocks, and the blocks are replicated on multiple Data Nodes. Different block placement strategies can be used to make it highly fault-tolerant and to improve throughput and access time. HDFS allows the user to (i) specify the block size used for partitioning a given file and (ii) issue only sequential read and append operations; it does not allow the user to perform random read and random write operations. This paper proposes an enhanced HDFS (REHDFS) that (i) explores different block placement and block read strategies and (ii) implements random read and random write operations. The proposed architecture is implemented and evaluated. The proposed load-based block access strategy performed better than the other block retrieval strategies. The random read feature is implemented and evaluated, and pessimistic and optimistic models for implementing random writes are proposed and evaluated.
Wang J., Zhang X., Zhang J., Yin J., Han D., Wang R., Huang D.
2017-10-01 citations by CoLab: 5 Abstract  
During the last few decades, data-intensive file systems (DiFS), such as the Google File System (GFS) and the Hadoop Distributed File System (HDFS), have become the key storage architectures for big data processing. These storage systems usually divide files into fixed-sized blocks (or chunks). Each block is replicated (usually three-way) and distributed pseudo-randomly across the cluster. The master node (namenode) uses a huge table to record the locations of each block and its replicas. However, with increasing data size, the block location table and its maintenance can occupy more than half of the memory space and 30% of the processing capacity of the master node, which severely limits the scalability and performance of the master node. We argue that physical data distribution and maintenance should be separated from metadata management and performed by each storage node autonomously. In this paper, we propose Deister, a novel block management scheme built on an invertible deterministic declustering distribution method called Intersected Shifted Declustering (ISD). Deister is amenable to current research on scaling namespace management in the master node. In Deister, the huge table for maintaining block locations in the master node is eliminated, and the maintenance of the block-node mapping is performed autonomously on each data node. Results show that, compared with the default HDFS configuration, Deister achieves identical performance while saving about half of the RAM space and 30% of the processing capacity in the master node, and it is expected to scale to double the size of the current single-namenode HDFS cluster, pushing the scalability bottleneck of the master node back to namespace management.
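The key property of table-free placement is that replica locations are derived deterministically from the block identifier, so any node can recompute them instead of consulting a master-node table. The sketch below shows that property with a simple shifted-modulo layout, which is an illustrative assumption and not the actual Intersected Shifted Declustering method.

```python
# Hedged sketch of deterministic, recomputable block placement.

def replica_nodes(block_id: int, num_nodes: int, replicas: int = 3, shift: int = 7) -> list:
    """Return the node indices holding each replica of the block."""
    primary = block_id % num_nodes
    # Place subsequent replicas a fixed shift apart (co-prime with num_nodes)
    # so replicas of the same block land on distinct nodes and load spreads out.
    return [(primary + r * shift) % num_nodes for r in range(replicas)]

# Any client or datanode can recompute this without a block location table:
print(replica_nodes(block_id=123456, num_nodes=64))
```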
Ganesan A., Alagappan R., Arpaci-Dusseau A.C., Arpaci-Dusseau R.H.
ACM Transactions on Storage scimago Q2 wos Q2
2017-08-31 citations by CoLab: 19 Abstract  
We analyze how modern distributed storage systems behave in the presence of file-system faults such as data corruption and read and write errors. We characterize eight popular distributed storage systems and uncover numerous problems related to file-system fault tolerance. We find that modern distributed systems do not consistently use redundancy to recover from file-system faults: a single file-system fault can cause catastrophic outcomes such as data loss, corruption, and unavailability. We also find that the above outcomes arise due to fundamental problems in file-system fault handling that are common across many systems. Our results have implications for the design of next-generation fault-tolerant distributed and cloud storage systems.
Lin Y., Shen H.
2017-04-01 citations by CoLab: 15 Abstract  
In data-intensive clusters, a large number of files are stored, processed, and transferred simultaneously. To increase data availability, some file systems create and store three replicas of each file in randomly selected servers across different racks. However, they neglect file heterogeneity and server heterogeneity, which can be leveraged to further enhance data availability and file system efficiency. Because files have heterogeneous popularities, a rigid number of three replicas may not provide an immediate response to an excessive number of read requests to hot files, and it wastes resources (including energy) on replicas of cold files that receive few read requests. Also, because servers are heterogeneous in network bandwidth, hardware configuration, and capacity (i.e., the maximal number of service requests that can be supported simultaneously), it is crucial to select replica servers that ensure low replication delay and low request response delay. In this paper, we propose an Energy-Efficient Adaptive File Replication System (EAFR), which incorporates three components. It adapts to time-varying file popularities to achieve a good tradeoff between data availability and efficiency: higher popularity of a file leads to more replicas, and vice versa. To achieve energy efficiency, servers are classified into hot servers and cold servers with different energy consumption, and cold files are stored on cold servers. EAFR then selects a server with sufficient capacity (including network bandwidth and storage capacity) to hold a replica. To further improve the performance of EAFR, we propose a dynamic transmission rate adjustment strategy to prevent potential incast congestion when replicating a file to a server, a network-aware data node selection strategy to reduce file read latency, and a load-aware replica maintenance strategy to quickly recreate file replicas upon replica node failures. Experimental results on a real-world cluster show the effectiveness of EAFR and the proposed strategies in reducing file read latency, replication time, and power consumption in large clusters.
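The popularity-adaptive part of such a scheme can be sketched as a mapping from recent read rate to a target replica count, bounded below for durability and above for storage/energy cost. The thresholds below are illustrative assumptions, not EAFR's actual policy.

```python
# Hedged sketch: the replica count of a file grows with its recent read rate.

def target_replicas(reads_per_hour: float,
                    min_replicas: int = 2,      # durability floor for cold files
                    max_replicas: int = 6,      # cap to bound storage and energy cost
                    reads_per_extra_replica: float = 500.0) -> int:
    extra = int(reads_per_hour // reads_per_extra_replica)
    return max(min_replicas, min(min_replicas + extra, max_replicas))

# Cold files drop toward min_replicas (and can live on low-power "cold" servers),
# while hot files gain replicas to absorb read bursts.
print([target_replicas(r) for r in (10, 600, 2600, 10000)])
```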
Stavrinides G.L., Duro F.R., Karatza H.D., Blas J.G., Carretero J.
2017-01-01 citations by CoLab: 28 Abstract  
As large-scale distributed systems gain momentum, the scheduling of workflow applications with multiple requirements in such computing platforms has become a crucial area of research. In this paper, we investigate the workflow scheduling problem in large-scale distributed systems, from the Quality of Service (QoS) and data locality perspectives. We present a scheduling approach, considering two models of synchronization for the tasks in a workflow application: (a) communication through the network and (b) communication through temporary files. Specifically, we investigate via simulation the performance of a heterogeneous distributed system, where multiple soft real-time workflow applications arrive dynamically. The applications are scheduled under various tardiness bounds, taking into account the communication cost in the first case study and the I/O cost and data locality in the second. The simulation results provide useful insights into the impact of tardiness bound and data locality on the system performance.
Chen L., Qiu M., Song J., Xiong Z., Hassan H.
Journal of Supercomputing scimago Q2 wos Q2
2016-08-27 citations by CoLab: 20 Abstract  
In cloud storage, replication technologies are essential for fault tolerance and high availability of data. While achieving high availability, replication adds extra active servers to the storage system, and extra active servers mean extra power consumption and capital expenditure. Furthermore, the lack of data classification fixes the replication scheme at the very beginning. This paper proposes an elastic and efficient file storage system called E2FS for big data applications. E2FS can dynamically scale the storage system in and out based on the real-time demands of big data applications. We adopt a novel replication scheme based on data blocks, which provides fine-grained maintenance of the data in the storage system. E2FS analyzes data features and makes dynamic replication decisions to balance the cost and performance of cloud storage. To evaluate the proposed work, we implement a prototype of E2FS and compare it with HDFS. Our experiments show that E2FS can outperform HDFS in elasticity while achieving guaranteed performance for big data applications.
Sundara Kumar M.R., Mohan H.S.
2024-04-18 citations by CoLab: 10 Abstract  
Big Data Analytics (BDA) is an unavoidable technique in today's digital world for dealing with massive amounts of digital data generated by online and Internet sources. The data are kept in repositories and processed by cluster nodes distributed throughout a wider network. Because of its magnitude and real-time creation, big data processing faces challenges in latency and throughput. Modern systems such as Hadoop and Spark manage large amounts of data with HDFS, MapReduce, and in-memory analytics approaches, but the migration cost is higher than usual. Genetic Algorithm-based Optimization (GABO), MapReduce Scheduling (MRS), and data replication have provided answers to this challenge: the multi-objective solutions produced by the genetic algorithm improve resource utilization and node availability, and thereby processing performance, in big data environments. This work develops a novel strategy for enhancing data processing performance in big data analytics, called MapReduce Scheduling Based Non-Dominated Sorting Genetic Algorithm (MRSNSGA). The Hadoop MapReduce paradigm handles the placement of data in distributed blocks as chunks and their scheduling among the cluster nodes in a wider network. Best-fit solutions with respect to latency and access time are extracted from the set of multi-objective solutions. Experiments were carried out as simulations with several inputs of varied location node data and cluster racks. Finally, the results show that the speed of data processing in big data analytics was enhanced by 30–35% over previous methodologies, and the optimization approach located the best solutions among the multi-objective solutions at a rate of 24–30% across cluster nodes.
