volume 67 issue 3 pages 388-402

Achieving Load Balance for Parallel Data Access on Distributed File Systems

Publication typeJournal Article
Publication date2018-03-01
scimago Q1
wos Q2
SJR1.156
CiteScore8.1
Impact factor3.8
ISSN00189340, 15579956, 23263814
Hardware and Architecture
Computational Theory and Mathematics
Software
Theoretical Computer Science
Abstract
The distributed file system, HDFS, is widely deployed as the bedrock for many parallel big data analysis. However, when running multiple parallel applications over the shared file system, the data requests from different processes/executors will unfortunately be served in a surprisingly imbalanced fashion on the distributed storage servers. These imbalanced access patterns among storage nodes are caused because a). unlike conventional parallel file system using striping policies to evenly distribute data among storage nodes, data-intensive file system such as HDFS store each data unit, referred to as chunk file, with several copies based on a relative random policy, which can result in an uneven data distribution among storage nodes; b). based on the data retrieval policy in HDFS, the more data a storage node contains, the higher probability the storage node could be selected to serve the data. Therefore, on the nodes serving multiple chunk files, the data requests from different processes/executors will compete for shared resources such as hard disk head and networkbandwidth, resulting in a degraded I/O performance. In this paper, we first conduct a complete analysis on how remote and imbalanced read/write patterns occur and how they are affected by the size of the cluster. We then propose novel methods, referred to as Opass, to optimize parallel data reads, as well as to reduce the imbalance of parallel writes on distributed file systems. Our proposed methods can benefit parallel data-intensive analysis with various parallel data access strategies. Opass adopts new matching-based algorithms to match processes to data so as to compute the maximum degree of data locality and balanced data access. Furthermore, to reduce the imbalance of parallel writes, Opass employs a heatmap for monitoring the I/O statuses of storage nodes and performs HM-LRU policy to select a local optimal storage node for serving write requests. Experiments are conducted on PRObE's Marmot 128-node cluster testbed and the results from both benchmark and well-known parallel applications show the performance benefits and scalability of Opass.
Found 
Found 

Top-30

Journals

1
2
3
4
IEEE Access
4 publications, 14.81%
IEEE Transactions on Computers
2 publications, 7.41%
IEEE Transactions on Parallel and Distributed Systems
2 publications, 7.41%
ACM Transactions on Modeling and Performance Evaluation of Computing Systems
1 publication, 3.7%
Web Intelligence
1 publication, 3.7%
Royal Society Open Science
1 publication, 3.7%
CMES - Computer Modeling in Engineering and Sciences
1 publication, 3.7%
CCF Transactions on High Performance Computing
1 publication, 3.7%
Knowledge and Information Systems
1 publication, 3.7%
IEEE Cloud Computing
1 publication, 3.7%
IEEE Transactions on Intelligent Transportation Systems
1 publication, 3.7%
Lecture Notes in Computer Science
1 publication, 3.7%
Future Generation Computer Systems
1 publication, 3.7%
1
2
3
4

Publishers

2
4
6
8
10
12
14
16
18
Institute of Electrical and Electronics Engineers (IEEE)
18 publications, 66.67%
Springer Nature
3 publications, 11.11%
Association for Computing Machinery (ACM)
2 publications, 7.41%
IOS Press
1 publication, 3.7%
The Royal Society
1 publication, 3.7%
Tech Science Press
1 publication, 3.7%
Elsevier
1 publication, 3.7%
2
4
6
8
10
12
14
16
18
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
27
Share
Cite this
GOST |
Cite this
GOST Copy
Huang D. et al. Achieving Load Balance for Parallel Data Access on Distributed File Systems // IEEE Transactions on Computers. 2018. Vol. 67. No. 3. pp. 388-402.
GOST all authors (up to 50) Copy
Huang D., Han D., Wang J., Yin J., Chen X., Zhang X., Zhou J., Ye M. Achieving Load Balance for Parallel Data Access on Distributed File Systems // IEEE Transactions on Computers. 2018. Vol. 67. No. 3. pp. 388-402.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1109/tc.2017.2749229
UR - https://doi.org/10.1109/tc.2017.2749229
TI - Achieving Load Balance for Parallel Data Access on Distributed File Systems
T2 - IEEE Transactions on Computers
AU - Huang, Dan
AU - Han, Dezhi
AU - Wang, Jun
AU - Yin, Jiangling
AU - Chen, Xunchao
AU - Zhang, Xuhong
AU - Zhou, Jian
AU - Ye, Mao
PY - 2018
DA - 2018/03/01
PB - Institute of Electrical and Electronics Engineers (IEEE)
SP - 388-402
IS - 3
VL - 67
SN - 0018-9340
SN - 1557-9956
SN - 2326-3814
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2018_Huang,
author = {Dan Huang and Dezhi Han and Jun Wang and Jiangling Yin and Xunchao Chen and Xuhong Zhang and Jian Zhou and Mao Ye},
title = {Achieving Load Balance for Parallel Data Access on Distributed File Systems},
journal = {IEEE Transactions on Computers},
year = {2018},
volume = {67},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
month = {mar},
url = {https://doi.org/10.1109/tc.2017.2749229},
number = {3},
pages = {388--402},
doi = {10.1109/tc.2017.2749229}
}
MLA
Cite this
MLA Copy
Huang, Dan, et al. “Achieving Load Balance for Parallel Data Access on Distributed File Systems.” IEEE Transactions on Computers, vol. 67, no. 3, Mar. 2018, pp. 388-402. https://doi.org/10.1109/tc.2017.2749229.