Open Access
Lecture Notes in Computer Science, pages 239-249
Cost Effective Load-Balancing Approach for Range-Partitioned Main-Memory Resident Data
1
LCSI Laboratory, Ecole Nationale Superieure d’Informatique, Algiers, Algeria
|
2
LIAS/ISAE-ENSMA, Futuroscope, France
|
Publication type: Book Chapter
Publication date: 2018-08-08
Journal:
Lecture Notes in Computer Science
Q2
SJR: 0.606
CiteScore: 2.6
Impact factor: —
ISSN: 03029743, 16113349, 18612075, 18612083
Abstract
Due to the availability of larger RAM capacity, there is a new trend bringing parallel main memory database systems with higher performance, compared to traditional DBMSs. In parallel database systems the most critical aspect is data partitioning, which significantly impacts query processing time. Specifically, unbalanced data partitioning introduces data skew, which ends up decreasing query performance if not managed. In this work, we focus on optimizing range queries, widely used in P2P, decision support systems and spatio-temporal databases. We improve the communication complexity of the state-of-the-art previous algorithm based on skip graphs, which required O(log p) messages between 2 nodes to rebalance load, resulting in a high complexity O(p log p) to rebalance load on the p nodes. With such high cost in mind, we propose to create a global view of data distribution among all processing nodes and database clients. Our main contribution is the Approximate Partitioning Vector (
$$\mathcal {APV}$$
), which provides a global approximate view of data distribution to both processing nodes and database clients. A new data balancing algorithm, following a ring topology, reduces communication to 2 messages per node pair, resulting in O(1) communication complexity per node pair and O(p) globally among the p nodes. Experiments analyze the tradeoff between adjusting load balance and query performance.
Found
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Belayadi D. et al. Cost Effective Load-Balancing Approach for Range-Partitioned Main-Memory Resident Data // Lecture Notes in Computer Science. 2018. pp. 239-249.
GOST all authors (up to 50)
Copy
Belayadi D., Hidouci K., Bellatreche L., Ordonez C. Cost Effective Load-Balancing Approach for Range-Partitioned Main-Memory Resident Data // Lecture Notes in Computer Science. 2018. pp. 239-249.
Cite this
RIS
Copy
TY - GENERIC
DO - 10.1007/978-3-319-98812-2_20
UR - https://doi.org/10.1007/978-3-319-98812-2_20
TI - Cost Effective Load-Balancing Approach for Range-Partitioned Main-Memory Resident Data
T2 - Lecture Notes in Computer Science
AU - Belayadi, Djahida
AU - Hidouci, Khaled-Walid
AU - Bellatreche, Ladjel
AU - Ordonez, Carlos
PY - 2018
DA - 2018/08/08
PB - Springer Nature
SP - 239-249
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
Cite this
BibTex (up to 50 authors)
Copy
@incollection{2018_Belayadi,
author = {Djahida Belayadi and Khaled-Walid Hidouci and Ladjel Bellatreche and Carlos Ordonez},
title = {Cost Effective Load-Balancing Approach for Range-Partitioned Main-Memory Resident Data},
publisher = {Springer Nature},
year = {2018},
pages = {239--249},
month = {aug}
}