том 46 издание 4 страницы 10863-10882

Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques

Тип публикацииJournal Article
Дата публикации2024-04-18
scimago Q2
wos Q4
БС2
SJR0.364
CiteScore4.2
Impact factor1.0
ISSN10641246, 18758967
Statistics and Probability
General Engineering
Artificial Intelligence
Краткое описание

Big Data Analytics (BDA) is an unavoidable technique in today’s digital world for dealing with massive amounts of digital data generated by online and internet sources. It is kept in repositories for data processing via cluster nodes that are distributed throughout the wider network. Because of its magnitude and real-time creation, big data processing faces challenges with latency and throughput. Modern systems such as Hadoop and SPARK manage large amounts of data with their HDFS, Map Reduce, and In-Memory analytics approaches, but the migration cost is higher than usual. With Genetic Algorithm-based Optimization (GABO), Map Reduce Scheduling (MRS) and Data Replication have provided answers to this challenge. With multi objective solutions provided by Genetic Algorithm, resource utilization and node availability improve processing performance in large data environments. This work develops a novel creative strategy for enhancing data processing performance in big data analytics called Map Reduce Scheduling Based Non-Dominated Sorting Genetic Algorithm (MRSNSGA). The Hadoop-Map Reduce paradigm handles the placement of data in distributed blocks as a chunk and their scheduling among the cluster nodes in a wider network. Best fit solutions with high latency and low accessing time are extracted from the findings of various objective solutions. Experiments were carried out as a simulation with several inputs of varied location node data and cluster racks. Finally, the results show that the speed of data processing in big data analytics was enhanced by 30–35% over previous methodologies. Optimization approaches developed to locate the best solutions from multi-objective solutions at a rate of 24–30% among cluster nodes.

Найдено 
Найдено 

Топ-30

Журналы

1
Lecture Notes in Networks and Systems
1 публикация, 9.09%
1

Издатели

2
4
6
8
10
Institute of Electrical and Electronics Engineers (IEEE)
10 публикаций, 90.91%
Springer Nature
1 публикация, 9.09%
2
4
6
8
10
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
11
Поделиться
Цитировать
ГОСТ |
Цитировать
Sundara Kumar M. R., Mohan H.S. Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques // Journal of Intelligent and Fuzzy Systems. 2024. Vol. 46. No. 4. pp. 10863-10882.
ГОСТ со всеми авторами (до 50) Скопировать
Sundara Kumar M. R., Mohan H.S. Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques // Journal of Intelligent and Fuzzy Systems. 2024. Vol. 46. No. 4. pp. 10863-10882.
RIS |
Цитировать
TY - JOUR
DO - 10.3233/jifs-240069
UR - https://www.medra.org/servlet/aliasResolver?alias=iospress&doi=10.3233/JIFS-240069
TI - Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques
T2 - Journal of Intelligent and Fuzzy Systems
AU - Sundara Kumar, M. R.
AU - Mohan H.S.
PY - 2024
DA - 2024/04/18
PB - SAGE
SP - 10863-10882
IS - 4
VL - 46
SN - 1064-1246
SN - 1875-8967
ER -
BibTex |
Цитировать
BibTex (до 50 авторов) Скопировать
@article{2024_Sundara Kumar,
author = {M. R. Sundara Kumar and Mohan H.S.},
title = {Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques},
journal = {Journal of Intelligent and Fuzzy Systems},
year = {2024},
volume = {46},
publisher = {SAGE},
month = {apr},
url = {https://www.medra.org/servlet/aliasResolver?alias=iospress&doi=10.3233/JIFS-240069},
number = {4},
pages = {10863--10882},
doi = {10.3233/jifs-240069}
}
MLA
Цитировать
Sundara Kumar, M. R., and Mohan H.S.. “Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques.” Journal of Intelligent and Fuzzy Systems, vol. 46, no. 4, Apr. 2024, pp. 10863-10882. https://www.medra.org/servlet/aliasResolver?alias=iospress&doi=10.3233/JIFS-240069.