Holistic Root Cause Analysis for Failures in Cloud-Native Systems Through Observability Data
Publication type: Journal Article
Publication date: 2024-11-01
scimago Q1
wos Q1
White List (БС) level 1
SJR: 1.441
CiteScore: 10.8
Impact factor: 5.8
ISSN: 19391374, 23720204
Abstract
Microservices are widely adopted in large IT enterprises, leveraging the scalability, resiliency, and elasticity of the cloud-native architecture. Effective root cause analysis is crucial for ensuring the reliability of such cloud-native systems. Many efforts have focused on using the three modalities of observability data: traces, metrics, and logs. However, existing approaches are limited by inconsistent problem definitions and cloud-native heterogeneity. To address these challenges, we propose HolisticRCA, a framework for root cause analysis in cloud-native systems from a holistic perspective. HolisticRCA formally defines root cause analysis through three dimensions. It then uses an “assembling building blocks” strategy to address cloud-native heterogeneity: it maps each observability feature into a shared vector space and concatenates the vector embeddings associated with each resource entity to obtain standardized resource entity embeddings. Next, it applies a Graph Attention Network to capture intertwined resource entity relations and incorporates mask embeddings to enable holistic analysis. The evaluation results on three public datasets show that HolisticRCA outperforms existing approaches in holistic root cause analysis of cloud-native systems.
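The “assembling building blocks” idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the entity names, feature sizes, random projections, the zero placeholder used when a modality is absent (a loose stand-in for the mask-embedding idea, which the paper may define differently), and the identity weight matrix in the attention step are all assumptions made for illustration.

```python
import math
import random

SHARED_DIM = 4                                # shared vector space size (assumed)
MODALITIES = ("metrics", "logs", "traces")    # fixed concatenation order


def make_projection(in_dim, out_dim, rng):
    """Random linear map standing in for a learned per-modality encoder."""
    return [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(out_dim)]


def project(matrix, vector):
    """Matrix-vector product: maps a raw feature into the shared space."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def entity_embedding(features, projections, placeholder):
    """Concatenate one shared-space block per modality for a single entity."""
    blocks = []
    for name in MODALITIES:
        if name in features:
            blocks.extend(project(projections[name], features[name]))
        else:
            blocks.extend(placeholder)        # modality absent for this entity
    return blocks


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def attention_layer(embeddings, neighbors, attn):
    """Single-head, GAT-style aggregation (identity weight matrix assumed)."""
    updated = {}
    for i, h_i in embeddings.items():
        nbrs = [i] + neighbors.get(i, [])     # include a self-loop
        scores = [leaky_relu(sum(a * v for a, v in zip(attn, h_i + embeddings[j])))
                  for j in nbrs]
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]   # numerically stable softmax
        total = sum(weights)
        updated[i] = [sum(w / total * embeddings[j][k] for w, j in zip(weights, nbrs))
                      for k in range(len(h_i))]
    return updated


rng = random.Random(0)
raw_dims = {"metrics": 3, "logs": 2, "traces": 2}       # hypothetical raw sizes
projections = {m: make_projection(d, SHARED_DIM, rng) for m, d in raw_dims.items()}
placeholder = [0.0] * SHARED_DIM

# Hypothetical resource entities exposing different subsets of modalities.
entities = {
    "service-a": {"metrics": [0.9, 0.1, 0.4], "logs": [3.0, 1.0], "traces": [0.2, 0.7]},
    "pod-a1": {"metrics": [0.8, 0.3, 0.5], "logs": [1.0, 0.0]},
    "node-1": {"metrics": [0.2, 0.2, 0.1]},
}
neighbors = {
    "service-a": ["pod-a1"],
    "pod-a1": ["service-a", "node-1"],
    "node-1": ["pod-a1"],
}

embeddings = {name: entity_embedding(f, projections, placeholder)
              for name, f in entities.items()}
attn = [rng.uniform(-1, 1) for _ in range(2 * SHARED_DIM * len(MODALITIES))]
updated = attention_layer(embeddings, neighbors, attn)
```

Because every entity ends up with a vector of the same length regardless of which modalities it exposes, a single attention layer can be applied across services, pods, and nodes alike. A real system would learn the projections and attention parameters rather than sample them at random.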
Top-30
Journals
- Computing (Vienna/New York): 1 publication, 25%
- Mathematics: 1 publication, 25%
- Lecture Notes in Computer Science: 1 publication, 25%
- IEEE Transactions on Services Computing: 1 publication, 25%
Publishers
- Springer Nature: 2 publications, 50%
- MDPI: 1 publication, 25%
- Institute of Electrical and Electronics Engineers (IEEE): 1 publication, 25%
- Publications without a DOI are not counted.
- Publication statistics are updated weekly.
Metrics
Total citations: 4
Citations since 2025: 4 (100%)
Cite
GOST
Han Y. et al. Holistic Root Cause Analysis for Failures in Cloud-Native Systems Through Observability Data // IEEE Transactions on Services Computing. 2024. Vol. 17. No. 6. pp. 3789-3802.
GOST with all authors (up to 50)
Han Y., Du Q., Huang Y., Li P., Shi X., Wu J., Fang Pei, Tian F., He C. Holistic Root Cause Analysis for Failures in Cloud-Native Systems Through Observability Data // IEEE Transactions on Services Computing. 2024. Vol. 17. No. 6. pp. 3789-3802.
Cite
RIS
TY  - JOUR
DO  - 10.1109/tsc.2024.3478759
UR  - https://ieeexplore.ieee.org/document/10713920/
TI  - Holistic Root Cause Analysis for Failures in Cloud-Native Systems Through Observability Data
T2  - IEEE Transactions on Services Computing
AU  - Han, Yongqi
AU  - Du, Qingfeng
AU  - Huang, Ying
AU  - Li, Pengsheng
AU  - Shi, Xiaonan
AU  - Wu, Jiaqi
AU  - Fang Pei
AU  - Tian, Fulong
AU  - He, Cheng
PY  - 2024
DA  - 2024/11/01
PB  - Institute of Electrical and Electronics Engineers (IEEE)
SP  - 3789
EP  - 3802
IS  - 6
VL  - 17
SN  - 1939-1374
SN  - 2372-0204
ER  - 
Cite
BibTeX (up to 50 authors)
@article{2024_Han,
author = {Yongqi Han and Qingfeng Du and Ying Huang and Pengsheng Li and Xiaonan Shi and Jiaqi Wu and {Fang Pei} and Fulong Tian and Cheng He},
title = {Holistic Root Cause Analysis for Failures in Cloud-Native Systems Through Observability Data},
journal = {IEEE Transactions on Services Computing},
year = {2024},
volume = {17},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
month = {nov},
url = {https://ieeexplore.ieee.org/document/10713920/},
number = {6},
pages = {3789--3802},
doi = {10.1109/tsc.2024.3478759}
}
Cite
MLA
Han, Yongqi, et al. “Holistic Root Cause Analysis for Failures in Cloud-Native Systems Through Observability Data.” IEEE Transactions on Services Computing, vol. 17, no. 6, Nov. 2024, pp. 3789-3802. https://ieeexplore.ieee.org/document/10713920/.