volume 80 pages 89-101

Self-adaptive cloud monitoring with online anomaly detection

Tao Wang 1
Jiwei Xu 1
Wenbo Zhang 1
Zeyu Gu 2
ZHONG HUA 1
Publication typeJournal Article
Publication date2018-03-01
scimago Q1
wos Q1
SJR1.551
CiteScore17.1
Impact factor6.1
ISSN0167739X, 18727115
Hardware and Architecture
Computer Networks and Communications
Software
Abstract
Monitoring is the key to guarantee the reliability of cloud computing systems. By analyzing monitoring data, administrators can understand systems statuses to detect, diagnose and solve problems. However, due to the enormous scale and complex structure of cloud computing, a monitoring system should collect, transfer, store and process a large amount of monitoring data, which brings a significant performance overhead and increases the difficulty of analyzing useful information. To address the above issue, this paper proposes a self-adaptive monitoring approach for cloud computing systems. First, we conduct correlation analysis between different metrics, and monitor selected important ones which represent the others and reflect the running status of a system. Second, we characterize the running status with Principal Component Analysis (PCA), estimate the anomaly degree, and predict the possibility of faults. Finally, we dynamically adjust the monitoring period based on the estimated anomaly degree and a reliability model. To evaluate our proposal, we have applied the approach in our open-source TPC-W benchmark Bench4Q deployed in our real cloud computing platform OnceCloud. The experimental results demonstrate that our approach can adapt to dynamic workloads, accurately estimate the anomaly degree, and automatically adjust monitoring periods. Thus, the approach can effectively improve the accuracy and timeliness of anomaly detection in an abnormal status, and efficiently lower the monitoring overhead in a normal status. Correlation analysis is proposed to select key metrics representing others.PCA is proposed to characterize running status and predict the possibility of faults.We dynamically adjust metrics and periods based on a reliability model.We evaluate the approach on our real cloud platform with case studies.
Found 
Found 

Top-30

Journals

1
2
3
IEEE Access
3 publications, 6.12%
Lecture Notes in Computer Science
3 publications, 6.12%
IEEE Transactions on Network and Service Management
3 publications, 6.12%
Concurrency Computation Practice and Experience
2 publications, 4.08%
IET Networks
1 publication, 2.04%
IET Control Theory and Applications
1 publication, 2.04%
International Journal of e-Collaboration
1 publication, 2.04%
Journal of Intelligent and Fuzzy Systems
1 publication, 2.04%
Journal of Circuits, Systems and Computers
1 publication, 2.04%
Electronics (Switzerland)
1 publication, 2.04%
Neural Computing and Applications
1 publication, 2.04%
Journal of Supercomputing
1 publication, 2.04%
Pattern Recognition Letters
1 publication, 2.04%
IOP Conference Series: Materials Science and Engineering
1 publication, 2.04%
Future Generation Computer Systems
1 publication, 2.04%
Microprocessors and Microsystems
1 publication, 2.04%
Expert Systems with Applications
1 publication, 2.04%
Transactions on Emerging Telecommunications Technologies
1 publication, 2.04%
Enterprise Information Systems
1 publication, 2.04%
IEEE Transactions on Vehicular Technology
1 publication, 2.04%
IEEE Communications Surveys and Tutorials
1 publication, 2.04%
IEEE Transactions on Industrial Informatics
1 publication, 2.04%
Lecture Notes in Electrical Engineering
1 publication, 2.04%
IEEE Internet of Things Journal
1 publication, 2.04%
Journal of Network and Computer Applications
1 publication, 2.04%
Scientific Reports
1 publication, 2.04%
BIO Web of Conferences
1 publication, 2.04%
ACM Transactions on Software Engineering and Methodology
1 publication, 2.04%
Journal of Network and Systems Management
1 publication, 2.04%
1
2
3

Publishers

2
4
6
8
10
12
14
16
18
Institute of Electrical and Electronics Engineers (IEEE)
18 publications, 36.73%
Springer Nature
9 publications, 18.37%
Elsevier
5 publications, 10.2%
Association for Computing Machinery (ACM)
4 publications, 8.16%
Wiley
3 publications, 6.12%
Institution of Engineering and Technology (IET)
2 publications, 4.08%
MDPI
2 publications, 4.08%
IGI Global
1 publication, 2.04%
IOS Press
1 publication, 2.04%
World Scientific
1 publication, 2.04%
IOP Publishing
1 publication, 2.04%
Taylor & Francis
1 publication, 2.04%
EDP Sciences
1 publication, 2.04%
2
4
6
8
10
12
14
16
18
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
49
Share
Cite this
GOST |
Cite this
GOST Copy
Wang T. et al. Self-adaptive cloud monitoring with online anomaly detection // Future Generation Computer Systems. 2018. Vol. 80. pp. 89-101.
GOST all authors (up to 50) Copy
Wang T., Xu J., Zhang W., Gu Z., HUA Z. Self-adaptive cloud monitoring with online anomaly detection // Future Generation Computer Systems. 2018. Vol. 80. pp. 89-101.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1016/j.future.2017.09.067
UR - https://doi.org/10.1016/j.future.2017.09.067
TI - Self-adaptive cloud monitoring with online anomaly detection
T2 - Future Generation Computer Systems
AU - Wang, Tao
AU - Xu, Jiwei
AU - Zhang, Wenbo
AU - Gu, Zeyu
AU - HUA, ZHONG
PY - 2018
DA - 2018/03/01
PB - Elsevier
SP - 89-101
VL - 80
SN - 0167-739X
SN - 1872-7115
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2018_Wang,
author = {Tao Wang and Jiwei Xu and Wenbo Zhang and Zeyu Gu and ZHONG HUA},
title = {Self-adaptive cloud monitoring with online anomaly detection},
journal = {Future Generation Computer Systems},
year = {2018},
volume = {80},
publisher = {Elsevier},
month = {mar},
url = {https://doi.org/10.1016/j.future.2017.09.067},
pages = {89--101},
doi = {10.1016/j.future.2017.09.067}
}