Open Access
Open access
volume 70 pages 4010

Effectiveness Evaluation of Random Forest, Naive Bayes, and Support Vector Machine Models for KDDCUP99 Anomaly Detection Based on K-means Clustering

Publication typeJournal Article
Publication date2025-01-23
SJR
CiteScore
Impact factor
ISSN22712097
Abstract

Security in the World Wide Web has recently seen an enormous upgrade in almost every aspect. Identifying malicious activities hi a network such as network attacks and malicious users plays a significant role hi these upgraded security directions. This research utilizes the KDDCUP99 dataset to incorporate K-means clustering with three classifiers: Random Forest (RF). Naïve Bayes (NB). and Support Vector Machine (SVM) with the goal to boost the accuracy of predicting network intrusions. In tins paper. K-means clustering technique is applied as a preprocessing step to enhance the overall quality of network intrusion detection and maximize the accuracy of the network security measures. The goal is to identify anomalies with high accuracy. Experimental results hidicate that the optimal combination is K-means + RF. which outperformed the others hi precision, recall, and Fl-score. Although K-means + NB demonstrated superior recall for certahi smaller anomalies, it underperformed compared to the RF model. The paper concludes by highlighting the value of ensemble approaches, in particular Random Forest, for tackling anomaly detection and network security issues, particularly hi light of the expanding significance of social networks and the internet.

Found 

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
0
Share
Cite this
GOST |
Cite this
GOST Copy
Zhang M. Effectiveness Evaluation of Random Forest, Naive Bayes, and Support Vector Machine Models for KDDCUP99 Anomaly Detection Based on K-means Clustering // ITM Web of Conferences. 2025. Vol. 70. p. 4010.
GOST all authors (up to 50) Copy
Zhang M. Effectiveness Evaluation of Random Forest, Naive Bayes, and Support Vector Machine Models for KDDCUP99 Anomaly Detection Based on K-means Clustering // ITM Web of Conferences. 2025. Vol. 70. p. 4010.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1051/itmconf/20257004010
UR - https://www.itm-conferences.org/10.1051/itmconf/20257004010
TI - Effectiveness Evaluation of Random Forest, Naive Bayes, and Support Vector Machine Models for KDDCUP99 Anomaly Detection Based on K-means Clustering
T2 - ITM Web of Conferences
AU - Zhang, Majun
PY - 2025
DA - 2025/01/23
PB - EDP Sciences
SP - 4010
VL - 70
SN - 2271-2097
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2025_Zhang,
author = {Majun Zhang},
title = {Effectiveness Evaluation of Random Forest, Naive Bayes, and Support Vector Machine Models for KDDCUP99 Anomaly Detection Based on K-means Clustering},
journal = {ITM Web of Conferences},
year = {2025},
volume = {70},
publisher = {EDP Sciences},
month = {jan},
url = {https://www.itm-conferences.org/10.1051/itmconf/20257004010},
pages = {4010},
doi = {10.1051/itmconf/20257004010}
}