Open Access
Open access
volume 9 issue 8 pages 1295

The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

Publication typeJournal Article
Publication date2020-08-12
scimago Q2
wos Q2
SJR0.615
CiteScore6.1
Impact factor2.6
ISSN20799292
Electrical and Electronic Engineering
Hardware and Architecture
Computer Networks and Communications
Control and Systems Engineering
Signal Processing
Abstract

The k-means clustering algorithm is considered one of the most powerful and popular data mining algorithms in the research community. However, despite its popularity, the algorithm has certain limitations, including problems associated with random initialization of the centroids which leads to unexpected convergence. Additionally, such a clustering algorithm requires the number of clusters to be defined beforehand, which is responsible for different cluster shapes and outlier effects. A fundamental problem of the k-means algorithm is its inability to handle various data types. This paper provides a structured and synoptic overview of research conducted on the k-means algorithm to overcome such shortcomings. Variants of the k-means algorithms including their recent developments are discussed, where their effectiveness is investigated based on the experimental analysis of a variety of datasets. The detailed experimental analysis along with a thorough comparison among different k-means clustering algorithms differentiates our work compared to other existing survey papers. Furthermore, it outlines a clear and thorough understanding of the k-means algorithm along with its different research directions.

Found 
Found 

Top-30

Journals

5
10
15
20
25
30
IEEE Access
28 publications, 2.61%
Lecture Notes in Computer Science
21 publications, 1.96%
Sensors
18 publications, 1.68%
Lecture Notes in Networks and Systems
18 publications, 1.68%
Applied Sciences (Switzerland)
17 publications, 1.59%
Mathematics
14 publications, 1.31%
Scientific Reports
14 publications, 1.31%
Electronics (Switzerland)
12 publications, 1.12%
Communications in Computer and Information Science
11 publications, 1.03%
Sustainability
10 publications, 0.93%
Knowledge-Based Systems
10 publications, 0.93%
Expert Systems with Applications
10 publications, 0.93%
Remote Sensing
8 publications, 0.75%
PLoS ONE
7 publications, 0.65%
Algorithms
6 publications, 0.56%
Procedia Computer Science
6 publications, 0.56%
Information Sciences
5 publications, 0.47%
Neurocomputing
5 publications, 0.47%
Journal of Marine Science and Engineering
5 publications, 0.47%
Engineering Applications of Artificial Intelligence
5 publications, 0.47%
IEEE Internet of Things Journal
5 publications, 0.47%
Agriculture (Switzerland)
4 publications, 0.37%
Energies
4 publications, 0.37%
Ocean Engineering
4 publications, 0.37%
Pattern Recognition
4 publications, 0.37%
Buildings
4 publications, 0.37%
AIP Conference Proceedings
4 publications, 0.37%
Information (Switzerland)
4 publications, 0.37%
Applied Soft Computing Journal
4 publications, 0.37%
5
10
15
20
25
30

Publishers

50
100
150
200
250
300
Institute of Electrical and Electronics Engineers (IEEE)
265 publications, 24.72%
Elsevier
228 publications, 21.27%
MDPI
182 publications, 16.98%
Springer Nature
182 publications, 16.98%
Association for Computing Machinery (ACM)
24 publications, 2.24%
Wiley
23 publications, 2.15%
Taylor & Francis
15 publications, 1.4%
Cold Spring Harbor Laboratory
11 publications, 1.03%
Hindawi Limited
10 publications, 0.93%
AIP Publishing
9 publications, 0.84%
SAGE
9 publications, 0.84%
Frontiers Media S.A.
9 publications, 0.84%
Public Library of Science (PLoS)
7 publications, 0.65%
American Chemical Society (ACS)
7 publications, 0.65%
SPIE-Intl Soc Optical Eng
7 publications, 0.65%
IOP Publishing
6 publications, 0.56%
American Society of Civil Engineers (ASCE)
4 publications, 0.37%
Walter de Gruyter
4 publications, 0.37%
Optica Publishing Group
4 publications, 0.37%
Oxford University Press
4 publications, 0.37%
Ovid Technologies (Wolters Kluwer Health)
3 publications, 0.28%
PeerJ
3 publications, 0.28%
Copernicus
3 publications, 0.28%
IGI Global
3 publications, 0.28%
Tech Science Press
2 publications, 0.19%
Hans Publishers
2 publications, 0.19%
American Physical Society (APS)
2 publications, 0.19%
SAE International
2 publications, 0.19%
Federal Center for Hygiene and Epidemiology
2 publications, 0.19%
50
100
150
200
250
300
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
1.1k
Share
Cite this
GOST |
Cite this
GOST Copy
Ahmed M. et al. The k-means Algorithm: A Comprehensive Survey and Performance Evaluation // Electronics (Switzerland). 2020. Vol. 9. No. 8. p. 1295.
GOST all authors (up to 50) Copy
Ahmed M., Seraj R., Islam S. The k-means Algorithm: A Comprehensive Survey and Performance Evaluation // Electronics (Switzerland). 2020. Vol. 9. No. 8. p. 1295.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.3390/electronics9081295
UR - https://doi.org/10.3390/electronics9081295
TI - The k-means Algorithm: A Comprehensive Survey and Performance Evaluation
T2 - Electronics (Switzerland)
AU - Ahmed, Mohiuddin
AU - Seraj, Raihan
AU - Islam, Syed
PY - 2020
DA - 2020/08/12
PB - MDPI
SP - 1295
IS - 8
VL - 9
SN - 2079-9292
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2020_Ahmed,
author = {Mohiuddin Ahmed and Raihan Seraj and Syed Islam},
title = {The k-means Algorithm: A Comprehensive Survey and Performance Evaluation},
journal = {Electronics (Switzerland)},
year = {2020},
volume = {9},
publisher = {MDPI},
month = {aug},
url = {https://doi.org/10.3390/electronics9081295},
number = {8},
pages = {1295},
doi = {10.3390/electronics9081295}
}
MLA
Cite this
MLA Copy
Ahmed, Mohiuddin, et al. “The k-means Algorithm: A Comprehensive Survey and Performance Evaluation.” Electronics (Switzerland), vol. 9, no. 8, Aug. 2020, p. 1295. https://doi.org/10.3390/electronics9081295.