The k-means Algorithm: A Comprehensive Survey and Performance Evaluation
The k-means clustering algorithm is considered one of the most powerful and popular data mining algorithms in the research community. However, despite its popularity, the algorithm has certain limitations, including problems associated with random initialization of the centroids which leads to unexpected convergence. Additionally, such a clustering algorithm requires the number of clusters to be defined beforehand, which is responsible for different cluster shapes and outlier effects. A fundamental problem of the k-means algorithm is its inability to handle various data types. This paper provides a structured and synoptic overview of research conducted on the k-means algorithm to overcome such shortcomings. Variants of the k-means algorithms including their recent developments are discussed, where their effectiveness is investigated based on the experimental analysis of a variety of datasets. The detailed experimental analysis along with a thorough comparison among different k-means clustering algorithms differentiates our work compared to other existing survey papers. Furthermore, it outlines a clear and thorough understanding of the k-means algorithm along with its different research directions.
Top-30
Journals
|
5
10
15
20
25
30
|
|
|
IEEE Access
28 publications, 2.61%
|
|
|
Lecture Notes in Computer Science
21 publications, 1.96%
|
|
|
Sensors
18 publications, 1.68%
|
|
|
Lecture Notes in Networks and Systems
18 publications, 1.68%
|
|
|
Applied Sciences (Switzerland)
17 publications, 1.59%
|
|
|
Mathematics
14 publications, 1.31%
|
|
|
Scientific Reports
14 publications, 1.31%
|
|
|
Electronics (Switzerland)
12 publications, 1.12%
|
|
|
Communications in Computer and Information Science
11 publications, 1.03%
|
|
|
Sustainability
10 publications, 0.93%
|
|
|
Knowledge-Based Systems
10 publications, 0.93%
|
|
|
Expert Systems with Applications
10 publications, 0.93%
|
|
|
Remote Sensing
8 publications, 0.75%
|
|
|
PLoS ONE
7 publications, 0.65%
|
|
|
Algorithms
6 publications, 0.56%
|
|
|
Procedia Computer Science
6 publications, 0.56%
|
|
|
Information Sciences
5 publications, 0.47%
|
|
|
Neurocomputing
5 publications, 0.47%
|
|
|
Journal of Marine Science and Engineering
5 publications, 0.47%
|
|
|
Engineering Applications of Artificial Intelligence
5 publications, 0.47%
|
|
|
IEEE Internet of Things Journal
5 publications, 0.47%
|
|
|
Agriculture (Switzerland)
4 publications, 0.37%
|
|
|
Energies
4 publications, 0.37%
|
|
|
Ocean Engineering
4 publications, 0.37%
|
|
|
Pattern Recognition
4 publications, 0.37%
|
|
|
Buildings
4 publications, 0.37%
|
|
|
AIP Conference Proceedings
4 publications, 0.37%
|
|
|
Information (Switzerland)
4 publications, 0.37%
|
|
|
Applied Soft Computing Journal
4 publications, 0.37%
|
|
|
5
10
15
20
25
30
|
Publishers
|
50
100
150
200
250
300
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
265 publications, 24.72%
|
|
|
Elsevier
228 publications, 21.27%
|
|
|
MDPI
182 publications, 16.98%
|
|
|
Springer Nature
182 publications, 16.98%
|
|
|
Association for Computing Machinery (ACM)
24 publications, 2.24%
|
|
|
Wiley
23 publications, 2.15%
|
|
|
Taylor & Francis
15 publications, 1.4%
|
|
|
Cold Spring Harbor Laboratory
11 publications, 1.03%
|
|
|
Hindawi Limited
10 publications, 0.93%
|
|
|
AIP Publishing
9 publications, 0.84%
|
|
|
SAGE
9 publications, 0.84%
|
|
|
Frontiers Media S.A.
9 publications, 0.84%
|
|
|
Public Library of Science (PLoS)
7 publications, 0.65%
|
|
|
American Chemical Society (ACS)
7 publications, 0.65%
|
|
|
SPIE-Intl Soc Optical Eng
7 publications, 0.65%
|
|
|
IOP Publishing
6 publications, 0.56%
|
|
|
American Society of Civil Engineers (ASCE)
4 publications, 0.37%
|
|
|
Walter de Gruyter
4 publications, 0.37%
|
|
|
Optica Publishing Group
4 publications, 0.37%
|
|
|
Oxford University Press
4 publications, 0.37%
|
|
|
Ovid Technologies (Wolters Kluwer Health)
3 publications, 0.28%
|
|
|
PeerJ
3 publications, 0.28%
|
|
|
Copernicus
3 publications, 0.28%
|
|
|
IGI Global
3 publications, 0.28%
|
|
|
Tech Science Press
2 publications, 0.19%
|
|
|
Hans Publishers
2 publications, 0.19%
|
|
|
American Physical Society (APS)
2 publications, 0.19%
|
|
|
SAE International
2 publications, 0.19%
|
|
|
Federal Center for Hygiene and Epidemiology
2 publications, 0.19%
|
|
|
50
100
150
200
250
300
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.