Open Access
Open access
volume 10 issue 6 pages 699

A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches

Yogendra Singh Solanki 1
Prasun Chakrabarti 2
Alexander Vinogradov 4
Radomir Gono 6
Mohammad Nami 7
Publication typeJournal Article
Publication date2021-03-16
scimago Q2
wos Q2
SJR0.615
CiteScore6.1
Impact factor2.6
ISSN20799292
Electrical and Electronic Engineering
Hardware and Architecture
Computer Networks and Communications
Control and Systems Engineering
Signal Processing
Abstract

Nowadays, breast cancer is the most frequent cancer among women. Early detection is a critical issue that can be effectively achieved by machine learning (ML) techniques. Thus in this article, the methods to improve the accuracy of ML classification models for the prognosis of breast cancer are investigated. Wrapper-based feature selection approach along with nature-inspired algorithms such as Particle Swarm Optimization, Genetic Search, and Greedy Stepwise has been used to identify the important features. On these selected features popular machine learning classifiers Support Vector Machine, J48 (C4.5 Decision Tree Algorithm), Multilayer-Perceptron (a feed-forward ANN) were used in the system. The methodology of the proposed system is structured into five stages which include (1) Data Pre-processing; (2) Data imbalance handling; (3) Feature Selection; (4) Machine Learning Classifiers; (5) classifier’s performance evaluation. The dataset under this research experimentation is referred from the UCI Machine Learning Repository, named Breast Cancer Wisconsin (Diagnostic) Data Set. This article indicated that the J48 decision tree classifier is the appropriate machine learning-based classifier for optimum breast cancer prognosis. Support Vector Machine with Particle Swarm Optimization algorithm for feature selection achieves the accuracy of 98.24%, MCC = 0.961, Sensitivity = 99.11%, Specificity = 96.54%, and Kappa statistics of 0.9606. It is also observed that the J48 Decision Tree classifier with the Genetic Search algorithm for feature selection achieves the accuracy of 98.83%, MCC = 0.974, Sensitivity = 98.95%, Specificity = 98.58%, and Kappa statistics of 0.9735. Furthermore, Multilayer Perceptron ANN classifier with Genetic Search algorithm for feature selection achieves the accuracy of 98.59%, MCC = 0.968, Sensitivity = 98.6%, Specificity = 98.57%, and Kappa statistics of 0.9682.

Found 
Found 

Top-30

Journals

1
2
3
Electronics (Switzerland)
3 publications, 7.89%
Lecture Notes in Networks and Systems
2 publications, 5.26%
IEEE Access
2 publications, 5.26%
Scientific Reports
1 publication, 2.63%
Applied Sciences (Switzerland)
1 publication, 2.63%
Sustainability
1 publication, 2.63%
Biomedical Signal Processing and Control
1 publication, 2.63%
Computational Intelligence
1 publication, 2.63%
Diagnostics
1 publication, 2.63%
Computational Intelligence and Neuroscience
1 publication, 2.63%
Journal of Education and Health Promotion
1 publication, 2.63%
Journal of Cancer Survivorship
1 publication, 2.63%
Lecture Notes in Electrical Engineering
1 publication, 2.63%
Metaheuristics for Machine Learning
1 publication, 2.63%
Evolving Systems
1 publication, 2.63%
Computers in Biology and Medicine
1 publication, 2.63%
Chemometrics and Intelligent Laboratory Systems
1 publication, 2.63%
International Journal of Computational Intelligence Systems
1 publication, 2.63%
STEAM-H: Science, Technology, Engineering, Agriculture, Mathematics & Health
1 publication, 2.63%
PeerJ Computer Science
1 publication, 2.63%
Frontiers in Physiology
1 publication, 2.63%
Procedia Computer Science
1 publication, 2.63%
Advances in Computational Intelligence and Robotics
1 publication, 2.63%
Smart Innovation, Systems and Technologies
1 publication, 2.63%
AIP Conference Proceedings
1 publication, 2.63%
Shiraz E Medical Journal
1 publication, 2.63%
1
2
3

Publishers

2
4
6
8
10
Springer Nature
10 publications, 26.32%
Institute of Electrical and Electronics Engineers (IEEE)
9 publications, 23.68%
MDPI
6 publications, 15.79%
Elsevier
4 publications, 10.53%
Wiley
1 publication, 2.63%
Hindawi Limited
1 publication, 2.63%
Ovid Technologies (Wolters Kluwer Health)
1 publication, 2.63%
IntechOpen
1 publication, 2.63%
PeerJ
1 publication, 2.63%
Frontiers Media S.A.
1 publication, 2.63%
IGI Global
1 publication, 2.63%
AIP Publishing
1 publication, 2.63%
Brieflands
1 publication, 2.63%
2
4
6
8
10
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
38
Share
Cite this
GOST |
Cite this
GOST Copy
Solanki Y. S. et al. A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches // Electronics (Switzerland). 2021. Vol. 10. No. 6. p. 699.
GOST all authors (up to 50) Copy
Solanki Y. S., Chakrabarti P., Jasinski M., Leonowicz Z., Bolshev V., Vinogradov A., Jasinska E., Gono R., Nami M. A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches // Electronics (Switzerland). 2021. Vol. 10. No. 6. p. 699.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.3390/electronics10060699
UR - https://doi.org/10.3390/electronics10060699
TI - A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches
T2 - Electronics (Switzerland)
AU - Solanki, Yogendra Singh
AU - Chakrabarti, Prasun
AU - Jasinski, Michal
AU - Leonowicz, Zbigniew
AU - Bolshev, Vadim
AU - Vinogradov, Alexander
AU - Jasinska, Elzbieta
AU - Gono, Radomir
AU - Nami, Mohammad
PY - 2021
DA - 2021/03/16
PB - MDPI
SP - 699
IS - 6
VL - 10
SN - 2079-9292
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2021_Solanki,
author = {Yogendra Singh Solanki and Prasun Chakrabarti and Michal Jasinski and Zbigniew Leonowicz and Vadim Bolshev and Alexander Vinogradov and Elzbieta Jasinska and Radomir Gono and Mohammad Nami},
title = {A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches},
journal = {Electronics (Switzerland)},
year = {2021},
volume = {10},
publisher = {MDPI},
month = {mar},
url = {https://doi.org/10.3390/electronics10060699},
number = {6},
pages = {699},
doi = {10.3390/electronics10060699}
}
MLA
Cite this
MLA Copy
Solanki, Yogendra Singh, et al. “A Hybrid Supervised Machine Learning Classifier System for Breast Cancer Prognosis Using Feature Selection and Data Imbalance Handling Approaches.” Electronics (Switzerland), vol. 10, no. 6, Mar. 2021, p. 699. https://doi.org/10.3390/electronics10060699.