volume 585 pages 313-343

Binary imbalanced data classification based on diversity oversampling by generative models

Publication typeJournal Article
Publication date2022-03-01
scimago Q1
wos Q1
SJR1.803
CiteScore14.4
Impact factor6.8
ISSN00200255, 18726291
Computer Science Applications
Artificial Intelligence
Software
Control and Systems Engineering
Theoretical Computer Science
Information Systems and Management
Abstract
In many practical applications, the data are class imbalanced. Accordingly, it is very meaningful and valuable to investigate the classification of imbalanced data. In the framework of binary imbalanced data classification, the synthetic minority oversampling technique (SMOTE) is the best-known oversampling method. However, for each positive sample, SMOTE generates only k synthetic samples on the lines between the positive sample and its k-nearest neighbors, resulting in three drawbacks: (1) SMOTE cannot effectively extend the training field of positive samples; (2) the generated positive samples lack diversity; (3) SMOTE does not accurately approximate the probability distribution of the positive samples. Therefore, two binary imbalanced data classification methods named BIDC1 and BIDC2 based on diversity oversampling by generative models are proposed. The BIDC1 and BIDC2 conduct diversity oversampling using extreme learning machine autoencoder and generative adversarial network, respectively. Extensive experiments on 26 data sets are conducted to compare the two methods with 14 state-of-the-art methods using five metrics: F-measure, G-means, AUC-area, MMD-score, and Silhouette-score. The experimental results demonstrate that the two methods outperform the other 14 methods.
Found 
Found 

Top-30

Journals

1
2
3
4
5
6
Information Sciences
6 publications, 13.04%
IEEE Access
6 publications, 13.04%
Expert Systems with Applications
3 publications, 6.52%
Scientific Reports
2 publications, 4.35%
Knowledge-Based Systems
2 publications, 4.35%
Neural Networks
2 publications, 4.35%
International Journal of Machine Learning and Cybernetics
1 publication, 2.17%
Engineering Applications of Artificial Intelligence
1 publication, 2.17%
Marine Policy
1 publication, 2.17%
Applied Intelligence
1 publication, 2.17%
Methods in Ecology and Evolution
1 publication, 2.17%
Studies in Computational Intelligence
1 publication, 2.17%
Artificial Intelligence in Data and Big Data Processing
1 publication, 2.17%
Communications in Computer and Information Science
1 publication, 2.17%
Journal of Big Data
1 publication, 2.17%
Lecture Notes in Networks and Systems
1 publication, 2.17%
IEEE Transactions on Fuzzy Systems
1 publication, 2.17%
Neurocomputing
1 publication, 2.17%
Group Decision and Negotiation
1 publication, 2.17%
IEEE Transactions on Knowledge and Data Engineering
1 publication, 2.17%
Future Generation Computer Systems
1 publication, 2.17%
Sankhya A
1 publication, 2.17%
Knowledge and Information Systems
1 publication, 2.17%
IEEE Transactions on Systems, Man, and Cybernetics: Systems
1 publication, 2.17%
IEEE Transactions on Intelligent Transportation Systems
1 publication, 2.17%
Electronics (Switzerland)
1 publication, 2.17%
1
2
3
4
5
6

Publishers

2
4
6
8
10
12
14
16
18
Elsevier
17 publications, 36.96%
Institute of Electrical and Electronics Engineers (IEEE)
15 publications, 32.61%
Springer Nature
12 publications, 26.09%
Wiley
1 publication, 2.17%
MDPI
1 publication, 2.17%
2
4
6
8
10
12
14
16
18
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
47
Share
Cite this
GOST |
Cite this
GOST Copy
Zhai J., QI J., Shen C. Binary imbalanced data classification based on diversity oversampling by generative models // Information Sciences. 2022. Vol. 585. pp. 313-343.
GOST all authors (up to 50) Copy
Zhai J., QI J., Shen C. Binary imbalanced data classification based on diversity oversampling by generative models // Information Sciences. 2022. Vol. 585. pp. 313-343.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1016/j.ins.2021.11.058
UR - https://doi.org/10.1016/j.ins.2021.11.058
TI - Binary imbalanced data classification based on diversity oversampling by generative models
T2 - Information Sciences
AU - Zhai, Junhai
AU - QI, JIAXING
AU - Shen, Chu
PY - 2022
DA - 2022/03/01
PB - Elsevier
SP - 313-343
VL - 585
SN - 0020-0255
SN - 1872-6291
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2022_Zhai,
author = {Junhai Zhai and JIAXING QI and Chu Shen},
title = {Binary imbalanced data classification based on diversity oversampling by generative models},
journal = {Information Sciences},
year = {2022},
volume = {585},
publisher = {Elsevier},
month = {mar},
url = {https://doi.org/10.1016/j.ins.2021.11.058},
pages = {313--343},
doi = {10.1016/j.ins.2021.11.058}
}