A Systematic Review on Data Scarcity Problem in Deep Learning: Solution and Applications
Recent advancements in deep learning architecture have increased its utility in real-life applications. Deep learning models require a large amount of data to train the model. In many application domains, there is a limited set of data available for training neural networks as collecting new data is either not feasible or requires more resources such as in marketing, computer vision, and medical science. These models require a large amount of data to avoid the problem of overfitting. One of the data space solutions to the problem of limited data is data augmentation. The purpose of this study focuses on various data augmentation techniques that can be used to further improve the accuracy of a neural network. This saves the cost and time consumption required to collect new data for the training of deep neural networks by augmenting available data. This also regularizes the model and improves its capability of generalization. The need for large datasets in different fields such as computer vision, natural language processing, security, and healthcare is also covered in this survey paper. The goal of this paper is to provide a comprehensive survey of recent advancements in data augmentation techniques and their application in various domains.
Top-30
Journals
1
2
3
4
|
|
Applied Sciences (Switzerland)
4 publications, 4.76%
|
|
IEEE Access
3 publications, 3.57%
|
|
Lecture Notes in Computer Science
3 publications, 3.57%
|
|
Nature Machine Intelligence
2 publications, 2.38%
|
|
Engineering Applications of Artificial Intelligence
2 publications, 2.38%
|
|
Expert Systems with Applications
2 publications, 2.38%
|
|
ACM Transactions on Recommender Systems
2 publications, 2.38%
|
|
Pattern Recognition
2 publications, 2.38%
|
|
New Media and Society
1 publication, 1.19%
|
|
Frontiers in Big Data
1 publication, 1.19%
|
|
Computer Methods and Programs in Biomedicine
1 publication, 1.19%
|
|
Technological Forecasting and Social Change
1 publication, 1.19%
|
|
Professional Geographer
1 publication, 1.19%
|
|
Journal of King Saud University - Computer and Information Sciences
1 publication, 1.19%
|
|
Computers and Industrial Engineering
1 publication, 1.19%
|
|
Ecological Informatics
1 publication, 1.19%
|
|
Journal of Visual Communication and Image Representation
1 publication, 1.19%
|
|
GigaScience
1 publication, 1.19%
|
|
Information Fusion
1 publication, 1.19%
|
|
Earth's Future
1 publication, 1.19%
|
|
IEEE Journal of Biomedical and Health Informatics
1 publication, 1.19%
|
|
Science of the Total Environment
1 publication, 1.19%
|
|
Information (Switzerland)
1 publication, 1.19%
|
|
Small Methods
1 publication, 1.19%
|
|
Advances in Business Information Systems and Analytics
1 publication, 1.19%
|
|
Theory, Culture and Society
1 publication, 1.19%
|
|
Discover Artificial Intelligence
1 publication, 1.19%
|
|
Materials and Design
1 publication, 1.19%
|
|
Lecture Notes in Electrical Engineering
1 publication, 1.19%
|
|
1
2
3
4
|
Publishers
5
10
15
20
25
|
|
Elsevier
23 publications, 27.38%
|
|
Institute of Electrical and Electronics Engineers (IEEE)
18 publications, 21.43%
|
|
Springer Nature
11 publications, 13.1%
|
|
MDPI
8 publications, 9.52%
|
|
Association for Computing Machinery (ACM)
5 publications, 5.95%
|
|
SAGE
4 publications, 4.76%
|
|
Frontiers Media S.A.
2 publications, 2.38%
|
|
Cold Spring Harbor Laboratory
2 publications, 2.38%
|
|
Oxford University Press
2 publications, 2.38%
|
|
Wiley
2 publications, 2.38%
|
|
Taylor & Francis
1 publication, 1.19%
|
|
King Saud University
1 publication, 1.19%
|
|
Royal Society of Chemistry (RSC)
1 publication, 1.19%
|
|
IGI Global
1 publication, 1.19%
|
|
IOS Press
1 publication, 1.19%
|
|
ASME International
1 publication, 1.19%
|
|
Ovid Technologies (Wolters Kluwer Health)
1 publication, 1.19%
|
|
5
10
15
20
25
|
- We do not take into account publications without a DOI.
- Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
- Statistics recalculated weekly.