Open Access
volume 13 issue 2 pages 326

On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval

Mikhail Krinitskiy
Marina Aleksandrova
Polina Verezemskaya
Sergey Gulev
Alexey Sinitsyn
Nadezhda Kovaleva
Alexander Gavrikov
Publication type: Journal Article
Publication date: 2021-01-19
Scimago: Q1
WoS: Q1
SJR: 1.019
CiteScore: 8.6
Impact factor: 4.1
ISSN: 2072-4292, 2315-4632, 2315-4675
General Earth and Planetary Sciences
Abstract

Total Cloud Cover (TCC) retrieval from ground-based optical imagery is a problem that has been tackled by several generations of researchers. The number of human-designed algorithms for the estimation of TCC grows every year. However, there has been no considerable progress in terms of quality, mostly due to the lack of a systematic approach to the design of the algorithms, to the assessment of their generalization ability, and to the assessment of the TCC retrieval quality. In this study, we discuss the optimization nature of data-driven schemes for TCC retrieval. In order to compare the algorithms, we propose a framework for the assessment of the algorithms' characteristics. We present several new algorithms that are based on deep learning techniques: a model for outlier filtering and a few models for TCC retrieval from all-sky imagery. For training and assessment of the data-driven algorithms of this study, we present the Dataset of All-Sky Imagery over the Ocean (DASIO), containing over one million all-sky optical images of the visible sky dome taken in various regions of the world ocean. The research campaigns that contributed to the DASIO collection took place in the Atlantic Ocean, the Indian Ocean, the Red and Mediterranean Seas, and the Arctic Ocean. Optical imagery collected during these missions is accompanied by standard meteorological observations of cloudiness characteristics made by experienced observers. We assess the generalization ability of the presented models in several scenarios that differ in terms of the regions selected for the train and test subsets. As a result, we demonstrate that our models based on convolutional neural networks deliver superior quality compared to all previously published approaches.
As a key result, we demonstrate a considerable drop in the ability to generalize from the training data in the case of a strong covariate shift between the training and test subsets of imagery, which may occur in the case of region-aware subsampling.
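
The contrast between a region-agnostic split and a region-aware split mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the record fields (`image`, `region`, `tcc_okta`), the file names, the region labels, and both helper functions are hypothetical and chosen only for illustration. It assumes each image is tagged with the region where it was taken and labeled with TCC in oktas (0 to 8).

```python
import random


def random_split(samples, test_fraction=0.2, seed=0):
    """Region-agnostic split: shuffle all samples, then cut off a test part."""
    rng = random.Random(seed)
    shuffled = list(samples)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def region_aware_split(samples, test_regions):
    """Hold out whole regions so that no test region is seen in training."""
    held_out = set(test_regions)
    train = [s for s in samples if s["region"] not in held_out]
    test = [s for s in samples if s["region"] in held_out]
    return train, test


# Hypothetical records: file names, regions and labels are made up.
samples = [
    {"image": f"{region}_{i:03d}.jpg", "region": region, "tcc_okta": i % 9}
    for region in ("atlantic", "indian", "arctic", "mediterranean")
    for i in range(25)
]

rand_train, rand_test = random_split(samples)
reg_train, reg_test = region_aware_split(samples, test_regions=["arctic"])

print(len(rand_train), len(rand_test))
print(sorted({s["region"] for s in reg_train}))
print(sorted({s["region"] for s in reg_test}))
```

With the random split, the test images are drawn from the same pool of regions as the training images, whereas the region-aware split evaluates the model only on imagery from a region it never saw. The second setting is the one in which a covariate shift between the subsets can be expected.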


Top-30

Journals

Moscow University Physics Bulletin (English Translation of Vestnik Moskovskogo Universiteta, Fizika)
3 publications, 21.43%
Remote Sensing
2 publications, 14.29%
Earth and Space Science
1 publication, 7.14%
Energies
1 publication, 7.14%
Oceanology
1 publication, 7.14%
Advances in Applied Energy
1 publication, 7.14%
Atmospheric Measurement Techniques
1 publication, 7.14%
AIP Conference Proceedings
1 publication, 7.14%
Izvestiya - Atmospheric and Oceanic Physics
1 publication, 7.14%
Smart Agricultural Technology
1 publication, 7.14%
Известия Российской академии наук Физика атмосферы и океана
1 publication, 7.14%

Publishers

MDPI
3 publications, 21.43%
Pleiades Publishing
3 publications, 21.43%
Elsevier
2 publications, 14.29%
Allerton Press
2 publications, 14.29%
Wiley
1 publication, 7.14%
Copernicus
1 publication, 7.14%
AIP Publishing
1 publication, 7.14%
The Russian Academy of Sciences
1 publication, 7.14%
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Metrics
14
Cite this
GOST
Krinitskiy M. et al. On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval // Remote Sensing. 2021. Vol. 13. No. 2. p. 326.
GOST (all authors)
Krinitskiy M., Aleksandrova M., Verezemskaya P., Gulev S., Sinitsyn A., Kovaleva N., Gavrikov A. On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval // Remote Sensing. 2021. Vol. 13. No. 2. p. 326.
RIS
TY - JOUR
DO - 10.3390/rs13020326
UR - https://doi.org/10.3390/rs13020326
TI - On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval
T2 - Remote Sensing
AU - Krinitskiy, Mikhail
AU - Aleksandrova, Marina
AU - Verezemskaya, Polina
AU - Gulev, Sergey
AU - Sinitsyn, Alexey
AU - Kovaleva, Nadezhda
AU - Gavrikov, Alexander
PY - 2021
DA - 2021/01/19
PB - MDPI
SP - 326
IS - 2
VL - 13
SN - 2072-4292
SN - 2315-4632
SN - 2315-4675
ER -
BibTeX
@article{2021_Krinitskiy,
author = {Mikhail Krinitskiy and Marina Aleksandrova and Polina Verezemskaya and Sergey Gulev and Alexey Sinitsyn and Nadezhda Kovaleva and Alexander Gavrikov},
title = {On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval},
journal = {Remote Sensing},
year = {2021},
volume = {13},
publisher = {MDPI},
month = {jan},
url = {https://doi.org/10.3390/rs13020326},
number = {2},
pages = {326},
doi = {10.3390/rs13020326}
}
MLA
Krinitskiy, Mikhail, et al. “On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval.” Remote Sensing, vol. 13, no. 2, Jan. 2021, p. 326. https://doi.org/10.3390/rs13020326.