Open Access
Open access
volume 20 issue 2 pages e0314327

Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks

Md Khalid Hossen 1
Yan-Tsung Peng 2
Meng Chen Meng Chen 3
Publication typeJournal Article
Publication date2025-02-11
scimago Q1
wos Q2
SJR0.803
CiteScore5.4
Impact factor2.6
ISSN19326203
Abstract

In many deep learning tasks, it is assumed that the data used in the training process is sampled from the same distribution. However, this may not be accurate for data collected from different contexts or during different periods. For instance, the temperatures in a city can vary from year to year due to various unclear reasons. In this paper, we utilized three distinct statistical techniques to analyze annual data drifting at various stations. These techniques calculate the P values for each station by comparing data from five years (2014-2018) to identify data drifting phenomena. To find out the data drifting scenario those statistical techniques and calculate the P value from those techniques to measure the data drifting in specific locations. From those statistical techniques, the highest drifting stations can be identified from the previous year’s datasets To identify data drifting and highlight areas with significant drift, we utilized meteorological air quality and weather data in this study. We proposed two models that consider the characteristics of data drifting for PM2.5 prediction and compared them with various deep learning models, such as Long Short-Term Memory (LSTM) and its variants, for predictions from the next hour to the 64th hour. Our proposed models significantly outperform traditional neural networks. Additionally, we introduced a wrapped loss function incorporated into a model, resulting in more accurate results compared to those using the original loss function alone and prediction has been evaluated by RMSE, MAE and MAPE metrics. The proposed Front-loaded connection model(FLC) and Back-loaded connection model (BLC) solve the data drifting issue and the wrap loss function also help alleviate the data drifting problem with model training and works for the neural network models to achieve more accurate results. Eventually, the experimental results have shown that the proposed model performance enhanced from 24.1% -16%, 12%-8.3% respectively at 1h-24h, 32h-64h with compared to baselines BILSTM model, by 24.6% -11.8%, 10%-10.2% respectively at 1h-24h, 32h-64h compared to CNN model in hourly PM2.5 predictions.

Found 
Found 

Top-30

Journals

1
Machine Learning for Computational Science and Engineering
1 publication, 50%
Scientific Reports
1 publication, 50%
1

Publishers

1
2
Springer Nature
2 publications, 100%
1
2
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
2
Share
Cite this
GOST |
Cite this
GOST Copy
Hossen M. K. et al. Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks // PLoS ONE. 2025. Vol. 20. No. 2. p. e0314327.
GOST all authors (up to 50) Copy
Hossen M. K., Yan-Tsung Peng, Meng Chen M. C. Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks // PLoS ONE. 2025. Vol. 20. No. 2. p. e0314327.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1371/journal.pone.0314327
UR - https://dx.plos.org/10.1371/journal.pone.0314327
TI - Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks
T2 - PLoS ONE
AU - Hossen, Md Khalid
AU - Yan-Tsung Peng
AU - Meng Chen, Meng Chen
PY - 2025
DA - 2025/02/11
PB - Public Library of Science (PLoS)
SP - e0314327
IS - 2
VL - 20
SN - 1932-6203
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2025_Hossen,
author = {Md Khalid Hossen and Yan-Tsung Peng and Meng Chen Meng Chen},
title = {Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks},
journal = {PLoS ONE},
year = {2025},
volume = {20},
publisher = {Public Library of Science (PLoS)},
month = {feb},
url = {https://dx.plos.org/10.1371/journal.pone.0314327},
number = {2},
pages = {e0314327},
doi = {10.1371/journal.pone.0314327}
}
MLA
Cite this
MLA Copy
Hossen, Md Khalid, et al. “Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks.” PLoS ONE, vol. 20, no. 2, Feb. 2025, p. e0314327. https://dx.plos.org/10.1371/journal.pone.0314327.