From NMR to AI: Do We Need 1H NMR Experimental Spectra to Obtain High-Quality logD Prediction Models?
1
Department of Medicinal Chemistry, Celon Pharma S.A., ul. Marymoncka 15, Kazun Nowy 05-152, Poland
|
3
Faculty of Mathematics and Natural Sciences, Department of Chemistry, University of Applied Sciences in Tarnow, Mickiewicza 8, Tarnow 33-100, Poland
|
Publication type: Journal Article
Publication date: 2025-03-05
scimago Q1
wos Q1
SJR: 1.467
CiteScore: 9.8
Impact factor: 5.3
ISSN: 15499596, 1549960X
Abstract
This study presents a novel approach to 1H NMR-based machine learning (ML) models for predicting logD using computer-generated 1H NMR spectra. Building on our previous work, which integrated experimental 1H NMR data, this study addresses key limitations associated with experimental measurements, such as sample stability, solvent variability, and extensive processing, by replacing them with fully computational workflows. Benchmarking across various density functional theory (DFT) functionals and basis sets highlighted their limitations, with DFT-based models showing relatively high RMSE values (average CHI logD of 1.12, lowest at 0.96) and extensive computational demands, limiting their usefulness for large-scale predictions. In contrast, models trained on predicted 1H NMR spectra by NMRshiftDB2 and JEOL JASON achieved RMSE values as low as 0.76, compared to 0.88 for experimental spectra. Further analysis revealed that mixing experimental and predicted spectra did not enhance accuracy, underscoring the advantage of homogeneous datasets. Validation with external datasets confirmed the robustness of our models, showing comparable performance to commercial software like Instant JChem, thus underscoring the reliability of the proposed computational workflow. Additionally, using normalized RMSE (NRMSE) proved essential for consistent model evaluation across datasets with varying data scales. By eliminating the need for experimental input, this workflow offers a widely accessible, computationally efficient pipeline, setting a new standard for ML-driven chemical property predictions without experimental data constraints.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
5
Total citations:
5
Citations from 0:
0
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Leniak A. et al. From NMR to AI: Do We Need 1H NMR Experimental Spectra to Obtain High-Quality logD Prediction Models? // Journal of Chemical Information and Modeling. 2025. Vol. 65. No. 6. pp. 2924-2939.
GOST all authors (up to 50)
Copy
Leniak A., Pietruś W., Świderska A., Kurczab R. From NMR to AI: Do We Need 1H NMR Experimental Spectra to Obtain High-Quality logD Prediction Models? // Journal of Chemical Information and Modeling. 2025. Vol. 65. No. 6. pp. 2924-2939.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1021/acs.jcim.4c02145
UR - https://pubs.acs.org/doi/10.1021/acs.jcim.4c02145
TI - From NMR to AI: Do We Need 1H NMR Experimental Spectra to Obtain High-Quality logD Prediction Models?
T2 - Journal of Chemical Information and Modeling
AU - Leniak, Arkadiusz
AU - Pietruś, Wojciech
AU - Świderska, Aleksandra
AU - Kurczab, Rafał
PY - 2025
DA - 2025/03/05
PB - American Chemical Society (ACS)
SP - 2924-2939
IS - 6
VL - 65
SN - 1549-9596
SN - 1549-960X
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2025_Leniak,
author = {Arkadiusz Leniak and Wojciech Pietruś and Aleksandra Świderska and Rafał Kurczab},
title = {From NMR to AI: Do We Need 1H NMR Experimental Spectra to Obtain High-Quality logD Prediction Models?},
journal = {Journal of Chemical Information and Modeling},
year = {2025},
volume = {65},
publisher = {American Chemical Society (ACS)},
month = {mar},
url = {https://pubs.acs.org/doi/10.1021/acs.jcim.4c02145},
number = {6},
pages = {2924--2939},
doi = {10.1021/acs.jcim.4c02145}
}
Cite this
MLA
Copy
Leniak, Arkadiusz, et al. “From NMR to AI: Do We Need 1H NMR Experimental Spectra to Obtain High-Quality logD Prediction Models?.” Journal of Chemical Information and Modeling, vol. 65, no. 6, Mar. 2025, pp. 2924-2939. https://pubs.acs.org/doi/10.1021/acs.jcim.4c02145.