Journal of Chemometrics, volume 39, issue 2

Improving Vapor Pressure Prediction Through Integration of Multiple Molecular Representations: A Super Learner Approach

Ji Hyun Nam ¹

Seul Lee ²

Seongil Jo ¹

Jaeoh Kim ¹

Jooyeon Lee ³

Jahyun Koo ³

Byounghwak Lee ⁴

Keunhong Jeong ⁴

Donghyeon Yu ¹

¹ Department of Statistics and Data Science, Inha University, Incheon, Republic of Korea

² Department of Statistics, Seoul National University, Seoul, Republic of Korea

³ School of Biomedical Engineering, Korea University, Seoul, Republic of Korea

⁴ Department of Chemistry, Korea Military Academy, Seoul, Republic of Korea

Publication type: Journal Article

Publication date: 2025-02-11

Wiley

Journal: Journal of Chemometrics

scimago Q3

SJR: 0.383

CiteScore: 5.2

Impact factor: 1.9

ISSN: 0886-9383 (Print), 1099-128X (Electronic)

DOI: 10.1002/cem.70003

Abstract

Accurate prediction of vapor pressure is essential in chemical engineering, environmental science, and pharmaceutical development, where it affects the volatility and stability of compounds. Traditional methods often fall short for complex or novel molecular structures. This study introduces a machine learning approach that integrates graph neural networks (GNNs) and Chem‐BERT models to improve prediction accuracy. Using the largest dataset to date, we derived comprehensive chemical descriptors and fingerprints. We evaluated 19 predictive models, including ridge regression, random forest, support vector regression, and feed‐forward neural networks, trained on diverse features such as PaDEL and Morgan fingerprints, chemical descriptors, and Chem‐BERT embeddings. Central to our methodology is the super learner architecture, which combines the 19 models to enhance accuracy. The super learner achieved a root mean squared error (RMSE) of 0.8200, outperforming each individual model and previously reported results. These results highlight the effectiveness of integrating GNNs and Chem‐BERT for capturing detailed molecular information, setting a new benchmark for vapor pressure prediction. This study underscores the value of advanced machine learning techniques and comprehensive datasets, offering a robust tool for researchers and paving the way for future advancements in chemical property prediction.
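
As a rough illustration of the super learner idea named in the abstract, the sketch below stacks three base regressors with scikit-learn and compares test RMSE. It is not the authors' implementation. The synthetic data from `make_regression` stands in for molecular descriptors and vapor pressure targets, only three base learners are used rather than 19, and the non-negative linear meta-learner and all hyperparameters are assumptions made for the example.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor, StackingRegressor
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.svm import SVR

# Assumption: synthetic features and targets replace real descriptors
# and measured vapor pressures, which are not available here.
X, y = make_regression(n_samples=300, n_features=20, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Three of the model families listed in the abstract (hyperparameters assumed).
base_learners = [
    ("ridge", Ridge(alpha=1.0)),
    ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
    ("svr", SVR(C=10.0)),
]

# The meta-learner is fit on 5-fold out-of-fold predictions of the base
# learners; positive=True constrains the combination weights to be >= 0.
super_learner = StackingRegressor(
    estimators=base_learners,
    final_estimator=LinearRegression(positive=True),
    cv=5,
)
super_learner.fit(X_train, y_train)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return float(np.sqrt(mean_squared_error(y_true, y_pred)))


# Compare the ensemble with each base learner refit on the full training set.
scores = {"super_learner": rmse(y_test, super_learner.predict(X_test))}
for name, fitted in super_learner.named_estimators_.items():
    scores[name] = rmse(y_test, fitted.predict(X_test))

for name, value in scores.items():
    print(f"{name}: RMSE = {value:.3f}")
```

The fitted weights are available as `super_learner.final_estimator_.coef_`. On real data, each base learner could be trained on a different molecular representation before stacking, which is the integration the abstract describes.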
