Open Access
Open access

Identifying organic co-solvents via machine learning solubility predictions in organic solvents and water

Тип публикацииPosted Content
Дата публикации2025-01-30
SJR
CiteScore
Impact factor
ISSN25732293
Краткое описание

Developing predictive models of solubility is useful for accelerating solvent selection for applications ranging from electrochemical conversion of organics to pharmaceutical drug development. Herein, we report the development of a machine learning (ML) workflow for identifying organic co-solvents to increase the concentration of hydrophobic molecules in aqueous mixtures. This task is of particular interest for the electrocatalytic conversion of biomass and bio-oils into sustainable fuels, which faces challenges due to the low aqueous solubility of the feedstock. First, we predict the miscibility of potential co-solvents in water, and we only consider co-solvents that are miscible. Second, we rank the co-solvents based on their ability to solubilize the molecules of interest. As such, we train two ML models on the AqSolDB and the BigSolDB datasets to predict the aqueous solubility (S) and the organic solubility (x), respectively. We select the Light Gradient Boosting Machine model architecture for aqueous solubility (test R2 = 0.864, RMSE = 0.851 log(S / (mol/dm3)) and organic solubility (test R2 = 0.805, RMSE = 0.511 log(x)) predictions based on comparing different ML models and features. We examine the generalizability of the organic solubility model on unseen solutes both quantitatively and qualitatively. We evaluate the utility of this ML workflow by identifying co-solvents for benzaldehyde and limonene—two hydrophobic molecules that are relevant for sustainable fuel production—and validate our predictions via experimental solubility measurements.

Для доступа к списку цитирований публикации необходимо авторизоваться.

Топ-30

Журналы

1
Scientific data
1 публикация, 50%
Artificial Intelligence in Data and Big Data Processing
1 публикация, 50%
1

Издатели

1
2
Springer Nature
2 публикации, 100%
1
2
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
2
Поделиться
Цитировать
ГОСТ |
Цитировать
Krzyżanowski M. et al. Identifying organic co-solvents via machine learning solubility predictions in organic solvents and water // ChemRxiv. 2025.
ГОСТ со всеми авторами (до 50) Скопировать
Krzyżanowski M., Aishee S. M., Singh N., Goldsmith B. R. Identifying organic co-solvents via machine learning solubility predictions in organic solvents and water // ChemRxiv. 2025.
RIS |
Цитировать
TY - GENERIC
DO - 10.26434/chemrxiv-2025-xlt1q
UR - https://chemrxiv.org/engage/chemrxiv/article-details/6799224c81d2151a02515c0a
TI - Identifying organic co-solvents via machine learning solubility predictions in organic solvents and water
T2 - ChemRxiv
AU - Krzyżanowski, Maurycy
AU - Aishee, Sirazam Munira
AU - Singh, Nirala
AU - Goldsmith, Bryan R.
PY - 2025
DA - 2025/01/30
PB - American Chemical Society (ACS)
SN - 2573-2293
ER -
BibTex
Цитировать
BibTex (до 50 авторов) Скопировать
@article{2025_Krzyżanowski,
author = {Maurycy Krzyżanowski and Sirazam Munira Aishee and Nirala Singh and Bryan R. Goldsmith},
title = {Identifying organic co-solvents via machine learning solubility predictions in organic solvents and water},
journal = {ChemRxiv},
year = {2025},
publisher = {American Chemical Society (ACS)},
month = {jan},
url = {https://chemrxiv.org/engage/chemrxiv/article-details/6799224c81d2151a02515c0a},
doi = {10.26434/chemrxiv-2025-xlt1q}
}
Ошибка в публикации?