Analytica Chimica Acta, volume 1147, pages 64-71

DeepReI: Deep learning-based gas chromatographic retention index predictor

Tomáš Vrzal ¹

Michaela Malečková ^{2, 3}

Jana Olsovska ¹

Research Institute of Brewing and Malting, Plc., Lípová 511/15, 120 44, Prague 2, Czech Republic. |

Research Institute of Brewing and Malting, Plc., Lípová 511/15, 120 44, Prague 2, Czech Republic |

Charles University, Faculty of Science, Department of Analytical Chemistry, Albertov 6, 128 43, Prague 2, Czech Republic. |

Publication type: Journal Article

Publication date: 2021-02-01

Elsevier

Journal: Analytica Chimica Acta

scimago Q1

wos Q1

SJR: 0.998

CiteScore: 10.4

Impact factor: 5.7

ISSN: 00032670, 18734324

DOI: 10.1016/j.aca.2020.12.043

Copy DOI

Biochemistry

Spectroscopy

Analytical Chemistry

Environmental Chemistry

Abstract

Retention index in gas chromatographic analyses is an essential tool for appropriate analyte identification. Currently, many libraries providing retention indices for a huge number of compounds on distinct stationary phase chemistries are available. However, situation could be complicated in the case of unknown unknowns not present in such libraries. The importance of identification of these compounds have risen together with a rapidly expanding interest in non-targeted analyses in the last decade. Therefore, precise in silico computation/prediction of retention indices based on a suggested molecular structure will be highly appreciated in such situations. On this basis, a predictive model based on deep learning was developed and presented in this paper. It is designed for user-friendly and accurate prediction of retention indices of compounds in gas chromatography with the semi-standard non-polar stationary phase. Simplified Molecular Input Entry System (SMILES) is used as the model’s input. Architecture of the model consists of 2D-convolutional layers, together with batch normalization, max pooling, dropout, and three residual connections. The model reaches median absolute error of prediction of the retention index for validation and test set at 16.4 and 16.0 units, respectively. Median percentage error is lower than or equal to 0.81% in the case of all mentioned data sets. Finally, the DeepReI model is presented in R package, and is available on https://github.com/TomasVrzal/DeepReI together with a user-friendly graphical user interface. • Advanced model for retention indices prediction of compounds in GC was developed. • The model is based on a convolutional neural network and advanced approaches. • Median percentage error of prediction is ≤ 0.81%. • The model is publicly available in the R package - DeepReI.

Found 23

By date By citations

Deep Learning Driven GC-MS Library Search and Its Application for Metabolomics

Matyushin D.D., Sholokhova A.Y., Buryak A.K.

Analytical Chemistry scimago Q1 wos Q1 ,

2020-08-12, citations by CoLab: 49 , Abstract

Predicting a Molecular Fingerprint from an Electron Ionization Mass Spectrum with Deep Neural Networks

Ji H., Deng H., Lu H., Zhang Z.

Analytical Chemistry scimago Q1 wos Q1 ,

2020-06-17, citations by CoLab: 72 , Abstract

From chemical structure to quantitative polymer properties prediction through convolutional neural networks

Miccio L.A., Schwartz G.A.

Polymer scimago Q1 wos Q2 ,

2020-04-01, citations by CoLab: 62 , Abstract

A deep convolutional neural network for the estimation of gas chromatographic retention indices

Matyushin D.D., Sholokhova A.Y., Buryak A.K.

Journal of Chromatography A scimago Q2 wos Q1 ,

2019-12-01, citations by CoLab: 46 , Abstract

Peak alignment of gas chromatography–mass spectrometry data with deep learning

Li M., Wang X.R.

Journal of Chromatography A scimago Q2 wos Q1 ,

2019-10-01, citations by CoLab: 35 , Abstract

Pyrolytic profiling nitrosamine specific chemiluminescence detection combined with multivariate chemometric discrimination for non-targeted detection and classification of nitroso compounds in complex samples

Vrzal T., Olšovská J.

Analytica Chimica Acta scimago Q1 wos Q1 ,

2019-06-01, citations by CoLab: 8 , Abstract

Deep learning-based component identification for the Raman spectra of mixtures

Fan X., Ming W., Zeng H., Zhang Z., Lu H.

The Analyst scimago Q2 wos Q2 ,

2019-01-24, citations by CoLab: 153 , Abstract

Convolutional neural network based on SMILES representation of compounds for detecting chemical motif

Hirohara M., Saito Y., Koda Y., Sato K., Sakakibara Y.

BMC Bioinformatics scimago Q1 wos Q1 Open Access

2018-12-31, citations by CoLab: 144 , PDF, Abstract

Prediction Models of Retention Indices for Increased Confidence in Structural Elucidation during Complex Matrix Analysis: Application to Gas Chromatography Coupled with High-Resolution Mass Spectrometry

Dossin E., Martin E., Diana P., Castellon A., Monge A., Pospisil P., Bentley M., Guy P.A.

Analytical Chemistry scimago Q1 wos Q1 ,

2016-07-22, citations by CoLab: 30 , Abstract

What experimental factors influence the accuracy of retention projections in gas chromatography–mass spectrometry?

Wilson M.B., Barnes B.B., Boswell P.G.

Journal of Chromatography A scimago Q2 wos Q1 ,

2014-12-01, citations by CoLab: 7 , Abstract

A large scale test dataset to determine optimal retention index threshold based on three mass spectral similarity measures

Zhang J., Koo I., Wang B., Gao Q., Zheng C., Zhang X.

Journal of Chromatography A scimago Q2 wos Q1 ,

2012-08-01, citations by CoLab: 20 , Abstract

Retention Indices for Frequently Reported Compounds of Plant Essential Oils

Babushok V.I., Linstrom P.J., Zenkevich I.G.

Journal of Physical and Chemical Reference Data scimago Q1 wos Q1 ,

2011-11-29, citations by CoLab: 652 , Abstract

Evaluation of a rapid method for the quantitative analysis of fatty acids in various matrices

Araujo P., Nguyen T., Frøyland L., Wang J., Kang J.X.

Journal of Chromatography A scimago Q2 wos Q1 ,

2008-11-01, citations by CoLab: 101 , Abstract

ChemmineR: a compound mining framework for R

Cao Y., Charisi A., Cheng L.-., Jiang T., Girke T.

Bioinformatics scimago Q1 wos Q1 Open Access

2008-07-02, citations by CoLab: 291 , PDF, Abstract

Practical retention index models of OV-101, DB-1, DB-5, and DB-Wax for flavor and fragrance compounds

Goodner K.L.

LWT - Food Science and Technology scimago Q1 wos Q1 Open Access

2008-07-01, citations by CoLab: 175 , Abstract

Found 33

By date By citations

Identification and hazard prioritization of hydrophobic organic chemicals in flowback and produced water particles: Implications for water management

Lin H., Zhong C., Wen R., Ma T.H., He D., Martin J.W., Goss G.G., Alessi D.S., He Y.

Water Research scimago Q1 wos Q1 ,

2025-01-01, citations by CoLab: 1

MWFormer: Estimation of Molecular Weights from Electron Ionization Mass Spectra for Improved Library Searching

Yang Q., Zhang H., Wang Y., Tan L., Xie T., Wang Y., Long J., Guo Z., Zhang Z., Lu H.

Analytical Chemistry scimago Q1 wos Q1 ,

2024-12-19, citations by CoLab: 0

Molecular Similarity Used for Evaluating the Accuracy of Retention Index Predictions in Gas Chromatography Using Deep Learning

Matyushin D.D., Sholokhova A.Y., Khrisanfov M.D., Borovikova S.A.

Russian Journal of Physical Chemistry A scimago Q4 wos Q4 ,

2024-12-01, citations by CoLab: 0 , Abstract

Ready‐to‐use Models Built Using a Diverse Set of 266 Aroma Compounds for the Estimation of Gas Chromatographic Retention Indices for the 50%‐Cyanopropylphenyl‐50%‐Dimethylpolysiloxane Stationary Phase

Sholokhova A.Y., Matyushin D.D.

Journal of Separation Science scimago Q2 wos Q2 ,

2024-11-04, citations by CoLab: 0 , Abstract

Bridging knowledge gaps in human chemical exposure via drinking water with non-target screening

Ciccarelli D., Samanipour S., Rapp-Wright H., Bieber S., Letzel T., O’Brien J.W., Marczylo T., Gant T.W., Vineis P., Barron L.P.

Critical Reviews in Environmental Science and Technology scimago Q1 wos Q1 ,

2024-09-02, citations by CoLab: 3

Validation of the identification reliability of known and assumed UDMH transformation products using gas chromatographic retention indices and machine learning

Karnaeva A.E., Sholokhova A.Y.

Chemosphere scimago Q1 wos Q1 ,

2024-08-01, citations by CoLab: 3 , Abstract

Intelligent Consensus Predictions of the Retention Index of Flavor and Fragrance Compounds Using 2D Descriptors

Bera D., Kumar A., Roy J., Roy K.

Chromatographia scimago Q3 wos Q4 ,

2024-07-18, citations by CoLab: 0 , Abstract

Semi-volatile organic compounds in a museum in China: A non-targeted screening approach

Song Z., Nian L., Shi M., Ren X., Tang M., Shi A., Han Y., Liu M., Wang L., Zhang Y., Xu Y., Feng X.

Science China Technological Sciences scimago Q1 wos Q2 ,

2024-07-12, citations by CoLab: 0 , Abstract

Unveiling Hidden Insights in Gas Chromatography Data Analysis with Generative Adversarial Networks

Yoon N., Jung W., Kim H.

Chemosensors scimago Q2 wos Q1 Open Access

2024-07-07, citations by CoLab: 1 , PDF, Abstract

Chemometrics in Quality Control of Traditional Chinese Medicines

He M., Li S.

2024-04-26, citations by CoLab: 1 , Abstract

GCMSFormer: A Fully Automatic Method for the Resolution of Overlapping Peaks in Gas Chromatography–Mass Spectrometry

Guo Z., Fan Y., Yu C., Lu H., Zhang Z.

Analytical Chemistry scimago Q1 wos Q1 ,

2024-04-01, citations by CoLab: 2

A general procedure for finding potentially erroneous entries in the database of retention indices

Khrisanfov M.D., Matyushin D.D., Samokhin A.S.

Analytica Chimica Acta scimago Q1 wos Q1 ,

2024-04-01, citations by CoLab: 6 , Abstract

AIRI: Predicting Retention Indices and Their Uncertainties Using Artificial Intelligence

Geer L.Y., Stein S.E., Mallard W.G., Slotta D.J.

Journal of Chemical Information and Modeling scimago Q1 wos Q1 ,

2024-01-17, citations by CoLab: 7

An Integrated Workflow Assisted by In Silico Predictions To Expand the List of Priority Polycyclic Aromatic Compounds

Li T., Su W., Zhong L., Liang W., Feng X., Zhu B., Ruan T., Jiang G.

Environmental Science & Technology scimago Q1 wos Q1 ,

2023-11-27, citations by CoLab: 14

An Integrated Non-Targeted and Targeted Analysis Approach for Identification of Semi-Volatile Organic Compounds in Indoor Dust

Song Z., Shi M., Ren X., Wang L., Wu Y., Fan Y., Zhang Y., Xu Y.

Journal of Hazardous Materials scimago Q1 wos Q1 ,

2023-10-01, citations by CoLab: 8 , Abstract

	1 2 3
Chemometrics and Intelligent Laboratory Systems	Chemometrics and Intelligent Laboratory Systems, 3, 9.09% Chemometrics and Intelligent Laboratory Systems 3 publications, 9.09%
Journal of Chromatography A	Journal of Chromatography A, 3, 9.09% Journal of Chromatography A 3 publications, 9.09%
Analytica Chimica Acta	Analytica Chimica Acta, 2, 6.06% Analytica Chimica Acta 2 publications, 6.06%
Analytical Chemistry	Analytical Chemistry, 2, 6.06% Analytical Chemistry 2 publications, 6.06%
Artificial Intelligence Review	Artificial Intelligence Review, 1, 3.03% Artificial Intelligence Review 1 publication, 3.03%
Journal of Cheminformatics	Journal of Cheminformatics, 1, 3.03% Journal of Cheminformatics 1 publication, 3.03%
International Journal of Molecular Sciences	International Journal of Molecular Sciences, 1, 3.03% International Journal of Molecular Sciences 1 publication, 3.03%
Biomedicines	Biomedicines, 1, 3.03% Biomedicines 1 publication, 3.03%
Separations	Separations, 1, 3.03% Separations 1 publication, 3.03%
Frontiers in Aging Neuroscience	Frontiers in Aging Neuroscience, 1, 3.03% Frontiers in Aging Neuroscience 1 publication, 3.03%
TrAC - Trends in Analytical Chemistry	TrAC - Trends in Analytical Chemistry, 1, 3.03% TrAC - Trends in Analytical Chemistry 1 publication, 3.03%
Journal of Agricultural and Food Chemistry	Journal of Agricultural and Food Chemistry, 1, 3.03% Journal of Agricultural and Food Chemistry 1 publication, 3.03%
Russian Chemical Bulletin	Russian Chemical Bulletin, 1, 3.03% Russian Chemical Bulletin 1 publication, 3.03%
Journal of Chemical Education	Journal of Chemical Education, 1, 3.03% Journal of Chemical Education 1 publication, 3.03%
Wireless Communications and Mobile Computing	Wireless Communications and Mobile Computing, 1, 3.03% Wireless Communications and Mobile Computing 1 publication, 3.03%
Journal of Hazardous Materials	Journal of Hazardous Materials, 1, 3.03% Journal of Hazardous Materials 1 publication, 3.03%
Environmental Science & Technology	Environmental Science & Technology, 1, 3.03% Environmental Science & Technology 1 publication, 3.03%
Journal of Chemical Information and Modeling	Journal of Chemical Information and Modeling, 1, 3.03% Journal of Chemical Information and Modeling 1 publication, 3.03%
Chemosensors	Chemosensors, 1, 3.03% Chemosensors 1 publication, 3.03%
Chromatographia	Chromatographia, 1, 3.03% Chromatographia 1 publication, 3.03%
Science China Technological Sciences	Science China Technological Sciences, 1, 3.03% Science China Technological Sciences 1 publication, 3.03%
Critical Reviews in Environmental Science and Technology	Critical Reviews in Environmental Science and Technology, 1, 3.03% Critical Reviews in Environmental Science and Technology 1 publication, 3.03%
Chemosphere	Chemosphere, 1, 3.03% Chemosphere 1 publication, 3.03%
Water Research	Water Research, 1, 3.03% Water Research 1 publication, 3.03%
Journal of Separation Science	Journal of Separation Science, 1, 3.03% Journal of Separation Science 1 publication, 3.03%
Russian Journal of Physical Chemistry A	Russian Journal of Physical Chemistry A, 1, 3.03% Russian Journal of Physical Chemistry A 1 publication, 3.03%
	1 2 3

	2 4 6 8 10 12
Elsevier	Elsevier, 12, 36.36% Elsevier 12 publications, 36.36%
Springer Nature	Springer Nature, 6, 18.18% Springer Nature 6 publications, 18.18%
American Chemical Society (ACS)	American Chemical Society (ACS), 6, 18.18% American Chemical Society (ACS) 6 publications, 18.18%
MDPI	MDPI, 4, 12.12% MDPI 4 publications, 12.12%
Frontiers Media S.A.	Frontiers Media S.A., 1, 3.03% Frontiers Media S.A. 1 publication, 3.03%
Hindawi Limited	Hindawi Limited, 1, 3.03% Hindawi Limited 1 publication, 3.03%
Taylor & Francis	Taylor & Francis, 1, 3.03% Taylor & Francis 1 publication, 3.03%
Wiley	Wiley, 1, 3.03% Wiley 1 publication, 3.03%
Pleiades Publishing	Pleiades Publishing, 1, 3.03% Pleiades Publishing 1 publication, 3.03%
	2 4 6 8 10 12

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.

Metrics

Cite this

GOST | RIS | BibTex

Found error?

Publisher

Elsevier

Journal

Analytica Chimica Acta

scimago Q1

wos Q1

SJR

0.998

CiteScore

10.4

Impact factor

5.7

ISSN

00032670 (Print)

18734324 (Electronic)

DeepReI: Deep learning-based gas chromatographic retention index predictor

Top-30

Journals

Publishers

Are you a researcher?