Open Access

Chemistry - Methods

, том 2 , издание 1 , номер публикации e202100069

Image2SMILES: Transformer‐Based Molecular Optical Recognition Engine**

Тип публикации: Journal Article

Дата публикации: 2022-01-11

Wiley

Chemistry - Methods

SCImago Q1

Tоп 10% SCImago

WOS Q2

SJR: 0.856

CiteScore: 4.2

Impact factor: 3.7

ISSN: 26289725

DOI: 10.1002/cmtd.202100069

Скопировать DOI

Materials Science (miscellaneous)

Краткое описание

The rise of deep learning in various scientific and technology areas promotes the development of AI‐based tools for information retrieval. Optical recognition of organic structures is a key part of the automated extraction of chemical information. However, this is a challenging task because there is a large variety of representation styles. In this research, we present a Transformer‐based artificial neural network to convert images of organic structures to molecular structures. To train the model, we created a comprehensive data generator that stochastically simulates various drawing styles, functional groups, functional group placeholders (R‐groups), and visual contamination. We demonstrate that the Transformer‐based architecture can gather chemical insights from our generator with almost absolute confidence. That means that, with Transformer, one can fully concentrate on data simulation to build a good recognition model. A web demo of our optical recognition engine is available online at Syntelly platform, and the code for dataset generation is available on GitHub.

Для доступа к списку цитирований публикации необходимо авторизоваться.

Войти с ORCID

Для доступа к списку профилей, цитирующих публикацию, необходимо авторизоваться.

Войти с ORCID

Топ-30

Журналы

	1 2 3 4 5 6
Journal of Cheminformatics	Journal of Cheminformatics, 6, 13.64% Journal of Cheminformatics 6 публикаций, 13.64%
Journal of Chemical Information and Modeling	Journal of Chemical Information and Modeling, 4, 9.09% Journal of Chemical Information and Modeling 4 публикации, 9.09%
Briefings in Bioinformatics	Briefings in Bioinformatics, 2, 4.55% Briefings in Bioinformatics 2 публикации, 4.55%
Journal of Physical Chemistry Letters	Journal of Physical Chemistry Letters, 2, 4.55% Journal of Physical Chemistry Letters 2 публикации, 4.55%
Chemical Reviews	Chemical Reviews, 2, 4.55% Chemical Reviews 2 публикации, 4.55%
Molecular Informatics	Molecular Informatics, 1, 2.27% Molecular Informatics 1 публикация, 2.27%
Lecture Notes in Computer Science	Lecture Notes in Computer Science, 1, 2.27% Lecture Notes in Computer Science 1 публикация, 2.27%
28th International Conference on Intelligent User Interfaces	28th International Conference on Intelligent User Interfaces, 1, 2.27% 28th International Conference on Intelligent User Interfaces 1 публикация, 2.27%
npj Computational Materials	npj Computational Materials, 1, 2.27% npj Computational Materials 1 публикация, 2.27%
Nature Communications	Nature Communications, 1, 2.27% Nature Communications 1 публикация, 2.27%
bioRxiv	bioRxiv, 1, 2.27% bioRxiv 1 публикация, 2.27%
Macromolecules	Macromolecules, 1, 2.27% Macromolecules 1 публикация, 2.27%
Energy	Energy, 1, 2.27% Energy 1 публикация, 2.27%
IEEE International Conference on Computer Vision (ICCV)	IEEE International Conference on Computer Vision (ICCV), 1, 2.27% IEEE International Conference on Computer Vision (ICCV) 1 публикация, 2.27%
RSC Advances	RSC Advances, 1, 2.27% RSC Advances 1 публикация, 2.27%
Complex & Intelligent Systems	Complex & Intelligent Systems, 1, 2.27% Complex & Intelligent Systems 1 публикация, 2.27%
Scientific Reports	Scientific Reports, 1, 2.27% Scientific Reports 1 публикация, 2.27%
Journal of Pharmaceutical Analysis	Journal of Pharmaceutical Analysis, 1, 2.27% Journal of Pharmaceutical Analysis 1 публикация, 2.27%
Nature Machine Intelligence	Nature Machine Intelligence, 1, 2.27% Nature Machine Intelligence 1 публикация, 2.27%
IEEE Workshop on Applications of Computer Vision (WACV)	IEEE Workshop on Applications of Computer Vision (WACV), 1, 2.27% IEEE Workshop on Applications of Computer Vision (WACV) 1 публикация, 2.27%
Journal of Supercomputing	Journal of Supercomputing, 1, 2.27% Journal of Supercomputing 1 публикация, 2.27%
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1, 2.27% IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 1 публикация, 2.27%
Chemical Society Reviews	Chemical Society Reviews, 1, 2.27% Chemical Society Reviews 1 публикация, 2.27%
Environmental Science and Technology Letters	Environmental Science and Technology Letters, 1, 2.27% Environmental Science and Technology Letters 1 публикация, 2.27%
Plants	Plants, 1, 2.27% Plants 1 публикация, 2.27%
Computational and Structural Biotechnology Journal	Computational and Structural Biotechnology Journal, 1, 2.27% Computational and Structural Biotechnology Journal 1 публикация, 2.27%
ChemistryOpen	ChemistryOpen, 1, 2.27% ChemistryOpen 1 публикация, 2.27%
Annual Review of Chemical and Biomolecular Engineering	Annual Review of Chemical and Biomolecular Engineering, 1, 2.27% Annual Review of Chemical and Biomolecular Engineering 1 публикация, 2.27%
International Journal of Computer Vision	International Journal of Computer Vision, 1, 2.27% International Journal of Computer Vision 1 публикация, 2.27%
	1 2 3 4 5 6

Издатели

	2 4 6 8 10 12 14
Springer Nature	Springer Nature, 14, 31.82% Springer Nature 14 публикаций, 31.82%
American Chemical Society (ACS)	American Chemical Society (ACS), 10, 22.73% American Chemical Society (ACS) 10 публикаций, 22.73%
Institute of Electrical and Electronics Engineers (IEEE)	Institute of Electrical and Electronics Engineers (IEEE), 6, 13.64% Institute of Electrical and Electronics Engineers (IEEE) 6 публикаций, 13.64%
Elsevier	Elsevier, 3, 6.82% Elsevier 3 публикации, 6.82%
Oxford University Press	Oxford University Press, 2, 4.55% Oxford University Press 2 публикации, 4.55%
Wiley	Wiley, 2, 4.55% Wiley 2 публикации, 4.55%
Royal Society of Chemistry (RSC)	Royal Society of Chemistry (RSC), 2, 4.55% Royal Society of Chemistry (RSC) 2 публикации, 4.55%
Association for Computing Machinery (ACM)	Association for Computing Machinery (ACM), 1, 2.27% Association for Computing Machinery (ACM) 1 публикация, 2.27%
openRxiv	openRxiv, 1, 2.27% openRxiv 1 публикация, 2.27%
MDPI	MDPI, 1, 2.27% MDPI 1 публикация, 2.27%
Annual Reviews	Annual Reviews, 1, 2.27% Annual Reviews 1 публикация, 2.27%
	2 4 6 8 10 12 14

Мы не учитываем публикации, у которых нет DOI.
Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.

Войти с ORCID

PDF

Метрики

Цитировать

ГОСТ |

Цитировать

ГОСТ Скопировать

Khokhlov I. et al. Image2SMILES: Transformer‐Based Molecular Optical Recognition Engine** // Chemistry - Methods. 2022. Vol. 2. No. 1. e202100069

ГОСТ со всеми авторами (до 50) Скопировать

Khokhlov I., Krasnov L., Fedorov M. V., Sosnin S. Image2SMILES: Transformer‐Based Molecular Optical Recognition Engine** // Chemistry - Methods. 2022. Vol. 2. No. 1. e202100069

RIS |

Цитировать

RIS Скопировать

TY - JOUR

DO - 10.1002/cmtd.202100069

UR - https://chemistry-europe.onlinelibrary.wiley.com/doi/10.1002/cmtd.202100069

TI - Image2SMILES: Transformer‐Based Molecular Optical Recognition Engine**

T2 - Chemistry - Methods

AU - Khokhlov, Ivan

AU - Krasnov, Lev

AU - Fedorov, Maxim V.

AU - Sosnin, Sergey

PY - 2022

DA - 2022/01/11

PB - Wiley

IS - 1

VL - 2

SN - 2628-9725

ER -

BibTex

Цитировать

BibTex (до 50 авторов) Скопировать

@article{2022_Khokhlov,

author = {Ivan Khokhlov and Lev Krasnov and Maxim V. Fedorov and Sergey Sosnin},

title = {Image2SMILES: Transformer‐Based Molecular Optical Recognition Engine**},

journal = {Chemistry - Methods},

year = {2022},

volume = {2},

publisher = {Wiley},

month = {jan},

url = {https://chemistry-europe.onlinelibrary.wiley.com/doi/10.1002/cmtd.202100069},

number = {1},

pages = {e202100069},

doi = {10.1002/cmtd.202100069}

}

Издатель

Wiley

Журнал

Chemistry - Methods

SCImago Q1

Tоп 10% SCImago

WOS Q2

SJR

0.856

CiteScore

4.2

Impact factor

3.7

ISSN

26289725 (Print, Electronic)

Профили

Ошибка в публикации?

Новости

Нейросеть научилась превращать изображения химических молекул в SMILES-строки