Journal of Physical Chemistry Letters, volume 12, issue 38, pages 9213-9219

Size Doesn't Matter: Predicting Physico- or Biochemical Properties Based on Dozens of Molecules

Kirill Karpov 1, 2
Artem Mitrofanov 1, 2
Vadim Korolev 1, 2
Valery Tkachenko 2
Publication typeJournal Article
Publication date2021-09-16
scimago Q1
SJR1.586
CiteScore9.6
Impact factor4.8
ISSN19487185
Physical and Theoretical Chemistry
General Materials Science
Abstract
The use of machine learning in chemistry has become a common practice. At the same time, despite the success of modern machine learning methods, the lack of data limits their use. Using a transfer learning methodology can help solve this problem. This methodology assumes that a model built on a sufficient amount of data captures general features of the chemical compound structure on which it was trained and that the further reuse of these features on a data set with a lack of data will greatly improve the quality of the new model. In this paper, we develop this approach for small organic molecules, implementing transfer learning with graph convolutional neural networks. The paper shows a significant improvement in the performance of the models for target properties with a lack of data. The effects of the data set composition on the model's quality and the applicability domain of the resulting models are also considered.
Found 
Found 

Top-30

Journals

1
1

Publishers

1
2
3
1
2
3
  • We do not take into account publications without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Share
Cite this
GOST | RIS | BibTex | MLA
Found error?