volume 6 issue 6 pages 428-442

Evaluation guidelines for machine learning tools in the chemical sciences

Andreas Bender 1
Nadine Schneider 2
Marwin Segler 3
W Patrick Walters 4
Ola Engkvist 5, 6
Tiago Rodrigues 7
2
 
Novartis Institutes for BioMedical Research, Novartis Pharma, Novartis Campus, Basel, Switzerland
3
 
Microsoft Research Cambridge, Cambridge, UK
4
 
Relay Therapeutics, Cambridge, USA
5
 
Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
Publication typeJournal Article
Publication date2022-05-24
scimago Q1
wos Q1
SJR13.441
CiteScore59.4
Impact factor51.7
ISSN23973358
General Chemistry
General Chemical Engineering
Abstract
Machine learning (ML) promises to tackle the grand challenges in chemistry and speed up the generation, improvement and/or ordering of research hypotheses. Despite the overarching applicability of ML workflows, one usually finds diverse evaluation study designs. The current heterogeneity in evaluation techniques and metrics leads to difficulty in (or the impossibility of) comparing and assessing the relevance of new algorithms. Ultimately, this may delay the digitalization of chemistry at scale and confuse method developers, experimentalists, reviewers and journal editors. In this Perspective, we critically discuss a set of method development and evaluation guidelines for different types of ML-based publications, emphasizing supervised learning. We provide a diverse collection of examples from various authors and disciplines in chemistry. While taking into account varying accessibility across research groups, our recommendations focus on reporting completeness and standardizing comparisons between tools. We aim to further contribute to improved ML transparency and credibility by suggesting a checklist of retro-/prospective tests and dissecting their importance. We envisage that the wide adoption and continuous update of best practices will encourage an informed use of ML on real-world problems related to the chemical sciences. Studies employing machine-learning (ML) tools in the chemical sciences often report their evaluations in a heterogeneous way. The evaluation guidelines provided in this Perspective should enable more rigorous ML reporting.
Found 
Found 

Top-30

Journals

1
2
3
4
5
6
Nature Communications
6 publications, 5.36%
Chemical Science
5 publications, 4.46%
Angewandte Chemie - International Edition
4 publications, 3.57%
Angewandte Chemie
4 publications, 3.57%
Journal of Chemical Information and Modeling
3 publications, 2.68%
Journal of Cheminformatics
3 publications, 2.68%
Journal of Physical Chemistry C
2 publications, 1.79%
Chemical Research in Toxicology
2 publications, 1.79%
Journal of Physical Chemistry A
2 publications, 1.79%
Chemistry of Materials
2 publications, 1.79%
Drug Discovery Today
2 publications, 1.79%
ACS Catalysis
2 publications, 1.79%
Journal of the American Chemical Society
2 publications, 1.79%
Molecules
1 publication, 0.89%
ACS Central Science
1 publication, 0.89%
Bioorganic and Medicinal Chemistry
1 publication, 0.89%
Journal of Power Sources
1 publication, 0.89%
Artificial Intelligence in the Life Sciences
1 publication, 0.89%
Journal of Chemical Physics
1 publication, 0.89%
Materials Advances
1 publication, 0.89%
Energy Advances
1 publication, 0.89%
Science
1 publication, 0.89%
Frontiers in Drug Discovery
1 publication, 0.89%
Nature Computational Science
1 publication, 0.89%
ACS Omega
1 publication, 0.89%
Information (Switzerland)
1 publication, 0.89%
Machine Learning: Science and Technology
1 publication, 0.89%
Nano-Micro Letters
1 publication, 0.89%
Responsive Materials
1 publication, 0.89%
1
2
3
4
5
6

Publishers

5
10
15
20
25
American Chemical Society (ACS)
23 publications, 20.54%
Elsevier
20 publications, 17.86%
Springer Nature
19 publications, 16.96%
Wiley
17 publications, 15.18%
Royal Society of Chemistry (RSC)
11 publications, 9.82%
Cold Spring Harbor Laboratory
6 publications, 5.36%
MDPI
4 publications, 3.57%
AIP Publishing
1 publication, 0.89%
American Association for the Advancement of Science (AAAS)
1 publication, 0.89%
Frontiers Media S.A.
1 publication, 0.89%
IOP Publishing
1 publication, 0.89%
OOO Zhurnal "Mendeleevskie Soobshcheniya"
1 publication, 0.89%
SAGE
1 publication, 0.89%
Taylor & Francis
1 publication, 0.89%
Environmental Health Perspectives
1 publication, 0.89%
Public Library of Science (PLoS)
1 publication, 0.89%
Institute of Electrical and Electronics Engineers (IEEE)
1 publication, 0.89%
Oxford University Press
1 publication, 0.89%
5
10
15
20
25
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
112
Share
Cite this
GOST |
Cite this
GOST Copy
Bender A. et al. Evaluation guidelines for machine learning tools in the chemical sciences // Nature Reviews Chemistry. 2022. Vol. 6. No. 6. pp. 428-442.
GOST all authors (up to 50) Copy
Bender A., Schneider N., Segler M., Patrick Walters W., Engkvist O., Rodrigues T. Evaluation guidelines for machine learning tools in the chemical sciences // Nature Reviews Chemistry. 2022. Vol. 6. No. 6. pp. 428-442.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1038/s41570-022-00391-9
UR - https://doi.org/10.1038/s41570-022-00391-9
TI - Evaluation guidelines for machine learning tools in the chemical sciences
T2 - Nature Reviews Chemistry
AU - Bender, Andreas
AU - Schneider, Nadine
AU - Segler, Marwin
AU - Patrick Walters, W
AU - Engkvist, Ola
AU - Rodrigues, Tiago
PY - 2022
DA - 2022/05/24
PB - Springer Nature
SP - 428-442
IS - 6
VL - 6
PMID - 37117429
SN - 2397-3358
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2022_Bender,
author = {Andreas Bender and Nadine Schneider and Marwin Segler and W Patrick Walters and Ola Engkvist and Tiago Rodrigues},
title = {Evaluation guidelines for machine learning tools in the chemical sciences},
journal = {Nature Reviews Chemistry},
year = {2022},
volume = {6},
publisher = {Springer Nature},
month = {may},
url = {https://doi.org/10.1038/s41570-022-00391-9},
number = {6},
pages = {428--442},
doi = {10.1038/s41570-022-00391-9}
}
MLA
Cite this
MLA Copy
Bender, Andreas, et al. “Evaluation guidelines for machine learning tools in the chemical sciences.” Nature Reviews Chemistry, vol. 6, no. 6, May. 2022, pp. 428-442. https://doi.org/10.1038/s41570-022-00391-9.