Journal of Chemical Information and Modeling, volume 47, issue 2, pages 488-508

Evaluating Virtual Screening Methods: Good and Bad Metrics for the “Early Recognition” Problem

Jean-Francois Truchon ¹

Christopher I. Bayly ¹

¹ Department of Medicinal Chemistry, Merck Frosst Centre for Therapeutic Research, 16711 TransCanada Highway, Kirkland, Québec, Canada H9H 3L1

Publication type: Journal Article

Publication date: 2007-02-09

American Chemical Society (ACS)

scimago Q1

wos Q1

SJR: 1.467

CiteScore: 9.8

Impact factor: 5.3

ISSN: 15499596, 1549960X

DOI: 10.1021/ci600426e

PubMed ID: 17288412

General Chemistry

Computer Science Applications

General Chemical Engineering

Library and Information Sciences

Abstract

Many metrics are currently used to evaluate the performance of ranking methods in virtual screening (VS), for instance, the area under the receiver operating characteristic curve (ROC), the area under the accumulation curve (AUAC), the average rank of actives, the enrichment factor (EF), and the robust initial enhancement (RIE) proposed by Sheridan et al. In this work, we show that the ROC, the AUAC, and the average rank metrics have the same inappropriate behaviors that make them poor metrics for comparing VS methods whose purpose is to rank actives early in an ordered list (the "early recognition problem"). In doing so, we derive mathematical formulas that relate those metrics together. Moreover, we show that the EF metric is not sensitive to ranking performance before and after the cutoff. Instead, we formally generalize the ROC metric to the early recognition problem which leads us to propose a novel metric called the Boltzmann-enhanced discrimination of receiver operating characteristic that turns out to contain the discrimination power of the RIE metric but incorporates the statistical significance from ROC and its well-behaved boundaries. Finally, two major sources of errors, namely, the statistical error and the "saturation effects", are examined. This leads to practical recommendations for the number of actives, the number of inactives, and the "early recognition" importance parameter that one should use when comparing ranking methods. Although this work is applied specifically to VS, it is general and can be used to analyze any method that needs to segregate actives toward the front of a rank-ordered list.
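
The contrast drawn in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the two 100-compound toy rankings, and the value alpha = 20 are assumptions chosen for illustration. RIE is written in its usual rank-based form (mean exponential weight of the actives divided by the expected value for a random ranking), and BEDROC is taken as RIE rescaled between its smallest and largest attainable values so that it lies in [0, 1].

```python
import math


def active_ranks(labels):
    """1-based ranks of the actives in a best-first list (1 = active, 0 = inactive)."""
    return [i + 1 for i, is_active in enumerate(labels) if is_active]


def roc_auc(labels):
    """Area under the ROC curve, computed from the sum of the active ranks."""
    ranks = active_ranks(labels)
    n, N = len(ranks), len(labels)
    # Total number of inactives ranked ahead of each active, summed over actives.
    inactives_ahead = sum(ranks) - n * (n + 1) / 2
    return 1.0 - inactives_ahead / (n * (N - n))


def enrichment_factor(labels, fraction):
    """Hit rate in the top `fraction` of the list divided by the overall hit rate."""
    N, n = len(labels), sum(labels)
    cutoff = max(1, int(round(fraction * N)))
    return (sum(labels[:cutoff]) / cutoff) / (n / N)


def rie(labels, alpha):
    """Robust initial enhancement: exponentially weighted ranks vs. random expectation."""
    ranks = active_ranks(labels)
    n, N = len(ranks), len(labels)
    observed = sum(math.exp(-alpha * r / N) for r in ranks) / n
    random_mean = (1 - math.exp(-alpha)) / (N * (math.exp(alpha / N) - 1))
    return observed / random_mean


def bedroc(labels, alpha=20.0):
    """BEDROC as RIE rescaled to [0, 1]; alpha = 20 is an assumed illustrative value."""
    n, N = sum(labels), len(labels)
    ra = n / N  # ratio of actives in the list
    rie_max = (1 - math.exp(-alpha * ra)) / (ra * (1 - math.exp(-alpha)))
    rie_min = (1 - math.exp(alpha * ra)) / (ra * (1 - math.exp(alpha)))
    return (rie(labels, alpha) - rie_min) / (rie_max - rie_min)


# Two hypothetical rankings of 100 compounds containing 10 actives each.
# A: half of the actives at the very top, half at the very bottom.
# B: all actives clustered in the middle of the list (ranks 46 to 55).
ranking_a = [1] * 5 + [0] * 90 + [1] * 5
ranking_b = [0] * 45 + [1] * 10 + [0] * 45

for name, ranking in (("A", ranking_a), ("B", ranking_b)):
    print(name,
          "ROC:", roc_auc(ranking),
          "EF(10%):", enrichment_factor(ranking, 0.1),
          "BEDROC:", round(bedroc(ranking), 3))
```

Both toy rankings have the same sum of active ranks and therefore the same ROC value of 0.5, yet ranking A places half of its actives at the very top of the list. EF at a 10% cutoff and BEDROC separate the two cases, while ROC does not. The sketch also shows the EF limitation mentioned above: reordering compounds within the top 10%, or within the remaining 90%, leaves EF unchanged.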

Top-30

Journals

Journal of Chemical Information and Modeling	139 publications, 19.83%
Journal of Cheminformatics	33 publications, 4.71%
Journal of Computer-Aided Molecular Design	32 publications, 4.56%
Journal of Biomolecular Structure and Dynamics	29 publications, 4.14%
Molecules	24 publications, 3.42%
Molecular Informatics	17 publications, 2.43%
PLoS ONE	15 publications, 2.14%
Methods in Molecular Biology	14 publications, 2%
International Journal of Molecular Sciences	13 publications, 1.85%
Journal of Medicinal Chemistry	12 publications, 1.71%
Journal of Molecular Graphics and Modelling	10 publications, 1.43%
Scientific Reports	9 publications, 1.28%
European Journal of Medicinal Chemistry	8 publications, 1.14%
ACS Omega	7 publications, 1%
Frontiers in Pharmacology	6 publications, 0.86%
Frontiers in Chemistry	6 publications, 0.86%
Wiley Interdisciplinary Reviews: Computational Molecular Science	6 publications, 0.86%
Journal of Computational Chemistry	6 publications, 0.86%
Journal of Chemical Theory and Computation	6 publications, 0.86%
Briefings in Bioinformatics	6 publications, 0.86%
Chemical Science	5 publications, 0.71%
Future Medicinal Chemistry	5 publications, 0.71%
Molecular Diversity	5 publications, 0.71%
Journal of Molecular Modeling	5 publications, 0.71%
Nature Communications	5 publications, 0.71%
Bioorganic and Medicinal Chemistry	5 publications, 0.71%
Chemical Biology and Drug Design	5 publications, 0.71%
Lecture Notes in Computer Science	5 publications, 0.71%
Bioinformatics	5 publications, 0.71%

Publishers

American Chemical Society (ACS)	176 publications, 25.11%
Springer Nature	154 publications, 21.97%
Elsevier	75 publications, 10.7%
Wiley	75 publications, 10.7%
MDPI	47 publications, 6.7%
Taylor & Francis	44 publications, 6.28%
Cold Spring Harbor Laboratory	30 publications, 4.28%
Royal Society of Chemistry (RSC)	16 publications, 2.28%
Public Library of Science (PLoS)	16 publications, 2.28%
Frontiers Media S.A.	15 publications, 2.14%
Oxford University Press	12 publications, 1.71%
Bentham Science Publishers Ltd.	7 publications, 1%
Institute of Electrical and Electronics Engineers (IEEE)	5 publications, 0.71%
SAGE	3 publications, 0.43%
Hindawi Limited	3 publications, 0.43%
American Society for Biochemistry and Molecular Biology	2 publications, 0.29%
IGI Global	2 publications, 0.29%
American Society for Pharmacology and Experimental Therapeutics	1 publication, 0.14%
World Scientific	1 publication, 0.14%
Walter de Gruyter	1 publication, 0.14%
Scientific Research Publishing	1 publication, 0.14%
American Society for Microbiology	1 publication, 0.14%
Indian Drug Manufacturers' Association	1 publication, 0.14%
Palladin Institute of Biochemistry of the NASU	1 publication, 0.14%
Georg Thieme Verlag KG	1 publication, 0.14%
Autonomous Non-profit Organization Editorial Board of the journal Uspekhi Khimii	1 publication, 0.14%
Brieflands	1 publication, 0.14%

We do not take into account publications without a DOI.
Statistics recalculated weekly.

Metrics

704

Cite this

GOST

Truchon J., Bayly C. I. Evaluating Virtual Screening Methods: Good and Bad Metrics for the “Early Recognition” Problem // Journal of Chemical Information and Modeling. 2007. Vol. 47. No. 2. pp. 488-508.

RIS

TY - JOUR
DO - 10.1021/ci600426e
UR - https://doi.org/10.1021/ci600426e
TI - Evaluating Virtual Screening Methods: Good and Bad Metrics for the “Early Recognition” Problem
T2 - Journal of Chemical Information and Modeling
AU - Truchon, Jean-Francois
AU - Bayly, Christopher I.
PY - 2007
DA - 2007/02/09
PB - American Chemical Society (ACS)
SP - 488
EP - 508
IS - 2
VL - 47
PMID - 17288412
SN - 1549-9596
SN - 1549-960X
ER -

BibTeX

@article{2007_Truchon,
  author = {Jean-Francois Truchon and Christopher I. Bayly},
  title = {Evaluating Virtual Screening Methods: Good and Bad Metrics for the “Early Recognition” Problem},
  journal = {Journal of Chemical Information and Modeling},
  year = {2007},
  volume = {47},
  publisher = {American Chemical Society (ACS)},
  month = {feb},
  url = {https://doi.org/10.1021/ci600426e},
  number = {2},
  pages = {488--508},
  doi = {10.1021/ci600426e}
}

MLA

Truchon, Jean-Francois, and Christopher I. Bayly. “Evaluating Virtual Screening Methods: Good and Bad Metrics for the ‘Early Recognition’ Problem.” Journal of Chemical Information and Modeling, vol. 47, no. 2, Feb. 2007, pp. 488-508. https://doi.org/10.1021/ci600426e.

ISSN: 15499596 (Print), 1549960X (Electronic)