Journal of Chemical Information and Modeling, volume 47, issue 2, pages 488-508

Evaluating Virtual Screening Methods: Good and Bad Metrics for the “Early Recognition” Problem

Jean-Francois Truchon ¹

Christopher I. Bayly ¹

Department of Medicinal Chemistry, Merck Frosst Centre for Therapeutic Research, 16711 TransCanada Highway, Kirkland, Québec, Canada H9H 3L1

Publication type: Journal Article

Publication date: 2007-02-09

American Chemical Society (ACS)

Journal: Journal of Chemical Information and Modeling

scimago Q1

SJR: 1.396

CiteScore: 9.8

Impact factor: 5.6

ISSN: 15499596, 1549960X

DOI: 10.1021/ci600426e

PubMed ID: 17288412

General Chemistry

Computer Science Applications

General Chemical Engineering

Library and Information Sciences

Abstract

Many metrics are currently used to evaluate the performance of ranking methods in virtual screening (VS), for instance, the area under the receiver operating characteristic curve (ROC), the area under the accumulation curve (AUAC), the average rank of actives, the enrichment factor (EF), and the robust initial enhancement (RIE) proposed by Sheridan et al. In this work, we show that the ROC, the AUAC, and the average rank metrics have the same inappropriate behaviors that make them poor metrics for comparing VS methods whose purpose is to rank actives early in an ordered list (the "early recognition problem"). In doing so, we derive mathematical formulas that relate these metrics to one another. Moreover, we show that the EF metric is not sensitive to ranking performance before and after the cutoff. Instead, we formally generalize the ROC metric to the early recognition problem, which leads us to propose a novel metric called the Boltzmann-enhanced discrimination of receiver operating characteristic (BEDROC). This metric turns out to contain the discrimination power of the RIE metric while incorporating the statistical significance from ROC and its well-behaved boundaries. Finally, two major sources of errors, namely, the statistical error and the "saturation effects", are examined. This leads to practical recommendations for the number of actives, the number of inactives, and the "early recognition" importance parameter that one should use when comparing ranking methods. Although this work is applied specifically to VS, it is general and can be used to analyze any method that needs to segregate actives toward the front of a rank-ordered list.
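To make the metrics named in the abstract concrete, the following is a minimal Python sketch, not the authors' code. It assumes a ranked list of `n_total` compounds with no ties, described only by the 1-based ranks of the actives, and it assumes at least one active and one inactive. The function names and the value `alpha = 20.0` are illustrative choices. The Boltzmann-enhanced metric is written here as RIE rescaled between its smallest and largest attainable values, which is the form that gives it bounds of 0 and 1.

```python
import math


def roc_auc(active_ranks, n_total):
    """Area under the ROC curve from 1-based ranks of actives (no ties)."""
    n = len(active_ranks)
    # Number of (active, inactive) pairs in which the inactive is ranked first.
    inactives_ahead = sum(active_ranks) - n * (n + 1) / 2
    return 1.0 - inactives_ahead / (n * (n_total - n))


def enrichment_factor(active_ranks, n_total, fraction):
    """EF: fraction of actives found in the top `fraction`, divided by `fraction`."""
    cutoff = fraction * n_total
    hits = sum(1 for r in active_ranks if r <= cutoff)
    return (hits / len(active_ranks)) / fraction


def rie(active_ranks, n_total, alpha):
    """Robust initial enhancement: exponentially weighted ranks vs. a random ranking."""
    n = len(active_ranks)
    observed = sum(math.exp(-alpha * r / n_total) for r in active_ranks) / n
    # Mean of exp(-alpha * k / N) over all ranks k = 1..N (one uniformly placed active).
    expected = (1 - math.exp(-alpha)) / (n_total * (math.exp(alpha / n_total) - 1))
    return observed / expected


def bedroc(active_ranks, n_total, alpha):
    """RIE rescaled to [0, 1] using its minimum and maximum attainable values."""
    ra = len(active_ranks) / n_total  # ratio of actives
    rie_max = (1 - math.exp(-alpha * ra)) / (ra * (1 - math.exp(-alpha)))
    rie_min = (1 - math.exp(alpha * ra)) / (ra * (1 - math.exp(alpha)))
    return (rie(active_ranks, n_total, alpha) - rie_min) / (rie_max - rie_min)


if __name__ == "__main__":
    n_total, alpha = 100, 20.0  # illustrative values only
    split = [1, 100]    # one active found immediately, one found last
    middle = [50, 51]   # both actives found halfway down the list
    for name, ranks in (("split", split), ("middle", middle)):
        print(name,
              "ROC:", roc_auc(ranks, n_total),
              "EF(10%):", enrichment_factor(ranks, n_total, 0.10),
              "BEDROC:", bedroc(ranks, n_total, alpha))
```

In this toy comparison the two rankings have the same sum of active ranks, so their ROC values are identical, while the exponentially weighted score is higher for the ranking that places an active at the top. EF at a given cutoff likewise does not change if actives are reordered within the top fraction, which is the insensitivity the abstract describes.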

Journal	Publications	Share
Journal of Chemical Information and Modeling	134	20.43%
Journal of Cheminformatics	31	4.73%
Journal of Computer-Aided Molecular Design	31	4.73%
Journal of Biomolecular Structure and Dynamics	29	4.42%
Molecules	22	3.35%
Molecular Informatics	17	2.59%
PLoS ONE	14	2.13%
International Journal of Molecular Sciences	13	1.98%
Methods in Molecular Biology	13	1.98%
Journal of Medicinal Chemistry	11	1.68%
Scientific Reports	9	1.37%
Journal of Molecular Graphics and Modelling	9	1.37%
European Journal of Medicinal Chemistry	8	1.22%
Frontiers in Chemistry	6	0.91%
Wiley Interdisciplinary Reviews: Computational Molecular Science	6	0.91%
Journal of Computational Chemistry	6	0.91%
ACS Omega	6	0.91%
Briefings in Bioinformatics	6	0.91%
Future Medicinal Chemistry	5	0.76%
Frontiers in Pharmacology	5	0.76%
Molecular Diversity	5	0.76%
Journal of Molecular Modeling	5	0.76%
Bioorganic and Medicinal Chemistry	5	0.76%
Chemical Biology and Drug Design	5	0.76%
Journal of Chemical Theory and Computation	5	0.76%
Lecture Notes in Computer Science	5	0.76%
Bioinformatics	5	0.76%
Chemical Science	4	0.61%
BMC Bioinformatics	4	0.61%

Publisher	Publications	Share
American Chemical Society (ACS)	165	25.15%
Springer Nature	142	21.65%
Wiley	73	11.13%
Elsevier	70	10.67%
Taylor & Francis	44	6.71%
MDPI	44	6.71%
Cold Spring Harbor Laboratory	28	4.27%
Public Library of Science (PLoS)	15	2.29%
Frontiers Media S.A.	13	1.98%
Oxford University Press	12	1.83%
Royal Society of Chemistry (RSC)	11	1.68%
Bentham Science Publishers Ltd.	7	1.07%
Institute of Electrical and Electronics Engineers (IEEE)	5	0.76%
SAGE	3	0.46%
Hindawi Limited	3	0.46%
American Society for Biochemistry and Molecular Biology	2	0.3%
American Society for Pharmacology and Experimental Therapeutics	1	0.15%
World Scientific	1	0.15%
Walter de Gruyter	1	0.15%
American Society for Microbiology	1	0.15%
Indian Drug Manufacturers' Association	1	0.15%
IGI Global	1	0.15%
Palladin Institute of Biochemistry of the NASU	1	0.15%
Georg Thieme Verlag KG	1	0.15%
Autonomous Non-profit Organization Editorial Board of the journal Uspekhi Khimii	1	0.15%