Open Access
Volume 7, issue 1, publication number 46710

Performance of machine-learning scoring functions in structure-based virtual screening

Publication type: Journal Article
Publication date: 2017-04-25
Scimago: Q1
WoS: Q1
SJR: 0.874
CiteScore: 6.7
Impact factor: 3.9
ISSN: 2045-2322
PubMed ID: 28440302
Multidisciplinary
Abstract
Classical scoring functions have reached a plateau in their performance in virtual screening and binding affinity prediction. Recently, machine-learning scoring functions trained on protein-ligand complexes have shown great promise in small tailored studies. They have also raised controversy, specifically concerning model overfitting and applicability to novel targets. Here we provide a new ready-to-use scoring function (RF-Score-VS) trained on 15 426 active and 893 897 inactive molecules docked to a set of 102 targets. We use the full DUD-E data sets along with three docking tools, five classical and three machine-learning scoring functions for model building and performance assessment. Our results show that RF-Score-VS can substantially improve virtual screening performance: the RF-Score-VS top 1% provides a 55.6% hit rate, whereas that of Vina is only 16.2% (for smaller percentages the difference is even more encouraging: the RF-Score-VS top 0.1% achieves an 88.6% hit rate versus 27.5% using Vina). In addition, RF-Score-VS provides much better prediction of measured binding affinity than Vina (Pearson correlation of 0.56 and −0.18, respectively). Lastly, we tested RF-Score-VS on an independent test set from the DEKOIS benchmark and observed comparable results. We provide full data sets to facilitate further research in this area (http://github.com/oddt/rfscorevs) as well as ready-to-use RF-Score-VS (http://github.com/oddt/rfscorevs_binary).
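The two metrics quoted in the abstract, hit rate in the top-ranked fraction and Pearson correlation, can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names `hit_rate_at_top` and `pearson` are hypothetical, the data are toy values, and the sketch assumes that a higher score means a better predicted binder (for energy-like scores where lower is better, the scores would need to be negated first).

```python
import math


def hit_rate_at_top(scores, is_active, fraction):
    """Share of actives among the top-scoring `fraction` of molecules.

    Assumption: a higher score means a better predicted binder.
    """
    if len(scores) != len(is_active):
        raise ValueError("scores and is_active must have the same length")
    if len(scores) == 0:
        raise ValueError("need at least one molecule")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    # Number of molecules selected from the top of the ranking (at least one).
    n_top = max(1, round(fraction * len(scores)))
    # Rank molecules from highest to lowest score.
    ranked = sorted(zip(scores, is_active), key=lambda pair: pair[0], reverse=True)
    hits = sum(1 for _, active in ranked[:n_top] if active)
    return hits / n_top


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Toy data only: ten docked molecules, three of them active.
    toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
    toy_labels = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
    print("hit rate, top 20%:", hit_rate_at_top(toy_scores, toy_labels, 0.2))
    print("hit rate, top 50%:", hit_rate_at_top(toy_scores, toy_labels, 0.5))
    print("Pearson r:", pearson([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
```

With a real screen, `scores` would hold one predicted score per docked molecule and `is_active` the known activity labels; how results are aggregated across targets is a separate choice that this sketch does not cover.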

Top-30

Journals

Journal of Chemical Information and Modeling
41 publications, 13.44%
Expert Opinion on Drug Discovery
12 publications, 3.93%
Molecules
10 publications, 3.28%
Briefings in Bioinformatics
10 publications, 3.28%
Wiley Interdisciplinary Reviews: Computational Molecular Science
8 publications, 2.62%
International Journal of Molecular Sciences
6 publications, 1.97%
Journal of Cheminformatics
6 publications, 1.97%
Drug Discovery Today
6 publications, 1.97%
Computational and Structural Biotechnology Journal
5 publications, 1.64%
ACS Omega
5 publications, 1.64%
Bioinformatics
5 publications, 1.64%
Frontiers in Pharmacology
4 publications, 1.31%
Journal of Computer-Aided Molecular Design
4 publications, 1.31%
Molecular Diversity
4 publications, 1.31%
PLoS ONE
4 publications, 1.31%
Journal of Biomolecular Structure and Dynamics
4 publications, 1.31%
Chemical Reviews
3 publications, 0.98%
Methods in Molecular Biology
3 publications, 0.98%
Mathematical Biology and Bioinformatics
3 publications, 0.98%
Current Topics in Medicinal Chemistry
2 publications, 0.66%
Current Medicinal Chemistry
2 publications, 0.66%
Current Issues in Molecular Biology
2 publications, 0.66%
Frontiers in Chemistry
2 publications, 0.66%
Scientific Reports
2 publications, 0.66%
Nature Machine Intelligence
2 publications, 0.66%
Molecular Biotechnology
2 publications, 0.66%
Journal of Advanced Research
2 publications, 0.66%
Journal of Molecular Graphics and Modelling
2 publications, 0.66%
Medical Oncology
2 publications, 0.66%

Publishers

American Chemical Society (ACS)
57 publications, 18.69%
Elsevier
54 publications, 17.7%
Springer Nature
38 publications, 12.46%
MDPI
26 publications, 8.52%
Wiley
23 publications, 7.54%
Taylor & Francis
20 publications, 6.56%
Cold Spring Harbor Laboratory
20 publications, 6.56%
Oxford University Press
16 publications, 5.25%
Bentham Science Publishers Ltd.
7 publications, 2.3%
Frontiers Media S.A.
7 publications, 2.3%
Public Library of Science (PLoS)
5 publications, 1.64%
Institute of Electrical and Electronics Engineers (IEEE)
4 publications, 1.31%
IntechOpen
4 publications, 1.31%
Institute of Mathematical Problems of Biology of RAS (IMPB RAS)
3 publications, 0.98%
IGI Global
2 publications, 0.66%
Royal Society of Chemistry (RSC)
2 publications, 0.66%
Proceedings of the National Academy of Sciences (PNAS)
2 publications, 0.66%
eLife Sciences Publications
2 publications, 0.66%
PeerJ
1 publication, 0.33%
World Scientific
1 publication, 0.33%
SAGE
1 publication, 0.33%
Hindawi Limited
1 publication, 0.33%
National Institute of Infectious Diseases
1 publication, 0.33%
Autonomous Non-profit Organization Editorial Board of the journal Uspekhi Khimii
1 publication, 0.33%
International Press of Boston
1 publication, 0.33%
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Metrics
305
GOST
Wójcikowski M., Ballester P. J., Siedlecki P. Performance of machine-learning scoring functions in structure-based virtual screening // Scientific Reports. 2017. Vol. 7. No. 1. 46710
RIS
TY - JOUR
DO - 10.1038/srep46710
UR - https://doi.org/10.1038/srep46710
TI - Performance of machine-learning scoring functions in structure-based virtual screening
T2 - Scientific Reports
AU - Wójcikowski, Maciej
AU - Ballester, Pedro J
AU - Siedlecki, Pawel
PY - 2017
DA - 2017/04/25
PB - Springer Nature
IS - 1
VL - 7
PMID - 28440302
SN - 2045-2322
ER -
BibTeX
@article{2017_Wójcikowski,
author = {Maciej Wójcikowski and Pedro J Ballester and Pawel Siedlecki},
title = {Performance of machine-learning scoring functions in structure-based virtual screening},
journal = {Scientific Reports},
year = {2017},
volume = {7},
publisher = {Springer Nature},
month = {apr},
url = {https://doi.org/10.1038/srep46710},
number = {1},
pages = {46710},
doi = {10.1038/srep46710}
}