Open Access
Open access
Bioinformatics, volume 26, issue 9, pages 1169-1175

A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking

Publication typeJournal Article
Publication date2010-03-17
Journal: Bioinformatics
Quartile SCImago
Q1
Quartile WOS
Q1
Impact factor5.8
ISSN13674803, 13674811, 14602059
Biochemistry
Computer Science Applications
Molecular Biology
Statistics and Probability
Computational Mathematics
Computational Theory and Mathematics
Abstract
Accurately predicting the binding affinities of large sets of diverse protein-ligand complexes is an extremely challenging task. The scoring functions that attempt such computational prediction are essential for analysing the outputs of molecular docking, which in turn is an important technique for drug discovery, chemical biology and structural biology. Each scoring function assumes a predetermined theory-inspired functional form for the relationship between the variables that characterize the complex, which also include parameters fitted to experimental or simulation data and its predicted binding affinity. The inherent problem of this rigid approach is that it leads to poor predictivity for those complexes that do not conform to the modelling assumptions. Moreover, resampling strategies, such as cross-validation or bootstrapping, are still not systematically used to guard against the overfitting of calibration data in parameter estimation for scoring functions.We propose a novel scoring function (RF-Score) that circumvents the need for problematic modelling assumptions via non-parametric machine learning. In particular, Random Forest was used to implicitly capture binding effects that are hard to model explicitly. RF-Score is compared with the state of the art on the demanding PDBbind benchmark. Results show that RF-Score is a very competitive scoring function. Importantly, RF-Score's performance was shown to improve dramatically with training set size and hence the future availability of more high-quality structural and interaction data is expected to lead to improved versions of RF-Score.pedro.ballester@ebi.ac.uk; jbom@st-andrews.ac.ukSupplementary data are available at Bioinformatics online.

Top-30

Journals

10
20
30
40
50
60
70
80
90
100
Journal of Chemical Information and Modeling
91 publications, 14.42%
Briefings in Bioinformatics
21 publications, 3.33%
Journal of Cheminformatics
14 publications, 2.22%
Bioinformatics
13 publications, 2.06%
Lecture Notes in Computer Science
13 publications, 2.06%
Molecules
12 publications, 1.9%
BMC Bioinformatics
12 publications, 1.9%
Journal of Computer-Aided Molecular Design
11 publications, 1.74%
ACS Omega
10 publications, 1.58%
Wiley Interdisciplinary Reviews: Computational Molecular Science
10 publications, 1.58%
International Journal of Molecular Sciences
9 publications, 1.43%
Drug Discovery Today
9 publications, 1.43%
Molecular Informatics
8 publications, 1.27%
Scientific Reports
8 publications, 1.27%
Computational and Structural Biotechnology Journal
7 publications, 1.11%
PLoS ONE
7 publications, 1.11%
Journal of Chemical Theory and Computation
7 publications, 1.11%
Methods in Molecular Biology
7 publications, 1.11%
Current Medicinal Chemistry
6 publications, 0.95%
Journal of Bioinformatics and Computational Biology
5 publications, 0.79%
Nature Machine Intelligence
5 publications, 0.79%
Journal of Molecular Graphics and Modelling
5 publications, 0.79%
Journal of Computational Chemistry
5 publications, 0.79%
Medical Advancements in Aging and Regenerative Technologies
5 publications, 0.79%
IEEE/ACM Transactions on Computational Biology and Bioinformatics
5 publications, 0.79%
Frontiers in Pharmacology
4 publications, 0.63%
Molecular Diversity
4 publications, 0.63%
Computers in Biology and Medicine
4 publications, 0.63%
Methods
4 publications, 0.63%
10
20
30
40
50
60
70
80
90
100

Publishers

20
40
60
80
100
120
140
American Chemical Society (ACS)
126 publications, 19.97%
Springer Nature
106 publications, 16.8%
Elsevier
87 publications, 13.79%
Wiley
65 publications, 10.3%
Oxford University Press
36 publications, 5.71%
MDPI
33 publications, 5.23%
Cold Spring Harbor Laboratory
27 publications, 4.28%
Institute of Electrical and Electronics Engineers (IEEE)
22 publications, 3.49%
IGI Global
13 publications, 2.06%
Royal Society of Chemistry (RSC)
12 publications, 1.9%
Bentham Science Publishers Ltd.
11 publications, 1.74%
Frontiers Media S.A.
11 publications, 1.74%
Taylor & Francis
11 publications, 1.74%
Public Library of Science (PLoS)
8 publications, 1.27%
World Scientific
5 publications, 0.79%
3 publications, 0.48%
IOP Publishing
3 publications, 0.48%
Association for Computing Machinery (ACM)
3 publications, 0.48%
American Physical Society (APS)
2 publications, 0.32%
Impact Journals
2 publications, 0.32%
The Royal Society
2 publications, 0.32%
Cambridge University Press
2 publications, 0.32%
Walter de Gruyter
2 publications, 0.32%
Hindawi Limited
2 publications, 0.32%
Proceedings of the National Academy of Sciences (PNAS)
2 publications, 0.32%
Annual Reviews
2 publications, 0.32%
IntechOpen
2 publications, 0.32%
Society for Industrial and Applied Mathematics (SIAM)
1 publication, 0.16%
AIP Publishing
1 publication, 0.16%
20
40
60
80
100
120
140
  • We do not take into account publications without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
Share
Cite this
GOST |
Cite this
GOST Copy
Ballester P., Mitchell J. B. O. A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking // Bioinformatics. 2010. Vol. 26. No. 9. pp. 1169-1175.
GOST all authors (up to 50) Copy
Ballester P., Mitchell J. B. O. A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking // Bioinformatics. 2010. Vol. 26. No. 9. pp. 1169-1175.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1093/bioinformatics/btq112
UR - https://doi.org/10.1093/bioinformatics/btq112
TI - A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking
T2 - Bioinformatics
AU - Ballester, Pedro
AU - Mitchell, John B. O.
PY - 2010
DA - 2010/03/17
PB - Oxford University Press
SP - 1169-1175
IS - 9
VL - 26
PMID - 20236947
SN - 1367-4803
SN - 1367-4811
SN - 1460-2059
ER -
BibTex |
Cite this
BibTex Copy
@article{2010_Ballester,
author = {Pedro Ballester and John B. O. Mitchell},
title = {A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking},
journal = {Bioinformatics},
year = {2010},
volume = {26},
publisher = {Oxford University Press},
month = {mar},
url = {https://doi.org/10.1093/bioinformatics/btq112},
number = {9},
pages = {1169--1175},
doi = {10.1093/bioinformatics/btq112}
}
MLA
Cite this
MLA Copy
Ballester, Pedro, and John B. O. Mitchell. “A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking.” Bioinformatics, vol. 26, no. 9, Mar. 2010, pp. 1169-1175. https://doi.org/10.1093/bioinformatics/btq112.
Found error?