volume 57 issue 4 pages 1007-1012

Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions

Publication typeJournal Article
Publication date2017-04-05
scimago Q1
wos Q1
SJR1.467
CiteScore9.8
Impact factor5.3
ISSN15499596, 1549960X
General Chemistry
Computer Science Applications
General Chemical Engineering
Library and Information Sciences
Abstract
The prediction of protein-ligand binding affinity has recently been improved remarkably by machine-learning-based scoring functions. For example, using a set of simple descriptors representing the atomic distance counts, the RF-Score improves the Pearson correlation coefficient to about 0.8 on the core set of the PDBbind 2007 database, which is significantly higher than the performance of any conventional scoring function on the same benchmark. A few studies have been made to discuss the performance of machine-learning-based methods, but the reason for this improvement remains unclear. In this study, by systemically controlling the structural and sequence similarity between the training and test proteins of the PDBbind benchmark, we demonstrate that protein structural and sequence similarity makes a significant impact on machine-learning-based methods. After removal of training proteins that are highly similar to the test proteins identified by structure alignment and sequence alignment, machine-learning-based methods trained on the new training sets do not outperform the conventional scoring functions any more. On the contrary, the performance of conventional functions like X-Score is relatively stable no matter what training data are used to fit the weights of its energy terms.
Found 
Found 

Top-30

Journals

2
4
6
8
10
12
14
16
18
Journal of Chemical Information and Modeling
18 publications, 21.43%
Briefings in Bioinformatics
7 publications, 8.33%
Wiley Interdisciplinary Reviews: Computational Molecular Science
6 publications, 7.14%
Journal of Cheminformatics
3 publications, 3.57%
BMC Bioinformatics
3 publications, 3.57%
Journal of Computational Chemistry
3 publications, 3.57%
International Journal of Molecular Sciences
2 publications, 2.38%
Journal of Computer-Aided Molecular Design
2 publications, 2.38%
Chemical Physics Letters
2 publications, 2.38%
PLoS ONE
2 publications, 2.38%
Bioinformatics
2 publications, 2.38%
Journal of Physical Chemistry B
2 publications, 2.38%
Nature Machine Intelligence
2 publications, 2.38%
Molecular Informatics
1 publication, 1.19%
Current Medicinal Chemistry
1 publication, 1.19%
Biomolecules
1 publication, 1.19%
Frontiers in Pharmacology
1 publication, 1.19%
Frontiers in Bioinformatics
1 publication, 1.19%
Nature Reviews Molecular Cell Biology
1 publication, 1.19%
Theoretical Chemistry Accounts
1 publication, 1.19%
Computational and Structural Biotechnology Journal
1 publication, 1.19%
ACS Central Science
1 publication, 1.19%
Chemical Reviews
1 publication, 1.19%
Journal of Medicinal Chemistry
1 publication, 1.19%
Chemical Science
1 publication, 1.19%
IEEE/ACM Transactions on Computational Biology and Bioinformatics
1 publication, 1.19%
Nucleic Acids Research
1 publication, 1.19%
Drug Target Selection and Validation
1 publication, 1.19%
Annual Reports in Medicinal Chemistry
1 publication, 1.19%
2
4
6
8
10
12
14
16
18

Publishers

5
10
15
20
25
American Chemical Society (ACS)
23 publications, 27.38%
Springer Nature
16 publications, 19.05%
Wiley
11 publications, 13.1%
Oxford University Press
10 publications, 11.9%
Elsevier
8 publications, 9.52%
Cold Spring Harbor Laboratory
4 publications, 4.76%
MDPI
3 publications, 3.57%
Frontiers Media S.A.
2 publications, 2.38%
Public Library of Science (PLoS)
2 publications, 2.38%
Bentham Science Publishers Ltd.
1 publication, 1.19%
Royal Society of Chemistry (RSC)
1 publication, 1.19%
Institute of Electrical and Electronics Engineers (IEEE)
1 publication, 1.19%
OOO Zhurnal "Mendeleevskie Soobshcheniya"
1 publication, 1.19%
Hindawi Limited
1 publication, 1.19%
5
10
15
20
25
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
84
Share
Cite this
GOST |
Cite this
GOST Copy
Li Y., Yang J. Y. Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions // Journal of Chemical Information and Modeling. 2017. Vol. 57. No. 4. pp. 1007-1012.
GOST all authors (up to 50) Copy
Li Y., Yang J. Y. Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions // Journal of Chemical Information and Modeling. 2017. Vol. 57. No. 4. pp. 1007-1012.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1021/acs.jcim.7b00049
UR - https://doi.org/10.1021/acs.jcim.7b00049
TI - Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions
T2 - Journal of Chemical Information and Modeling
AU - Li, Yang
AU - Yang, Jian Yi
PY - 2017
DA - 2017/04/05
PB - American Chemical Society (ACS)
SP - 1007-1012
IS - 4
VL - 57
PMID - 28358210
SN - 1549-9596
SN - 1549-960X
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2017_Li,
author = {Yang Li and Jian Yi Yang},
title = {Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions},
journal = {Journal of Chemical Information and Modeling},
year = {2017},
volume = {57},
publisher = {American Chemical Society (ACS)},
month = {apr},
url = {https://doi.org/10.1021/acs.jcim.7b00049},
number = {4},
pages = {1007--1012},
doi = {10.1021/acs.jcim.7b00049}
}
MLA
Cite this
MLA Copy
Li, Yang, et al. “Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions.” Journal of Chemical Information and Modeling, vol. 57, no. 4, Apr. 2017, pp. 1007-1012. https://doi.org/10.1021/acs.jcim.7b00049.