Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions
Publication type: Journal Article
Publication date: 2017-04-05
scimago Q1
wos Q1
SJR: 1.467
CiteScore: 9.8
Impact factor: 5.3
ISSN: 15499596, 1549960X
PubMed ID:
28358210
General Chemistry
Computer Science Applications
General Chemical Engineering
Library and Information Sciences
Abstract
The prediction of protein-ligand binding affinity has recently been improved remarkably by machine-learning-based scoring functions. For example, using a set of simple descriptors representing the atomic distance counts, the RF-Score improves the Pearson correlation coefficient to about 0.8 on the core set of the PDBbind 2007 database, which is significantly higher than the performance of any conventional scoring function on the same benchmark. A few studies have been made to discuss the performance of machine-learning-based methods, but the reason for this improvement remains unclear. In this study, by systemically controlling the structural and sequence similarity between the training and test proteins of the PDBbind benchmark, we demonstrate that protein structural and sequence similarity makes a significant impact on machine-learning-based methods. After removal of training proteins that are highly similar to the test proteins identified by structure alignment and sequence alignment, machine-learning-based methods trained on the new training sets do not outperform the conventional scoring functions any more. On the contrary, the performance of conventional functions like X-Score is relatively stable no matter what training data are used to fit the weights of its energy terms.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
2
4
6
8
10
12
14
16
18
|
|
|
Journal of Chemical Information and Modeling
18 publications, 21.43%
|
|
|
Briefings in Bioinformatics
7 publications, 8.33%
|
|
|
Wiley Interdisciplinary Reviews: Computational Molecular Science
6 publications, 7.14%
|
|
|
Journal of Cheminformatics
3 publications, 3.57%
|
|
|
BMC Bioinformatics
3 publications, 3.57%
|
|
|
Journal of Computational Chemistry
3 publications, 3.57%
|
|
|
International Journal of Molecular Sciences
2 publications, 2.38%
|
|
|
Journal of Computer-Aided Molecular Design
2 publications, 2.38%
|
|
|
Chemical Physics Letters
2 publications, 2.38%
|
|
|
PLoS ONE
2 publications, 2.38%
|
|
|
Bioinformatics
2 publications, 2.38%
|
|
|
Journal of Physical Chemistry B
2 publications, 2.38%
|
|
|
Nature Machine Intelligence
2 publications, 2.38%
|
|
|
Molecular Informatics
1 publication, 1.19%
|
|
|
Current Medicinal Chemistry
1 publication, 1.19%
|
|
|
Biomolecules
1 publication, 1.19%
|
|
|
Frontiers in Pharmacology
1 publication, 1.19%
|
|
|
Frontiers in Bioinformatics
1 publication, 1.19%
|
|
|
Nature Reviews Molecular Cell Biology
1 publication, 1.19%
|
|
|
Theoretical Chemistry Accounts
1 publication, 1.19%
|
|
|
Computational and Structural Biotechnology Journal
1 publication, 1.19%
|
|
|
ACS Central Science
1 publication, 1.19%
|
|
|
Chemical Reviews
1 publication, 1.19%
|
|
|
Journal of Medicinal Chemistry
1 publication, 1.19%
|
|
|
Chemical Science
1 publication, 1.19%
|
|
|
IEEE/ACM Transactions on Computational Biology and Bioinformatics
1 publication, 1.19%
|
|
|
Nucleic Acids Research
1 publication, 1.19%
|
|
|
Drug Target Selection and Validation
1 publication, 1.19%
|
|
|
Annual Reports in Medicinal Chemistry
1 publication, 1.19%
|
|
|
2
4
6
8
10
12
14
16
18
|
Publishers
|
5
10
15
20
25
|
|
|
American Chemical Society (ACS)
23 publications, 27.38%
|
|
|
Springer Nature
16 publications, 19.05%
|
|
|
Wiley
11 publications, 13.1%
|
|
|
Oxford University Press
10 publications, 11.9%
|
|
|
Elsevier
8 publications, 9.52%
|
|
|
Cold Spring Harbor Laboratory
4 publications, 4.76%
|
|
|
MDPI
3 publications, 3.57%
|
|
|
Frontiers Media S.A.
2 publications, 2.38%
|
|
|
Public Library of Science (PLoS)
2 publications, 2.38%
|
|
|
Bentham Science Publishers Ltd.
1 publication, 1.19%
|
|
|
Royal Society of Chemistry (RSC)
1 publication, 1.19%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
1 publication, 1.19%
|
|
|
OOO Zhurnal "Mendeleevskie Soobshcheniya"
1 publication, 1.19%
|
|
|
Hindawi Limited
1 publication, 1.19%
|
|
|
5
10
15
20
25
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
84
Total citations:
84
Citations from 2025:
9
(10.71%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Li Y., Yang J. Y. Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions // Journal of Chemical Information and Modeling. 2017. Vol. 57. No. 4. pp. 1007-1012.
GOST all authors (up to 50)
Copy
Li Y., Yang J. Y. Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions // Journal of Chemical Information and Modeling. 2017. Vol. 57. No. 4. pp. 1007-1012.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1021/acs.jcim.7b00049
UR - https://doi.org/10.1021/acs.jcim.7b00049
TI - Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions
T2 - Journal of Chemical Information and Modeling
AU - Li, Yang
AU - Yang, Jian Yi
PY - 2017
DA - 2017/04/05
PB - American Chemical Society (ACS)
SP - 1007-1012
IS - 4
VL - 57
PMID - 28358210
SN - 1549-9596
SN - 1549-960X
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2017_Li,
author = {Yang Li and Jian Yi Yang},
title = {Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions},
journal = {Journal of Chemical Information and Modeling},
year = {2017},
volume = {57},
publisher = {American Chemical Society (ACS)},
month = {apr},
url = {https://doi.org/10.1021/acs.jcim.7b00049},
number = {4},
pages = {1007--1012},
doi = {10.1021/acs.jcim.7b00049}
}
Cite this
MLA
Copy
Li, Yang, et al. “Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein–Ligand Interactions.” Journal of Chemical Information and Modeling, vol. 57, no. 4, Apr. 2017, pp. 1007-1012. https://doi.org/10.1021/acs.jcim.7b00049.