Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.
1
Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, North Carolina 27606, United States
|
3
Molecular Materials Informatics, Inc., Montreal, Quebec H3J 2S1, Canada
|
Publication type: Journal Article
Publication date: 2018-08-16
scimago Q1
wos Q1
SJR: 0.968
CiteScore: 7.8
Impact factor: 4.5
ISSN: 15438384, 15438392
PubMed ID:
30114914
Drug Discovery
Pharmaceutical Science
Molecular Medicine
Abstract
Many chemicals that disrupt endocrine function have been linked to a variety of adverse biological outcomes. However, screening for endocrine disruption using in vitro or in vivo approaches is costly and time-consuming. Computational methods, e.g., quantitative structure-activity relationship models, have become more reliable due to bigger training sets, increased computing power, and advanced machine learning algorithms, such as multilayered artificial neural networks. Machine learning models can be used to predict compounds for endocrine disrupting capabilities, such as binding to the estrogen receptor (ER), and allow for prioritization and further testing. In this work, an exhaustive comparison of multiple machine learning algorithms, chemical spaces, and evaluation metrics for ER binding was performed on public data sets curated using in-house cheminformatics software (Assay Central). Chemical features utilized in modeling consisted of binary fingerprints (ECFP6, FCFP6, ToxPrint, or MACCS keys) and continuous molecular descriptors from RDKit. Each feature set was subjected to classic machine learning algorithms (Bernoulli Naive Bayes, AdaBoost Decision Tree, Random Forest, Support Vector Machine) and Deep Neural Networks (DNN). Models were evaluated using a variety of metrics: recall, precision, F1-score, accuracy, area under the receiver operating characteristic curve, Cohen's Kappa, and Matthews correlation coefficient. For predicting compounds within the training set, DNN has an accuracy higher than that of other methods; however, in 5-fold cross validation and external test set predictions, DNN and most classic machine learning models perform similarly regardless of the data set or molecular descriptors used. We have also used the rank normalized scores as a performance-criteria for each machine learning method, and Random Forest performed best on the validation set when ranked by metric or by data sets. These results suggest classic machine learning algorithms may be sufficient to develop high quality predictive models of ER activity.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
2
4
6
8
10
|
|
|
Molecular Pharmaceutics
10 publications, 7.19%
|
|
|
Environmental Science & Technology
7 publications, 5.04%
|
|
|
ACS Omega
5 publications, 3.6%
|
|
|
Journal of Chemical Information and Modeling
4 publications, 2.88%
|
|
|
Chemical Research in Toxicology
4 publications, 2.88%
|
|
|
Pharmaceutical Research
3 publications, 2.16%
|
|
|
Drug Discovery Today
3 publications, 2.16%
|
|
|
Chemosphere
3 publications, 2.16%
|
|
|
ACS Sustainable Chemistry and Engineering
3 publications, 2.16%
|
|
|
Molecular Pharmacology
2 publications, 1.44%
|
|
|
International Journal of Molecular Sciences
2 publications, 1.44%
|
|
|
Frontiers in Pharmacology
2 publications, 1.44%
|
|
|
Scientific Reports
2 publications, 1.44%
|
|
|
Nature Materials
2 publications, 1.44%
|
|
|
Bioorganic Chemistry
2 publications, 1.44%
|
|
|
Journal of Hazardous Materials
2 publications, 1.44%
|
|
|
Analytical Chemistry
2 publications, 1.44%
|
|
|
Methods in Molecular Biology
2 publications, 1.44%
|
|
|
Journal of Chemical Physics
1 publication, 0.72%
|
|
|
Journal of Systems Science and Complexity
1 publication, 0.72%
|
|
|
Journal of Chemical Health and Safety
1 publication, 0.72%
|
|
|
Molecules
1 publication, 0.72%
|
|
|
Journal of Personalized Medicine
1 publication, 0.72%
|
|
|
Cells
1 publication, 0.72%
|
|
|
Pharmaceutics
1 publication, 0.72%
|
|
|
Frontiers in Cardiovascular Medicine
1 publication, 0.72%
|
|
|
Frontiers in Endocrinology
1 publication, 0.72%
|
|
|
Computers, Materials and Continua
1 publication, 0.72%
|
|
|
Multimedia Tools and Applications
1 publication, 0.72%
|
|
|
2
4
6
8
10
|
Publishers
|
5
10
15
20
25
30
35
40
|
|
|
American Chemical Society (ACS)
40 publications, 28.78%
|
|
|
Elsevier
28 publications, 20.14%
|
|
|
Springer Nature
21 publications, 15.11%
|
|
|
MDPI
10 publications, 7.19%
|
|
|
Oxford University Press
6 publications, 4.32%
|
|
|
Frontiers Media S.A.
5 publications, 3.6%
|
|
|
Wiley
5 publications, 3.6%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
4 publications, 2.88%
|
|
|
Cold Spring Harbor Laboratory
3 publications, 2.16%
|
|
|
IGI Global
2 publications, 1.44%
|
|
|
AIP Publishing
1 publication, 0.72%
|
|
|
American Society for Pharmacology and Experimental Therapeutics
1 publication, 0.72%
|
|
|
Tech Science Press
1 publication, 0.72%
|
|
|
Social Science Electronic Publishing
1 publication, 0.72%
|
|
|
Hindawi Limited
1 publication, 0.72%
|
|
|
American Society of Clinical Oncology (ASCO)
1 publication, 0.72%
|
|
|
Annual Reviews
1 publication, 0.72%
|
|
|
Public Library of Science (PLoS)
1 publication, 0.72%
|
|
|
AME Publishing Company
1 publication, 0.72%
|
|
|
Radiological Society of North America (RSNA)
1 publication, 0.72%
|
|
|
The Company of Biologists
1 publication, 0.72%
|
|
|
World Scientific
1 publication, 0.72%
|
|
|
International Press of Boston
1 publication, 0.72%
|
|
|
5
10
15
20
25
30
35
40
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
139
Total citations:
139
Citations from 2024:
27
(19%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Russo D. P. et al. Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction. // Molecular Pharmaceutics. 2018. Vol. 15. No. 10. pp. 4361-4370.
GOST all authors (up to 50)
Copy
Russo D. P., Zorn K. M., Clark A. M., Zhu H., Ekins S. Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction. // Molecular Pharmaceutics. 2018. Vol. 15. No. 10. pp. 4361-4370.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1021/acs.molpharmaceut.8b00546
UR - https://doi.org/10.1021/acs.molpharmaceut.8b00546
TI - Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.
T2 - Molecular Pharmaceutics
AU - Russo, Daniel P
AU - Zorn, Kimberley M.
AU - Clark, Alex M.
AU - Zhu, H
AU - Ekins, Sean
PY - 2018
DA - 2018/08/16
PB - American Chemical Society (ACS)
SP - 4361-4370
IS - 10
VL - 15
PMID - 30114914
SN - 1543-8384
SN - 1543-8392
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2018_Russo,
author = {Daniel P Russo and Kimberley M. Zorn and Alex M. Clark and H Zhu and Sean Ekins},
title = {Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.},
journal = {Molecular Pharmaceutics},
year = {2018},
volume = {15},
publisher = {American Chemical Society (ACS)},
month = {aug},
url = {https://doi.org/10.1021/acs.molpharmaceut.8b00546},
number = {10},
pages = {4361--4370},
doi = {10.1021/acs.molpharmaceut.8b00546}
}
Cite this
MLA
Copy
Russo, Daniel P., et al. “Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction..” Molecular Pharmaceutics, vol. 15, no. 10, Aug. 2018, pp. 4361-4370. https://doi.org/10.1021/acs.molpharmaceut.8b00546.
Profiles