Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery.
T.M. Lane
1, 2
,
Daniel P Russo
1, 3
,
Kimberley M. Zorn
1
,
Alex M. Clark
4
,
Alexandru Korotcov
5
,
Valery Tkachenko
5
,
R.C. Reynolds
6
,
Alexander L. Perryman
7
,
Joel S. Freundlich
7, 8
,
1
Collaborations Pharmaceuticals, Inc., Main Campus Drive, Lab 3510, Raleigh, North Carolina 27606, United States
|
4
Molecular Materials Informatics, Inc., 1900 St. Jacques #302, Montreal H3J 2S1, Quebec, Canada
|
5
Science Data Software, LLC, 14914 Bradwill Court, Rockville, Maryland 20850, United States
|
Publication type: Journal Article
Publication date: 2018-04-19
scimago Q1
wos Q1
SJR: 0.968
CiteScore: 7.8
Impact factor: 4.5
ISSN: 15438384, 15438392
PubMed ID:
29672063
Drug Discovery
Pharmaceutical Science
Molecular Medicine
Abstract
Tuberculosis is a global health dilemma. In 2016, the WHO reported 10.4 million incidences and 1.7 million deaths. The need to develop new treatments for those infected with Mycobacterium tuberculosis ( Mtb) has led to many large-scale phenotypic screens and many thousands of new active compounds identified in vitro. However, with limited funding, efforts to discover new active molecules against Mtb needs to be more efficient. Several computational machine learning approaches have been shown to have good enrichment and hit rates. We have curated small molecule Mtb data and developed new models with a total of 18,886 molecules with activity cutoffs of 10 μM, 1 μM, and 100 nM. These data sets were used to evaluate different machine learning methods (including deep learning) and metrics and to generate predictions for additional molecules published in 2017. One Mtb model, a combined in vitro and in vivo data Bayesian model at a 100 nM activity yielded the following metrics for 5-fold cross validation: accuracy = 0.88, precision = 0.22, recall = 0.91, specificity = 0.88, kappa = 0.31, and MCC = 0.41. We have also curated an evaluation set ( n = 153 compounds) published in 2017, and when used to test our model, it showed the comparable statistics (accuracy = 0.83, precision = 0.27, recall = 1.00, specificity = 0.81, kappa = 0.36, and MCC = 0.47). We have also compared these models with additional machine learning algorithms showing Bayesian machine learning models constructed with literature Mtb data generated by different laboratories generally were equivalent to or outperformed deep neural networks with external test sets. Finally, we have also compared our training and test sets to show they were suitably diverse and different in order to represent useful evaluation sets. Such Mtb machine learning models could help prioritize compounds for testing in vitro and in vivo.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
2
4
6
8
10
|
|
|
Molecular Pharmaceutics
10 publications, 10.2%
|
|
|
ACS Omega
6 publications, 6.12%
|
|
|
Scientific Reports
4 publications, 4.08%
|
|
|
Pharmaceutical Research
4 publications, 4.08%
|
|
|
Environmental Science & Technology
4 publications, 4.08%
|
|
|
Journal of Chemical Information and Modeling
4 publications, 4.08%
|
|
|
Journal of Medicinal Chemistry
3 publications, 3.06%
|
|
|
ACS Chemical Neuroscience
3 publications, 3.06%
|
|
|
Molecular Pharmacology
2 publications, 2.04%
|
|
|
Nature Materials
2 publications, 2.04%
|
|
|
Tuberculosis
2 publications, 2.04%
|
|
|
Bioorganic Chemistry
2 publications, 2.04%
|
|
|
Chemical Biology and Drug Design
2 publications, 2.04%
|
|
|
Analytical Chemistry
2 publications, 2.04%
|
|
|
ACS Infectious Diseases
2 publications, 2.04%
|
|
|
Briefings in Bioinformatics
2 publications, 2.04%
|
|
|
Drug Metabolism and Disposition
2 publications, 2.04%
|
|
|
Current Topics in Medicinal Chemistry
1 publication, 1.02%
|
|
|
Journal of Chemical Health and Safety
1 publication, 1.02%
|
|
|
Journal of Computational Biophysics and Chemistry
1 publication, 1.02%
|
|
|
Frontiers in Chemistry
1 publication, 1.02%
|
|
|
Frontiers in Pharmacology
1 publication, 1.02%
|
|
|
Frontiers in Psychiatry
1 publication, 1.02%
|
|
|
Acta Biochimica et Biophysica Sinica
1 publication, 1.02%
|
|
|
Biophysical Reviews
1 publication, 1.02%
|
|
|
Brazilian Journal of Infectious Diseases
1 publication, 1.02%
|
|
|
Drug Discovery Today
1 publication, 1.02%
|
|
|
Antiviral Research
1 publication, 1.02%
|
|
|
Computational and Structural Biotechnology Journal
1 publication, 1.02%
|
|
|
2
4
6
8
10
|
Publishers
|
5
10
15
20
25
30
35
40
|
|
|
American Chemical Society (ACS)
37 publications, 37.76%
|
|
|
Elsevier
18 publications, 18.37%
|
|
|
Springer Nature
17 publications, 17.35%
|
|
|
Wiley
5 publications, 5.1%
|
|
|
Oxford University Press
4 publications, 4.08%
|
|
|
Cold Spring Harbor Laboratory
4 publications, 4.08%
|
|
|
Frontiers Media S.A.
3 publications, 3.06%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
3 publications, 3.06%
|
|
|
Royal Society of Chemistry (RSC)
2 publications, 2.04%
|
|
|
American Society for Pharmacology and Experimental Therapeutics
1 publication, 1.02%
|
|
|
Bentham Science Publishers Ltd.
1 publication, 1.02%
|
|
|
World Scientific
1 publication, 1.02%
|
|
|
Taylor & Francis
1 publication, 1.02%
|
|
|
5
10
15
20
25
30
35
40
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
98
Total citations:
98
Citations from 2024:
24
(24%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Lane T. et al. Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery. // Molecular Pharmaceutics. 2018. Vol. 15. No. 10. pp. 4346-4360.
GOST all authors (up to 50)
Copy
Lane T., Russo D. P., Zorn K. M., Clark A. M., Korotcov A., Tkachenko V., Reynolds R., Perryman A. L., Freundlich J. S., Ekins S. Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery. // Molecular Pharmaceutics. 2018. Vol. 15. No. 10. pp. 4346-4360.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1021/acs.molpharmaceut.8b00083
UR - https://doi.org/10.1021/acs.molpharmaceut.8b00083
TI - Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery.
T2 - Molecular Pharmaceutics
AU - Lane, T.M.
AU - Russo, Daniel P
AU - Zorn, Kimberley M.
AU - Clark, Alex M.
AU - Korotcov, Alexandru
AU - Tkachenko, Valery
AU - Reynolds, R.C.
AU - Perryman, Alexander L.
AU - Freundlich, Joel S.
AU - Ekins, Sean
PY - 2018
DA - 2018/04/19
PB - American Chemical Society (ACS)
SP - 4346-4360
IS - 10
VL - 15
PMID - 29672063
SN - 1543-8384
SN - 1543-8392
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2018_Lane,
author = {T.M. Lane and Daniel P Russo and Kimberley M. Zorn and Alex M. Clark and Alexandru Korotcov and Valery Tkachenko and R.C. Reynolds and Alexander L. Perryman and Joel S. Freundlich and Sean Ekins},
title = {Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery.},
journal = {Molecular Pharmaceutics},
year = {2018},
volume = {15},
publisher = {American Chemical Society (ACS)},
month = {apr},
url = {https://doi.org/10.1021/acs.molpharmaceut.8b00083},
number = {10},
pages = {4346--4360},
doi = {10.1021/acs.molpharmaceut.8b00083}
}
Cite this
MLA
Copy
Lane, T.M., et al. “Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery..” Molecular Pharmaceutics, vol. 15, no. 10, Apr. 2018, pp. 4346-4360. https://doi.org/10.1021/acs.molpharmaceut.8b00083.
Profiles