Volume 57, Issue 4, Pages 710-716

Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds

Publication type: Journal Article
Publication date: 2017-04-10
SCImago: Q1
Top 10% SCImago
WOS: Q1
БС: 1
SJR: 1.43
CiteScore: 9.8
Impact factor: 5.3
ISSN: 1549-9596, 1549-960X
General Chemistry
Computer Science Applications
General Chemical Engineering
Library and Information Sciences
Abstract
Support vector machine (SVM) modeling is one of the most popular machine learning approaches in chemoinformatics and drug design. The influence of training set composition and size on predictions currently is an underinvestigated issue in SVM modeling. In this study, we have derived SVM classification and ranking models for a variety of compound activity classes under systematic variation of the number of positive and negative training examples. With increasing numbers of negative training compounds, SVM classification calculations became increasingly accurate and stable. However, this was only the case if a required threshold of positive training examples was also reached. In addition, consideration of class weights and optimization of cost factors substantially aided in balancing the calculations for increasing numbers of negative training examples. Taken together, the results of our analysis have practical implications for SVM learning and the prediction of active compounds. For all compound classes under study, top recall performance and independence of compound recall of training set composition was achieved when 250–500 active and 500–1000 randomly selected inactive training instances were used. However, as long as ∼50 known active compounds were available for training, increasing numbers of 500–1000 randomly selected negative training examples significantly improved model performance and gave very similar results for different training sets.
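The abstract mentions class weights as a way to balance SVM training as the number of negative examples grows. A minimal sketch of that idea follows, assuming the common inverse-frequency heuristic (weight = n_samples / (n_classes * class_count)); the abstract does not state which weighting scheme or cost-factor optimization the authors used. The function names and the grid values are hypothetical, with the grid chosen only to echo the ranges quoted in the abstract.

```python
from itertools import product


def balanced_class_weights(n_pos, n_neg):
    """Inverse-frequency class weights for a two-class training set.

    Assumption: weight = n_samples / (n_classes * class_count).
    This is a widely used heuristic, not necessarily the scheme
    applied in the paper.
    """
    if n_pos <= 0 or n_neg <= 0:
        raise ValueError("both classes need at least one training example")
    total = n_pos + n_neg
    return {
        "active": total / (2 * n_pos),
        "inactive": total / (2 * n_neg),
    }


# Hypothetical grid of training set compositions (illustrative values only).
POSITIVES = [50, 250, 500]
NEGATIVES = [500, 1000]


def composition_grid(positives=POSITIVES, negatives=NEGATIVES):
    """Enumerate every (positives, negatives) combination with its weights."""
    return [
        {"n_pos": p, "n_neg": n, "weights": balanced_class_weights(p, n)}
        for p, n in product(positives, negatives)
    ]


if __name__ == "__main__":
    for row in composition_grid():
        w = row["weights"]
        print(
            f"{row['n_pos']:>4} active / {row['n_neg']:>5} inactive: "
            f"w_active={w['active']:.2f}, w_inactive={w['inactive']:.2f}"
        )
```

Under this heuristic the minority (active) class receives a larger weight as more inactive compounds are added, so misclassifying an active compound costs more during training. The resulting dictionary could be passed to any SVM implementation that accepts per-class weights.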

Top 30

Journals

ACS Omega: 6 publications, 15%
Journal of Chemical Information and Modeling: 3 publications, 7.5%
bioRxiv: 2 publications, 5%
Applied Sciences (Switzerland): 1 publication, 2.5%
International Journal of Molecular Sciences: 1 publication, 2.5%
Journal of Computer-Aided Molecular Design: 1 publication, 2.5%
Journal of Soils and Sediments: 1 publication, 2.5%
Acta Neurochirurgica: 1 publication, 2.5%
Cell Reports Physical Science: 1 publication, 2.5%
Energy: 1 publication, 2.5%
International Journal of Human Computer Studies: 1 publication, 2.5%
Ecotoxicology and Environmental Safety: 1 publication, 2.5%
Drug Discovery Today: 1 publication, 2.5%
Artificial Intelligence in the Life Sciences: 1 publication, 2.5%
Chemical Biology and Drug Design: 1 publication, 2.5%
Journal of Medicinal Chemistry: 1 publication, 2.5%
Chemical Reviews: 1 publication, 2.5%
Journal of Proteome Research: 1 publication, 2.5%
Expert Opinion on Drug Discovery: 1 publication, 2.5%
Artificial Intelligence Chemistry: 1 publication, 2.5%
Chemical Research in Toxicology: 1 publication, 2.5%
International Journal of Applied Earth Observation and Geoinformation: 1 publication, 2.5%
Frontiers in Nuclear Engineering: 1 publication, 2.5%
BMC Psychiatry: 1 publication, 2.5%
Journal of Organic Chemistry: 1 publication, 2.5%
Lecture Notes in Networks and Systems: 1 publication, 2.5%
Journal of Pharmaceutical and Biomedical Analysis: 1 publication, 2.5%
Science: 1 publication, 2.5%
BMC Medical Informatics and Decision Making: 1 publication, 2.5%

Publishers

American Chemical Society (ACS): 14 publications, 35%
Elsevier: 10 publications, 25%
Springer Nature: 6 publications, 15%
MDPI: 2 publications, 5%
openRxiv: 2 publications, 5%
Wiley: 1 publication, 2.5%
Taylor & Francis: 1 publication, 2.5%
Institute of Electrical and Electronics Engineers (IEEE): 1 publication, 2.5%
Frontiers Media S.A.: 1 publication, 2.5%
American Association for the Advancement of Science (AAAS): 1 publication, 2.5%
  • Publications without a DOI are not counted.
  • Publication statistics are updated weekly.

Metrics: 40
Cite
GOST
Rodríguez Pérez R., Vogt M., Bajorath J. Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds // Journal of Chemical Information and Modeling. 2017. Vol. 57. No. 4. pp. 710-716.
RIS
TY  - JOUR
DO  - 10.1021/acs.jcim.7b00088
UR  - https://doi.org/10.1021/acs.jcim.7b00088
TI  - Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds
T2  - Journal of Chemical Information and Modeling
AU  - Rodríguez Pérez, Raquel
AU  - Vogt, Martin
AU  - Bajorath, Jürgen
PY  - 2017
DA  - 2017/04/10
PB  - American Chemical Society (ACS)
SP  - 710
EP  - 716
IS  - 4
VL  - 57
PMID  - 28376613
SN  - 1549-9596
SN  - 1549-960X
ER  -
BibTeX
@article{RodriguezPerez2017,
author = {Rodríguez Pérez, Raquel and Vogt, Martin and Bajorath, Jürgen},
title = {Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds},
journal = {Journal of Chemical Information and Modeling},
year = {2017},
volume = {57},
publisher = {American Chemical Society (ACS)},
month = {apr},
url = {https://doi.org/10.1021/acs.jcim.7b00088},
number = {4},
pages = {710--716},
doi = {10.1021/acs.jcim.7b00088}
}
MLA
Rodríguez Pérez, Raquel, et al. “Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds.” Journal of Chemical Information and Modeling, vol. 57, no. 4, Apr. 2017, pp. 710-716. https://doi.org/10.1021/acs.jcim.7b00088.