Open Access
LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer
Тип публикации: Journal Article
Дата публикации: 2018-11-02
scimago Q1
wos Q2
БС1
SJR: 0.803
CiteScore: 5.4
Impact factor: 2.6
ISSN: 19326203
PubMed ID:
30388122
Multidisciplinary
Краткое описание
Although modern methods of whole genome DNA methylation analysis have a wide range of applications, they are not suitable for clinical diagnostics due to their high cost and complexity and due to the large amount of sample DNA required for the analysis. Therefore, it is crucial to be able to identify a relatively small number of methylation sites that provide high precision and sensitivity for the diagnosis of pathological states. We propose an algorithm for constructing limited subsamples from high-dimensional data to form diagnostic panels. We have developed a tool that utilizes different methods of selection to find an optimal, minimum necessary combination of factors using cross-entropy loss metrics (LogLoss) to identify a subset of methylation sites. We show that the algorithm can work effectively with different genome methylation patterns using ensemble-based machine learning methods. Algorithm efficiency, precision and robustness were evaluated using five genome-wide DNA methylation datasets (totaling 626 samples), and each dataset was classified into tumor and non-tumor samples. The algorithm produced an AUC of 0.97 (95% CI: 0.94–0.99, 9 sites) for prostate adenocarcinoma and an AUC of 1.0 (from 2 to 6 sites) for urothelial bladder carcinoma, two types of kidney carcinoma and colorectal carcinoma. For prostate adenocarcinoma we showed that identified differential variability methylation patterns distinguish cluster of samples with higher recurrence rate (hazard ratio for recurrence = 0.48, 95% CI: 0.05–0.92; log-rank test, p-value < 0.03). We also identified several clusters of correlated interchangeable methylation sites that can be used for the elaboration of biological interpretation of the resulting models and for further selection of the sites most suitable for designing diagnostic panels. LogLoss-BERAF is implemented as a standalone python code and open-source code is freely available from https://github.com/bioinformatics-IBCH/logloss-beraf along with the models described in this article.
Найдено
Ничего не найдено, попробуйте изменить настройки фильтра.
Найдено
Ничего не найдено, попробуйте изменить настройки фильтра.
Топ-30
Журналы
|
1
|
|
|
Current Topics in Medicinal Chemistry
1 публикация, 16.67%
|
|
|
Sustainability
1 публикация, 16.67%
|
|
|
Epigenetics and Chromatin
1 публикация, 16.67%
|
|
|
Frontiers in Genetics
1 публикация, 16.67%
|
|
|
1
|
Издатели
|
1
|
|
|
Bentham Science Publishers Ltd.
1 публикация, 16.67%
|
|
|
MDPI
1 публикация, 16.67%
|
|
|
Elsevier
1 публикация, 16.67%
|
|
|
Cold Spring Harbor Laboratory
1 публикация, 16.67%
|
|
|
Springer Nature
1 публикация, 16.67%
|
|
|
Frontiers Media S.A.
1 публикация, 16.67%
|
|
|
1
|
- Мы не учитываем публикации, у которых нет DOI.
- Статистика публикаций обновляется еженедельно.
Вы ученый?
Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
6
Всего цитирований:
6
Цитирований c 2025:
1
(16.67%)
Цитировать
ГОСТ |
RIS |
BibTex |
MLA
Цитировать
ГОСТ
Скопировать
Babalyan K. et al. LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer // PLoS ONE. 2018. Vol. 13. No. 11. p. e0204371.
ГОСТ со всеми авторами (до 50)
Скопировать
Babalyan K., Sultanov R., Generozov E. V., Sharova E., Kostryukova E., Larin A., Kanygina A., Govorun V., Arapidi G. P. LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer // PLoS ONE. 2018. Vol. 13. No. 11. p. e0204371.
Цитировать
RIS
Скопировать
TY - JOUR
DO - 10.1371/journal.pone.0204371
UR - https://doi.org/10.1371/journal.pone.0204371
TI - LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer
T2 - PLoS ONE
AU - Babalyan, K
AU - Sultanov, R.
AU - Generozov, Edward V.
AU - Sharova, E
AU - Kostryukova, E
AU - Larin, A.
AU - Kanygina, A
AU - Govorun, V.
AU - Arapidi, Georgij P.
PY - 2018
DA - 2018/11/02
PB - Public Library of Science (PLoS)
SP - e0204371
IS - 11
VL - 13
PMID - 30388122
SN - 1932-6203
ER -
Цитировать
BibTex (до 50 авторов)
Скопировать
@article{2018_Babalyan,
author = {K Babalyan and R. Sultanov and Edward V. Generozov and E Sharova and E Kostryukova and A. Larin and A Kanygina and V. Govorun and Georgij P. Arapidi},
title = {LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer},
journal = {PLoS ONE},
year = {2018},
volume = {13},
publisher = {Public Library of Science (PLoS)},
month = {nov},
url = {https://doi.org/10.1371/journal.pone.0204371},
number = {11},
pages = {e0204371},
doi = {10.1371/journal.pone.0204371}
}
Цитировать
MLA
Скопировать
Babalyan, K., et al. “LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer.” PLoS ONE, vol. 13, no. 11, Nov. 2018, p. e0204371. https://doi.org/10.1371/journal.pone.0204371.
Профили