Open Access
Open access
volume 16 issue 1 publication number 1739

Statistical framework for calling allelic imbalance in high-throughput sequencing data

Andrey Buyan 1, 2
Georgy Meshcheryakov 1
Viacheslav Safronov 3
Sergey Abramov 4, 5, 6
Alexandr Boytsov 4, 5, 6
Vladimir Nozdrin 3
Eugene Baulin 6, 7
Semyon Kolmykov 8
J. Vierstra 5
Fedor Kolpakov 8, 9
Vsevolod J. Makeev 4, 6, 10, 11
Publication typeJournal Article
Publication date2025-02-18
scimago Q1
wos Q1
SJR4.761
CiteScore23.4
Impact factor15.7
ISSN20411723
Abstract
High-throughput sequencing facilitates large-scale studies of gene regulation and allows tracing the associations of individual genomic variants with changes in gene regulation and expression. Compared to classic association studies, the assessment of an allelic imbalance at heterozygous variants captures functional variant effects with smaller sample sizes, higher sensitivity, and better resolution. Yet, identification of allele-specific variants from allelic read counts remains challenging due to data-dependent biases and overdispersion arising from technical and biological variability. We present MIXALIME, a novel computational framework for calling allele-specific variants in diverse omics data with a repertoire of statistical models accounting for read mapping bias and copy number variation. We benchmark MIXALIME with DNase-Seq, ATAC-Seq, and CAGE-Seq data, and we demonstrate that the allelic imbalance highlights causal variants in GWAS results. Finally, as a showcase of the large-scale practical application of MIXALIME, we present an atlas of variants exhibiting allele-specific chromatin accessibility, built from thousands of available datasets obtained from diverse cell types. The authors present a statistical and computational framework to identify allele-specific variants, i.e., single nucleotide variants exhibiting allele-specificity (allelic imbalance) in any type of omics assay. Application of this framework to thousands of datasets yields an atlas of chromatin altering variants in diverse cell types.
Found 
Found 

Top-30

Journals

1
Computers in Biology and Medicine
1 publication, 20%
Nucleic Acids Research
1 publication, 20%
Human Gene Therapy
1 publication, 20%
Methods in Ecology and Evolution
1 publication, 20%
1

Publishers

1
Elsevier
1 publication, 20%
Oxford University Press
1 publication, 20%
Cold Spring Harbor Laboratory
1 publication, 20%
SAGE
1 publication, 20%
Wiley
1 publication, 20%
1
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
5
Share
Cite this
GOST |
Cite this
GOST Copy
Buyan A. et al. Statistical framework for calling allelic imbalance in high-throughput sequencing data // Nature Communications. 2025. Vol. 16. No. 1. 1739
GOST all authors (up to 50) Copy
Buyan A., Meshcheryakov G., Safronov V., Abramov S., Boytsov A., Nozdrin V., Baulin E., Kolmykov S., Vierstra J., Kolpakov F., Makeev V. J., Kulakovskiy I. V. Statistical framework for calling allelic imbalance in high-throughput sequencing data // Nature Communications. 2025. Vol. 16. No. 1. 1739
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1038/s41467-024-55513-2
UR - https://www.nature.com/articles/s41467-024-55513-2
TI - Statistical framework for calling allelic imbalance in high-throughput sequencing data
T2 - Nature Communications
AU - Buyan, Andrey
AU - Meshcheryakov, Georgy
AU - Safronov, Viacheslav
AU - Abramov, Sergey
AU - Boytsov, Alexandr
AU - Nozdrin, Vladimir
AU - Baulin, Eugene
AU - Kolmykov, Semyon
AU - Vierstra, J.
AU - Kolpakov, Fedor
AU - Makeev, Vsevolod J.
AU - Kulakovskiy, Ivan V.
PY - 2025
DA - 2025/02/18
PB - Springer Nature
IS - 1
VL - 16
SN - 2041-1723
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2025_Buyan,
author = {Andrey Buyan and Georgy Meshcheryakov and Viacheslav Safronov and Sergey Abramov and Alexandr Boytsov and Vladimir Nozdrin and Eugene Baulin and Semyon Kolmykov and J. Vierstra and Fedor Kolpakov and Vsevolod J. Makeev and Ivan V. Kulakovskiy},
title = {Statistical framework for calling allelic imbalance in high-throughput sequencing data},
journal = {Nature Communications},
year = {2025},
volume = {16},
publisher = {Springer Nature},
month = {feb},
url = {https://www.nature.com/articles/s41467-024-55513-2},
number = {1},
pages = {1739},
doi = {10.1038/s41467-024-55513-2}
}