Open Access
Open access
Human Genome Variation, volume 7, issue 1, publication number 40

A reference catalog of DNA palindromes in the human genome and their variations in 1000 Genomes

Madhavi K. Ganapathiraju 1, 2, 3
Sandeep Subramanian 1, 3
Srilakshmi Chaparala 1
Kalyani B Karunakaran 4
1
 
Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, USA
2
 
Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, USA
Publication typeJournal Article
Publication date2020-11-20
Quartile SCImago
Q3
Quartile WOS
Impact factor1.5
ISSN2054345X
Biochemistry
Molecular Biology
Genetics
Abstract
A palindrome in DNA is like a palindrome in language, but when read backwards, it is a complement of the forward sequence; effectively, the two halves of a sequence complement each other from its midpoint like in a double strand of DNA. Palindromes are distributed throughout the human genome and play significant roles in gene expression and regulation. Palindromic mutations are linked to many human diseases, such as neuronal disorders, mental retardation, and various cancers. In this work, we computed and analyzed the palindromic sequences in the human genome and studied their conservation in personal genomes using 1000 Genomes data. We found that ~30% of the palindromes exhibit variation, some of which are caused by rare variants. The analysis of disease/trait-associated single-nucleotide polymorphisms in palindromic regions showed that disease-associated risk variants are 14 times more likely to be present in palindromic regions than in other regions. The catalog of palindromes in the reference genome and 1000 Genomes is being made available here with details on their variations in each individual genome to serve as a resource for future and retrospective whole-genome studies identifying statistically significant palindrome variations associated with diseases or traits and their roles in disease mechanisms. A toolkit to detect DNA patterns has pinpointed millions of palindromes within the human genome, which could further the understanding of their association with disease. Madhavi K. Ganapathiraju of the University of Pittsburgh and colleagues used their Biological Language Modelling Toolkit to mine the human reference genome and the genomes of 2504 individuals from the 1000 Genomes project for palindromes: DNA sequences with two complementary halves, where folding at the midpoint leads to complementary base pairs lining up with each other. They identified their locations and lengths and looked for known disease-associated genetic variants, called single nucleotide polymorphisms, within them. They found links to diseases such as diabetes, obesity, Alzheimer’s disease, Crohn’s disease, and some cancers. The data was compiled into the Catalog of Palindromic Sequences to serve as a resource for investigating palindromic variations associated with disease.

Top-30

Journals

1
2
International Journal of Molecular Sciences
2 publications, 9.09%
Genes
1 publication, 4.55%
Nucleic Acids Research
1 publication, 4.55%
New Journal of Chemistry
1 publication, 4.55%
Viruses
1 publication, 4.55%
Advances in Genetics
1 publication, 4.55%
Frontiers in Immunology
1 publication, 4.55%
PLoS Genetics
1 publication, 4.55%
Chemistry - A European Journal
1 publication, 4.55%
Advances in Cancer Research
1 publication, 4.55%
mBio
1 publication, 4.55%
Russian Chemical Reviews
1 publication, 4.55%
European Journal of Haematology
1 publication, 4.55%
1
2

Publishers

1
2
3
4
MDPI
4 publications, 18.18%
Elsevier
3 publications, 13.64%
Wiley
3 publications, 13.64%
Research Square Platform LLC
2 publications, 9.09%
Oxford University Press
1 publication, 4.55%
Royal Society of Chemistry (RSC)
1 publication, 4.55%
Cold Spring Harbor Laboratory
1 publication, 4.55%
Frontiers Media S.A.
1 publication, 4.55%
Institute of Electrical and Electronics Engineers (IEEE)
1 publication, 4.55%
Public Library of Science (PLoS)
1 publication, 4.55%
American Society for Microbiology
1 publication, 4.55%
Autonomous Non-profit Organization Editorial Board of the journal Uspekhi Khimii
1 publication, 4.55%
1
2
3
4
  • We do not take into account publications without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
Share
Cite this
GOST |
Cite this
GOST Copy
Ganapathiraju M. K. et al. A reference catalog of DNA palindromes in the human genome and their variations in 1000 Genomes // Human Genome Variation. 2020. Vol. 7. No. 1. 40
GOST all authors (up to 50) Copy
Ganapathiraju M. K., Subramanian S., Chaparala S., Karunakaran K. B. A reference catalog of DNA palindromes in the human genome and their variations in 1000 Genomes // Human Genome Variation. 2020. Vol. 7. No. 1. 40
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1038/s41439-020-00127-5
UR - https://doi.org/10.1038/s41439-020-00127-5
TI - A reference catalog of DNA palindromes in the human genome and their variations in 1000 Genomes
T2 - Human Genome Variation
AU - Ganapathiraju, Madhavi K.
AU - Subramanian, Sandeep
AU - Chaparala, Srilakshmi
AU - Karunakaran, Kalyani B
PY - 2020
DA - 2020/11/20
PB - Springer Nature
IS - 1
VL - 7
SN - 2054-345X
ER -
BibTex
Cite this
BibTex Copy
@article{2020_Ganapathiraju,
author = {Madhavi K. Ganapathiraju and Sandeep Subramanian and Srilakshmi Chaparala and Kalyani B Karunakaran},
title = {A reference catalog of DNA palindromes in the human genome and their variations in 1000 Genomes},
journal = {Human Genome Variation},
year = {2020},
volume = {7},
publisher = {Springer Nature},
month = {nov},
url = {https://doi.org/10.1038/s41439-020-00127-5},
number = {1},
doi = {10.1038/s41439-020-00127-5}
}
Found error?