Open Access

GenBase: A Nucleotide Sequence Database

Congfan Bu 1, 2
Xinchang Zheng 1, 2, 3
Xuetong Zhao 1, 2
Tianyi Xu 1, 2
Xue Bai 1, 2
Yaokai Jia 1, 2
Mei-Li Chen 1, 2
Lili Hao 1, 2
Jingfa Xiao 1, 2, 4
Zhang Zhang 1, 2, 4
Wenming Zhao 1, 2, 4
Bixia Tang 1, 2
Yiming Bao 1, 2, 4
Publication type: Journal Article
Publication date: 2024-06-24
ISSN: 1672-0229
Abstract

The rapid advancement of sequencing technologies makes it challenging to manage the large volume and exponential growth of sequence data efficiently and in a timely manner. To address this issue, we present GenBase (https://ngdc.cncb.ac.cn/genbase), an open-access data repository that follows the International Nucleotide Sequence Database Collaboration (INSDC) data standards and structures, for efficient nucleotide sequence archiving, searching, and sharing. As a core resource within the National Genomics Data Center (NGDC) of the China National Center for Bioinformation (CNCB; https://ngdc.cncb.ac.cn), GenBase offers a bilingual submission pipeline and services, as well as local submission assistance in China. GenBase also provides a unique Excel format for metadata description and feature annotation of nucleotide sequences, along with a real-time data validation system to streamline sequence submissions. As of April 23, 2024, GenBase had received 68,251 nucleotide sequences and 689,574 annotated protein sequences across 414 species from 2319 submissions. Of these, 63,614 (93%) nucleotide sequences and 620,640 (90%) annotated protein sequences have been released and are publicly accessible through GenBase’s web search system, File Transfer Protocol (FTP), and Application Programming Interface (API). Additionally, in collaboration with INSDC, GenBase has constructed an effective data exchange mechanism with GenBank and started sharing released nucleotide sequences. Furthermore, GenBase integrates all sequences from GenBank with daily updates, demonstrating its commitment to actively contributing to global sequence data management and sharing.
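
The release percentages quoted in the abstract can be reproduced from the counts it gives. The following is a minimal sketch of that arithmetic only; the dictionary keys and the function name `released_share` are illustrative assumptions and are not part of GenBase or its API.

```python
# Counts quoted in the abstract (as of April 23, 2024).
# The variable and function names below are illustrative, not GenBase identifiers.
received = {"nucleotide": 68_251, "protein": 689_574}
released = {"nucleotide": 63_614, "protein": 620_640}


def released_share(kind: str) -> float:
    """Return the released fraction of received sequences, in percent."""
    return 100 * released[kind] / received[kind]


for kind in received:
    print(f"{kind}: {released_share(kind):.1f}% released")
```

Rounded to whole percentages, the two shares come out at 93% and 90%, in line with the figures stated in the abstract.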

Top-30

Journals

Viruses: 3 publications, 9.68%
Nucleic Acids Research: 2 publications, 6.45%
Cell: 2 publications, 6.45%
Zoosystematics and Evolution: 2 publications, 6.45%
PhytoKeys: 2 publications, 6.45%
Science of the Total Environment: 1 publication, 3.23%
Frontiers in Public Health: 1 publication, 3.23%
Journal of Maternal-Fetal and Neonatal Medicine: 1 publication, 3.23%
BMC Plant Biology: 1 publication, 3.23%
Scientific Reports: 1 publication, 3.23%
Plant Diversity: 1 publication, 3.23%
Cell Regeneration: 1 publication, 3.23%
Mitochondrial DNA Part B: Resources: 1 publication, 3.23%
Virology Journal: 1 publication, 3.23%
BMC Cancer: 1 publication, 3.23%
Nature Communications: 1 publication, 3.23%
iScience: 1 publication, 3.23%
Insect Biochemistry and Molecular Biology: 1 publication, 3.23%
Frontiers in Plant Science: 1 publication, 3.23%
Journal of Fungi: 1 publication, 3.23%
Nature: 1 publication, 3.23%
Transboundary and Emerging Diseases: 1 publication, 3.23%
Movement Disorders: 1 publication, 3.23%

Publishers

Springer Nature: 7 publications, 22.58%
Elsevier: 6 publications, 19.35%
MDPI: 4 publications, 12.9%
Pensoft Publishers: 4 publications, 12.9%
Oxford University Press: 2 publications, 6.45%
Frontiers Media S.A.: 2 publications, 6.45%
Taylor & Francis: 2 publications, 6.45%
Cold Spring Harbor Laboratory: 2 publications, 6.45%
Wiley: 2 publications, 6.45%
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

GOST
Bu C. et al. GenBase: A Nucleotide Sequence Database // Genomics, Proteomics and Bioinformatics. 2024. Vol. 22. No. 3.
GOST, all authors (up to 50)
Bu C., Zheng X., Zhao X., Xu T., Bai X., Jia Y., Chen M., Hao L., Xiao J., Zhang Z., Zhao W., Tang B., Bao Y. GenBase: A Nucleotide Sequence Database // Genomics, Proteomics and Bioinformatics. 2024. Vol. 22. No. 3.
RIS
TY - JOUR
DO - 10.1093/gpbjnl/qzae047
UR - https://academic.oup.com/gpb/advance-article/doi/10.1093/gpbjnl/qzae047/7698051
TI - GenBase: A Nucleotide Sequence Database
T2 - Genomics, Proteomics and Bioinformatics
AU - Bu, Congfan
AU - Zheng, Xinchang
AU - Zhao, Xuetong
AU - Xu, Tianyi
AU - Bai, Xue
AU - Jia, Yaokai
AU - Chen, Mei-Li
AU - Hao, Lili
AU - Xiao, Jingfa
AU - Zhang, Zhang
AU - Zhao, Wenming
AU - Tang, Bixia
AU - Bao, Yiming
PY - 2024
DA - 2024/06/24
PB - Oxford University Press
IS - 3
VL - 22
PMID - 38913867
SN - 1672-0229
ER -
BibTeX (up to 50 authors)
@article{2024_Bu,
author = {Congfan Bu and Xinchang Zheng and Xuetong Zhao and Tianyi Xu and Xue Bai and Yaokai Jia and Mei-Li Chen and Lili Hao and Jingfa Xiao and Zhang Zhang and Wenming Zhao and Bixia Tang and Yiming Bao},
title = {GenBase: A Nucleotide Sequence Database},
journal = {Genomics, Proteomics and Bioinformatics},
year = {2024},
volume = {22},
publisher = {Oxford University Press},
month = {jun},
url = {https://academic.oup.com/gpb/advance-article/doi/10.1093/gpbjnl/qzae047/7698051},
number = {3},
doi = {10.1093/gpbjnl/qzae047}
}