Open Access
Efficient COI barcoding using high throughput single-end 400 bp sequencing
Chentao Yang
1
,
Yuxuan Zheng
2
,
Shangjin Tan
1
,
Guanliang Meng
1
,
Wei RAO
1
,
Caiqing Yang
2
,
David G. Bourne
3, 4, 5
,
Paul A. O’Brien
3, 4, 5
,
Junqiang Xu
1
,
Sha Liao
1
,
Ao Chen
1
,
Xiaowei Chen
1
,
Xinrui Jia
2
,
Ai-bing Zhang
2
,
Shanlin Liu
1, 6
1
BGI-shenzhen, Shenzhen, China
|
Publication type: Journal Article
Publication date: 2020-12-04
scimago Q1
wos Q2
SJR: 1.003
CiteScore: 5.9
Impact factor: 3.7
ISSN: 14712164
PubMed ID:
33276723
Genetics
Biotechnology
Abstract
Over the last decade, the rapid development of high-throughput sequencing platforms has accelerated species description and assisted morphological classification through DNA barcoding. However, the current high-throughput DNA barcoding methods cannot obtain full-length barcode sequences due to read length limitations (e.g. a maximum read length of 300 bp for the Illumina’s MiSeq system), or are hindered by a relatively high cost or low sequencing output (e.g. a maximum number of eight million reads per cell for the PacBio’s SEQUEL II system). Pooled cytochrome c oxidase subunit I (COI) barcodes from individual specimens were sequenced on the MGISEQ-2000 platform using the single-end 400 bp (SE400) module. We present a bioinformatic pipeline, HIFI-SE, that takes reads generated from the 5′ and 3′ ends of the COI barcode region and assembles them into full-length barcodes. HIFI-SE is written in Python and includes four function modules of filter, assign, assembly and taxonomy. We applied the HIFI-SE to a set of 845 samples (30 marine invertebrates, 815 insects) and delivered a total of 747 fully assembled COI barcodes as well as 70 Wolbachia and fungi symbionts. Compared to their corresponding Sanger sequences (72 sequences available), nearly all samples (71/72) were correctly and accurately assembled, including 46 samples that had a similarity score of 100% and 25 of ca. 99%. The HIFI-SE pipeline represents an efficient way to produce standard full-length barcodes, while the reasonable cost and high sensitivity of our method can contribute considerably more DNA barcodes under the same budget. Our method thereby advances DNA-based species identification from diverse ecosystems and increases the number of relevant applications.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
1
2
3
|
|
|
Invertebrate Systematics
3 publications, 13.04%
|
|
|
Zootaxa
3 publications, 13.04%
|
|
|
Molecular Ecology
2 publications, 8.7%
|
|
|
Mycological Progress
1 publication, 4.35%
|
|
|
Arthropod Systematics and Phylogeny
1 publication, 4.35%
|
|
|
Molecular Ecology Resources
1 publication, 4.35%
|
|
|
Water (Switzerland)
1 publication, 4.35%
|
|
|
Systematic Biology
1 publication, 4.35%
|
|
|
Zoological Journal of the Linnean Society
1 publication, 4.35%
|
|
|
Diversity
1 publication, 4.35%
|
|
|
Ecotoxicology and Environmental Safety
1 publication, 4.35%
|
|
|
PeerJ
1 publication, 4.35%
|
|
|
Genome Biology and Evolution
1 publication, 4.35%
|
|
|
Biodiversity Data Journal
1 publication, 4.35%
|
|
|
Forensic Science International: Genetics
1 publication, 4.35%
|
|
|
American Naturalist
1 publication, 4.35%
|
|
|
BMC Genomics
1 publication, 4.35%
|
|
|
1
2
3
|
Publishers
|
1
2
3
|
|
|
Springer Nature
3 publications, 13.04%
|
|
|
Wiley
3 publications, 13.04%
|
|
|
CSIRO Publishing
3 publications, 13.04%
|
|
|
Oxford University Press
3 publications, 13.04%
|
|
|
Magnolia Press
3 publications, 13.04%
|
|
|
Pensoft Publishers
2 publications, 8.7%
|
|
|
MDPI
2 publications, 8.7%
|
|
|
Elsevier
2 publications, 8.7%
|
|
|
PeerJ
1 publication, 4.35%
|
|
|
University of Chicago Press
1 publication, 4.35%
|
|
|
1
2
3
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
23
Total citations:
23
Citations from 2024:
10
(43.48%)
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Yang C. et al. Efficient COI barcoding using high throughput single-end 400 bp sequencing // BMC Genomics. 2020. Vol. 21. No. 1. 862
GOST all authors (up to 50)
Copy
Yang C., Zheng Y., Tan S., Meng G., RAO W., Yang C., Bourne D. G., O’Brien P. A., Xu J., Liao S., Chen A., Chen X., Jia X., Zhang A., Liu S. Efficient COI barcoding using high throughput single-end 400 bp sequencing // BMC Genomics. 2020. Vol. 21. No. 1. 862
Cite this
RIS
Copy
TY - JOUR
DO - 10.1186/s12864-020-07255-w
UR - https://doi.org/10.1186/s12864-020-07255-w
TI - Efficient COI barcoding using high throughput single-end 400 bp sequencing
T2 - BMC Genomics
AU - Yang, Chentao
AU - Zheng, Yuxuan
AU - Tan, Shangjin
AU - Meng, Guanliang
AU - RAO, Wei
AU - Yang, Caiqing
AU - Bourne, David G.
AU - O’Brien, Paul A.
AU - Xu, Junqiang
AU - Liao, Sha
AU - Chen, Ao
AU - Chen, Xiaowei
AU - Jia, Xinrui
AU - Zhang, Ai-bing
AU - Liu, Shanlin
PY - 2020
DA - 2020/12/04
PB - Springer Nature
IS - 1
VL - 21
PMID - 33276723
SN - 1471-2164
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2020_Yang,
author = {Chentao Yang and Yuxuan Zheng and Shangjin Tan and Guanliang Meng and Wei RAO and Caiqing Yang and David G. Bourne and Paul A. O’Brien and Junqiang Xu and Sha Liao and Ao Chen and Xiaowei Chen and Xinrui Jia and Ai-bing Zhang and Shanlin Liu},
title = {Efficient COI barcoding using high throughput single-end 400 bp sequencing},
journal = {BMC Genomics},
year = {2020},
volume = {21},
publisher = {Springer Nature},
month = {dec},
url = {https://doi.org/10.1186/s12864-020-07255-w},
number = {1},
pages = {862},
doi = {10.1186/s12864-020-07255-w}
}