TY - JOUR
T1 - Comprehensive evaluation of non-hybrid genome assembly tools for third-generation PacBio long-read sequence data
AU - Jayakumar, Vasanthan
AU - Sakakibara, Yasubumi
N1 - Funding Information:
This work was supported by JSPS KAKENHI Grant Number 16H06279.
Publisher Copyright:
© 2017 The Author 2017. Published by Oxford University Press.
PY - 2017/11/2
Y1 - 2017/11/2
N2 - Long reads obtained from third-generation sequencing platforms can help overcome the long-standing challenge of the de novo assembly of sequences for the genomic analysis of non-model eukaryotic organisms. Numerous long-read-aided de novo assemblies have been published recently, which exhibited superior quality of the assembled genomes in comparison with those achieved using earlier second-generation sequencing technologies. Evaluating assemblies is important in guiding the appropriate choice for specific research needs. In this study, we evaluated 10 long-read assemblers using a variety of metrics on Pacific Biosciences PacBio) data sets from different taxonomic categories with considerable differences in genome size. The results allowed us to narrow down the list to a few assemblers that can be effectively applied to eukaryotic assembly projects. Moreover, we highlight how best to use limited genomic resources for effectively evaluating the genome assemblies of non-model organisms.
AB - Long reads obtained from third-generation sequencing platforms can help overcome the long-standing challenge of the de novo assembly of sequences for the genomic analysis of non-model eukaryotic organisms. Numerous long-read-aided de novo assemblies have been published recently, which exhibited superior quality of the assembled genomes in comparison with those achieved using earlier second-generation sequencing technologies. Evaluating assemblies is important in guiding the appropriate choice for specific research needs. In this study, we evaluated 10 long-read assemblers using a variety of metrics on Pacific Biosciences PacBio) data sets from different taxonomic categories with considerable differences in genome size. The results allowed us to narrow down the list to a few assemblers that can be effectively applied to eukaryotic assembly projects. Moreover, we highlight how best to use limited genomic resources for effectively evaluating the genome assemblies of non-model organisms.
KW - PacBio SMRT
KW - assembly evaluation
KW - de novo assembly
KW - single-molecule sequencing
KW - third-generation sequencing
UR - http://www.scopus.com/inward/record.url?scp=85056804052&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85056804052&partnerID=8YFLogxK
U2 - 10.1093/bib/bbx147
DO - 10.1093/bib/bbx147
M3 - Article
C2 - 29112696
AN - SCOPUS:85056804052
SN - 1467-5463
VL - 20
SP - 866
EP - 876
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 3
M1 - bbx147
ER -