Skip to main content

Table 1 Short read datasets for assembler assessment

From: Parallelized short read assembly of large genomes using de Bruijn graphs

  Bacillus Bordetella E.coli Yoruban male
library 160bp 198bp* 200bp 200bp
read length 36 36 36 36~42
no. of reads 16,633,474 12,549,138 20,816,448 3,758,659,514
coverage 142× 111× 162× 44×
genome size 4,215,606 4,086,189 4,639,675 3,101,788,170**
  1. * uses an estimated insert size from assembly due to the unavailability of the real library insert size; ** uses the total length of all scaffolds in the GRCh37/hg19 build human reference sequence.