Skip to main content

Table 2 Assembly results. Assembly results for different data sets. Coverage is the fraction of the reference sequence covered by the generated contigs. Results are shown within different levels of minimum contig accuracy (97%, 99%, 99.9%). Minimum length of a contig considered for this analysis was 100 bases. All contig lengths are expressed in kilo bases. L c is the length of contig c whereas S c is its accuracy score. S a is the total assembly score. For further illustration of these terms, refer to the end of the Methods section.

From: Crystallizing short-read assemblies around seeds

Species # of reads (million) # of seeds Coverage (%) N50 length (kb) Max S a
    97 99 99.9 97 99 99.9 L c (S c )  
M. genitallium 2.42 5 96.3 88.9 64.6 45.8 36.7 3.5 86.1 (99.1) 96.1
S. suis 8.36 7 97.8 93.6 86.3 31.5 25.6 9.5 170.7 (95) 95.7
E. coli 19.53 10 98.2 96.1 88.1 41.8 32.4 12 165 (97.4) 97.1