
Table 1 Library size summary when using segments compared to the reference transcriptome in terms of the total number of sequences, number of sequence bases, and total FASTA file sizes

From: Yanagi: Fast and interpretable segment-based alternative splicing and gene expression analysis

 

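| Genome | Measure | Transcriptome | Segments, L=40 | Segments, L=100 | Segments, L=1000 | Segments, L=10000 |
|---|---|---|---|---|---|---|
| BDGP6 | Number of bases (Mb) | 90 | 39 | 41 | 71 | 90 |
| BDGP6 | Number of sequences | 34,681 | 54,680 | 53,694 | 48,741 | 34,625 |
| BDGP6 | FASTA file size (MB) | 89 | 44 | 47 | 76 | 92 |
| GRCh38 | Number of bases (Mb) | 278 | 147 | 181 | 308 | 281 |
| GRCh38 | Number of sequences | 182,435 | 544,991 | 541,361 | 264,083 | 183,165 |
| GRCh38 | FASTA file size (MB) | 276 | 206 | 239 | 338 | 302 |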

  1. With L=100, segments reduce the number of bases relative to the transcriptome by 54% for fruit fly (BDGP6) and 35% for human (GRCh38).
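
The percentages in the footnote can be checked against the base counts in the table. The following is a minimal sketch of that arithmetic, not code from the paper; the dictionary layout and the helper name `reduction_percent` are illustrative assumptions, and the figures are the rounded values shown in Table 1.

```python
# Base counts taken from Table 1 (transcriptome vs. segments at L=100).
# Both columns use the same unit, so the ratio is unit-independent.
BASES = {
    "BDGP6": {"transcriptome": 90, "segments_L100": 41},
    "GRCh38": {"transcriptome": 278, "segments_L100": 181},
}


def reduction_percent(reference, reduced):
    """Percent by which `reduced` is smaller than `reference` (hypothetical helper)."""
    return 100.0 * (1.0 - reduced / reference)


for name, row in BASES.items():
    pct = reduction_percent(row["transcriptome"], row["segments_L100"])
    print(f"{name}: {pct:.0f}% fewer bases with segments (L=100)")
```

Because the table entries are already rounded, the computed values (about 54.4% and 34.9%) agree with the footnote only to the nearest percent.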