Table 4 Comparison of outlier ditags with the matched database sequences.

From: Discarding duplicate ditags in LongSAGE analysis may introduce significant error

| Obs | Pred | Tag structure                              | THC^a   | Gene name          |
|-----|------|--------------------------------------------|---------|--------------------|
| 34  | 970  | CATGGAGCACACCCTGAATCACACCAGAATCACCCTGACATG |         |                    |
|     |      | CATGGAGCACACCCTGAATCAC                     | 2401106 | Carboxypeptidase   |
|     |      |                    C CACCAGAATCACCCTGACATG | 2434341 | Trypsin I          |
| 17  | 289  | CATGGTGTGTGCTGGAGGGTACACCAGAATCACCCTGACATG |         |                    |
|     |      | CATGGTGTGTGCTGGAGGGTAC                     | 2431718 | Elastase IIIA      |
|     |      |                    C CACCAGAATCACCCTGACATG | 2434341 | Trypsin I          |
| 14  | 913  | CATGTCAGGGTGATTCTGGTGAGGAAGCCCACACAGAACATG |         |                    |
|     |      | CATGTCAGGGTGATTCTGGTG G                    | 2434341 | Trypsin I          |
|     |      |                    A AGGAAGCCCACACAGAACATG | 2434342 | Trypsin I          |
| 13  | 641  | CATGACGCTGGACGCTCCAAGCACCAGAATCACCCTGACATG |         |                    |
|     |      | CATGACGCTGGACGCTCCAAGC                     | 2407612 | Colipase           |
|     |      |                    C CACCAGAATCACCCTGACATG | 2434341 | Trypsin I          |
| 9   | 101  | CATGTCAGGGTGATTCTGGTGTGATTGCCGAGCCAGAGCATG |         |                    |
|     |      | CATGTCAGGGTGATTCTGGTG G                    | 2434341 | Trypsin I          |
|     |      |                     GTGATTGCCGAGCCAGAGCATG | 2237360 | Phospholipase A2^b |
| 4   | 127  | CATGTCAGGGTGATTCTGGTGCTGGCGCTTCTGACCATCATG |         |                    |
|     |      | CATGTCAGGGTGATTCTGGTG G                    | 2434341 | Trypsin I          |
|     |      |                     GCTGGCGCTTCTGACCATCATG | 2401106 | Carboxypeptidase^c |
| 4   | 85   | CATGACGCTGGACGCTCCAAGTGATTCAGGGTGTGCTCCATG |         |                    |
|     |      | CATGACGCTGGACGCTCCAAG C                    | 2407612 | Colipase           |
|     |      |                     GTGATTCAGGGTGTGCTCCATG | 2401106 | Carboxypeptidase   |
| 2   | 267  | CATGGAGCACACCCTGAATCAAACAAAGCTGGTCACGCCATG |         |                    |
|     |      | CATGGAGCACACCCTGAATCA C                    | 2401106 | Carboxypeptidase   |
|     |      |                     AAACAAAGCTGGTCACGCCATG | 2254617 | Elastase IIIB      |
| 86  | 8    | CATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATG   |         |                    |
|     |      | CATGACAGTAAGAGAATTATGC                     |         | β-lactamase        |
|     |      |                     GCAGTGCTGCCATAACCATG   |         | Inv. β-lactamase   |

  1. ^a Tentative Human Contig number from The Institute for Genomic Research (TIGR).
  2. ^b The match to Phospholipase A2 is not perfect (GTGATTGCCGAGCCAGAGCAC G).
  3. ^c The tag matches an inverted sequence from carboxypeptidase.
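
Each ditag in the table is listed above the two database tags it matches, one at each end. The relationship can be illustrated with a minimal Python sketch that splits a ditag into its two tags. The helper names `reverse_complement` and `split_ditag` are hypothetical, and the 21-base tag length (CATG plus 17 bases) is an assumption that is consistent with the matched regions in the table but is not stated in it. Because the two tags in a ditag are joined tail to tail, the right-hand tag is read on the opposite strand, so the sketch reverse-complements it.

```python
# Illustrative sketch only: helper names are hypothetical and the
# 21-base tag length is an assumption, not a value given in the table.
from typing import Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")
TAG_LENGTH = 21  # assumed: CATG anchoring site plus 17 bases


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def split_ditag(ditag: str, tag_length: int = TAG_LENGTH) -> Tuple[str, str]:
    """Split a ditag into its left tag and its right tag.

    The right tag is taken from the 3' end of the ditag and
    reverse-complemented so that both tags start with CATG.
    """
    left = ditag[:tag_length]
    right = reverse_complement(ditag[-tag_length:])
    return left, right


# First ditag in the table (Carboxypeptidase / Trypsin I).
ditag = "CATGGAGCACACCCTGAATCACACCAGAATCACCCTGACATG"
left, right = split_ditag(ditag)
print(left)   # CATGGAGCACACCCTGAATCA
print(right)  # CATGTCAGGGTGATTCTGGTG
```

For this ditag, the reverse-complemented right tag is the same 21-base Trypsin I sequence that appears as the left tag of the ditags with 14, 9 and 4 observations, which is why both orientations are annotated as Trypsin I.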