Table 2 Paralog identification by QualitySNP in human UniGene datasets.

From: QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species

 

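| UniGene | No. of clusters (a), D-value ≥ 0.6 | Confirmed, D-value ≥ 0.6 | Unconfirmed, D-value ≥ 0.6 | No. of clusters (b), D-value ≥ 0.9 | Confirmed, D-value ≥ 0.9 | Unconfirmed, D-value ≥ 0.9 |
|---|---|---|---|---|---|---|
| Hs.300701 | 4 | 4 | 0 | 0 | 0 | 0 |
| Hs.533717 | 4 | 0 | 4 | 1 | 0 | 1 |
| Hs.12956 | 3 | 0 | 3 | 1 | 0 | 1 |
| Hs.22543 | 1 | 1 | 0 | 1 | 1 | 0 |
| Hs.468478 | 1 | 1 | 0 | 1 | 1 | 0 |
| Hs.591503 | 1 | 0 | 1 | 0 | 0 | 0 |
| Hs.567284 | 1 | 0 | 1 | 0 | 0 | 0 |
| Hs.510172 | 1 | 0 | 1 | 1 | 0 | 1 |
| Hs.406754 | 10 | 10 | 0 | 4 | 4 | 0 |
| Hs.510635 | 29 | 28 | 1 | 16 | 16 | 0 |
| Hs.61635 | 1 | 1 | 0 | 0 | 0 | 0 |
| Hs.631881 | 1 | 0 | 1 | 0 | 0 | 0 |
| Hs.104741 | 1 | 0 | 1 | 0 | 0 | 0 |
| Hs.534639 | 1 | 1 | 0 | 0 | 0 | 0 |
| Hs.18069 | 3 | 3 | 0 | 1 | 1 | 0 |
| Total | 62 | 49 (79.03%) | 13 (20.97%) | 26 | 23 (88.46%) | 3 (11.54%) |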
(a) Clusters with a D-value of more than 0.6 are considered by QualitySNP to contain paralogs. (b) Clusters with a D-value of more than 0.9 are considered to contain paralogs. Confirmed: number of clusters that were proven to contain paralogous sequences.
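
The Total row can be checked by summing the per-UniGene counts and dividing the confirmed count by the number of clusters at each threshold. The following is a minimal sketch of that arithmetic only; the names `TABLE2` and `summarize` are hypothetical and are not part of QualitySNP, and the counts are copied from the table above.

```python
# Per-UniGene counts copied from Table 2.
# Each value holds (clusters, confirmed, unconfirmed) for the 0.6 threshold,
# followed by the same three counts for the 0.9 threshold.
TABLE2 = {
    "Hs.300701": ((4, 4, 0), (0, 0, 0)),
    "Hs.533717": ((4, 0, 4), (1, 0, 1)),
    "Hs.12956": ((3, 0, 3), (1, 0, 1)),
    "Hs.22543": ((1, 1, 0), (1, 1, 0)),
    "Hs.468478": ((1, 1, 0), (1, 1, 0)),
    "Hs.591503": ((1, 0, 1), (0, 0, 0)),
    "Hs.567284": ((1, 0, 1), (0, 0, 0)),
    "Hs.510172": ((1, 0, 1), (1, 0, 1)),
    "Hs.406754": ((10, 10, 0), (4, 4, 0)),
    "Hs.510635": ((29, 28, 1), (16, 16, 0)),
    "Hs.61635": ((1, 1, 0), (0, 0, 0)),
    "Hs.631881": ((1, 0, 1), (0, 0, 0)),
    "Hs.104741": ((1, 0, 1), (0, 0, 0)),
    "Hs.534639": ((1, 1, 0), (0, 0, 0)),
    "Hs.18069": ((3, 3, 0), (1, 1, 0)),
}


def summarize(threshold_index):
    """Return (clusters, confirmed, unconfirmed, confirmed %) for one threshold.

    threshold_index 0 selects the 0.6 columns, 1 selects the 0.9 columns.
    """
    clusters = sum(row[threshold_index][0] for row in TABLE2.values())
    confirmed = sum(row[threshold_index][1] for row in TABLE2.values())
    unconfirmed = sum(row[threshold_index][2] for row in TABLE2.values())
    confirmed_pct = round(100 * confirmed / clusters, 2)
    return clusters, confirmed, unconfirmed, confirmed_pct


if __name__ == "__main__":
    for index, label in enumerate(("0.6", "0.9")):
        print(label, summarize(index))
```

Under these counts, the confirmed fraction rises from 49 of 62 clusters at the 0.6 threshold to 23 of 26 at the 0.9 threshold, which matches the percentages in the Total row.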