Table 2 Percentage of problematic and singleton domain sequences.

From: Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint

| Sequence dataset | Transmembrane & problematic (% of domains) | Singleton (% of domains) |
|---|---|---|
| SP-trEMBL | 18.5 | 22.6 |
| integr8_263 | 17.9 | 24.9 |
| A thaliana | 17.5 | 16.0 |
| B anthracis | 20.3 | 8.6 |
| C elegans | 19.8 | 22.1 |
| D melanogaster | 18.7 | 18.7 |
| E coli | 15.7 | 7.3 |
| H sapiens | 15.9 | 20.9 |
| S cerevisiae | 14.9 | 24.7 |
| T maritima | 13.4 | 12.7 |

  1. The percentage of problematic and singleton domain sequences in Swiss-Prot & TrEMBL, 263 completed genomes and eight model genomes. Problematic domains are defined as those containing transmembrane helices or significant regions of low complexity or coiled coil.