Table 1 MisPred analysis of Swiss-Prot entries

From: Identification and correction of abnormal, incomplete and mispredicted proteins in public databases

UniProtKB/Swiss-Prot

| Conflict 1 | Number of proteins | Identified as containing an extracellular domain | Percentage | Identified as suspicious by MisPred | Percentage* | False positives | Percentage* | True errors | Percentage* | Annotated as fragment or chimera by UniProt | Identified as abnormal only by MisPred |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Homo sapiens | 15638 | 1431 | 9.2% | 15 | 1.05% | 10 | 0.70% | 5 | 0.35% | 4 | 1 |
| Mus musculus | 13186 | 1198 | 9.1% | 12 | 1.00% | 7 | 0.58% | 5 | 0.42% | 2 | 3 |
| Rattus norvegicus | 6043 | 599 | 9.9% | 18 | 3.01% | 2 | 0.33% | 16 | 2.67% | 14 | 2 |
| Gallus gallus | 1635 | 194 | 11.9% | 22 | 11.34% | 3 | 1.55% | 19 | 9.79% | 12 | 7 |
| Danio rerio | 1290 | 64 | 5.0% | 4 | 6.25% | 3 | 4.69% | 1 | 1.56% | 1 | 0 |
| Caenorhabditis elegans | 2999 | 119 | 4.0% | 9 | 7.56% | 1 | 0.84% | 8 | 6.72% | 0 | 8 |
| Drosophila melanogaster | 2463 | 147 | 6.0% | 5 | 3.40% | 3 | 2.04% | 2 | 1.36% | 1 | 1 |

| Conflict 2 | Number of proteins | Identified as containing an extra- and an intracellular domain | Percentage | Identified as suspicious by MisPred | Percentage* | False positives | Percentage* | True errors | Percentage* | Annotated as fragment or chimera by UniProt | Identified as abnormal only by MisPred |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Homo sapiens | 15638 | 43 | 0.3% | 8 | 18.6% | 8 | 18.6% | 0 | 0.0% | 0 | 0 |
| Mus musculus | 13186 | 42 | 0.3% | 6 | 14.3% | 6 | 14.3% | 0 | 0.0% | 0 | 0 |
| Rattus norvegicus | 6043 | 19 | 0.3% | 2 | 10.5% | 2 | 10.5% | 0 | 0.0% | 0 | 0 |
| Gallus gallus | 1635 | 10 | 0.6% | 1 | 10.0% | 1 | 10.0% | 0 | 0.0% | 0 | 0 |
| Danio rerio | 2999 | 2 | 0.1% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Caenorhabditis elegans | 1290 | 5 | 0.4% | 1 | 20.0% | 1 | 20.0% | 0 | 0.0% | 0 | 0 |
| Drosophila melanogaster | 2463 | 8 | 0.3% | 1 | 12.5% | 1 | 12.5% | 0 | 0.0% | 0 | 0 |

| Conflict 3 | Number of proteins | Identified as suspicious by MisPred | Percentage* | False positives | Percentage* | True errors | Percentage* | Annotated as fragment or chimera by UniProt | Identified as abnormal only by MisPred |
|---|---|---|---|---|---|---|---|---|---|
| Homo sapiens | 15638 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Mus musculus | 13186 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Rattus norvegicus | 6043 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Gallus gallus | 1635 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Danio rerio | 2999 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Caenorhabditis elegans | 1290 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |
| Drosophila melanogaster | 2463 | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0 |

| Conflict 4 | Number of proteins | Proteins containing domains suitable for the study of domain integrity | Percentage | Identified as suspicious by MisPred | Percentage* | False positives | Percentage* | True errors | Percentage* | Annotated as fragment or chimera by UniProt | Identified as abnormal only by MisPred |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Homo sapiens | 15638 | 6973 | 44.6% | 10 | 0.14% | 6 | 0.09% | 4 | 0.06% | 3 | 1 |
| Mus musculus | 13186 | 5808 | 44.0% | 3 | 0.05% | 2 | 0.03% | 1 | 0.02% | 1 | 0 |
| Rattus norvegicus | 6043 | 2756 | 45.6% | 14 | 0.51% | 0 | 0.00% | 14 | 0.51% | 13 | 1 |
| Gallus gallus | 1635 | 755 | 46.2% | 8 | 1.06% | 0 | 0.00% | 8 | 1.06% | 8 | 0 |
| Danio rerio | 1290 | 355 | 27.5% | 1 | 0.28% | 0 | 0.00% | 1 | 0.28% | 1 | 0 |
| Caenorhabditis elegans | 2999 | 1215 | 40.5% | 2 | 0.16% | 0 | 0.00% | 2 | 0.16% | 0 | 2 |
| Drosophila melanogaster | 2463 | 1203 | 48.8% | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% | 0 | 0 |

| Conflict 5 | Number of proteins | Identified as suspicious by MisPred | Percentage* | False positives | Percentage* | True errors | Percentage* | Annotated as fragment or chimera by UniProt | Identified as abnormal only by MisPred |
|---|---|---|---|---|---|---|---|---|---|
| Homo sapiens | 15638 | 5 | 0.03% | 3 | 0.02% | 2 | 0.01% | 0 | 2 |
| Mus musculus | 13186 | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% | 0 | 0 |
| Rattus norvegicus | 6043 | 5 | 0.08% | 3 | 0.05% | 2 | 0.03% | 0 | 2 |
| Gallus gallus | 1635 | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% | 0 | 0 |
| Danio rerio | 1290 | 18 | 1.40% | 18 | 1.40% | 0 | 0.00% | 0 | 0 |
| Caenorhabditis elegans | 2999 | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% | 0 | 0 |
| Drosophila melanogaster | 2463 | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% | 0 | 0 |

*Values for suspicious, false positive and true error (true positive) sequences are expressed as a percentage of the proteins relevant to the given conflict.
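
The footnote's percentage convention can be checked with a short calculation. The sketch below is illustrative only and is not the authors' code: the `Row` class, its field names and the `pct` helper are hypothetical. The two rows are copied from the Conflict 1 block of Table 1 (Homo sapiens and Gallus gallus). It assumes that "proteins relevant for the given conflict" means the count of proteins identified as containing an extracellular domain, which reproduces the published Conflict 1 values for these rows.

```python
from dataclasses import dataclass


@dataclass
class Row:
    """One species row of the Conflict 1 block (field names are hypothetical)."""
    species: str
    total: int            # Number of proteins
    relevant: int         # Identified as containing an extracellular domain
    suspicious: int       # Identified as suspicious by MisPred
    false_positives: int
    true_errors: int
    uniprot_flagged: int  # Annotated as fragment or chimera by UniProt
    mispred_only: int     # Identified as abnormal only by MisPred


def pct(count: int, denominator: int, digits: int) -> float:
    """Return count/denominator as a percentage rounded to `digits` places."""
    if denominator == 0:
        return 0.0
    return round(100.0 * count / denominator, digits)


# Values copied from Table 1, Conflict 1.
rows = [
    Row("Homo sapiens", 15638, 1431, 15, 10, 5, 4, 1),
    Row("Gallus gallus", 1635, 194, 22, 3, 19, 12, 7),
]

for r in rows:
    # In these rows the counts partition cleanly (an observation from the
    # data, not a rule stated in the source).
    assert r.suspicious == r.false_positives + r.true_errors
    assert r.true_errors == r.uniprot_flagged + r.mispred_only

    print(
        r.species,
        pct(r.relevant, r.total, 1),               # unstarred "Percentage"
        pct(r.suspicious, r.relevant, 2),          # starred, per footnote
        pct(r.false_positives, r.relevant, 2),
        pct(r.true_errors, r.relevant, 2),
    )
```

The unstarred "Percentage" column is computed against the total number of proteins, while the starred columns use the conflict-relevant subset as the denominator. For Conflicts 3 and 5, which list no subset column, the starred values in the table are consistent with the total number of proteins being used as the denominator.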