Skip to main content

Table 1 MisPred analysis of Swiss-Prot entries

From: Identification and correction of abnormal, incomplete and mispredicted proteins in public databases

UniProtKB/Swiss-Prot
Conflict 1 Number of proteins Identified as containing an extracellular domain Percentage Identified as suspicious by MisPred Percentage* False positives Percentage* True errors Percentage* Annotated as fragment or chimera by UniProt Identified as abnormal only by MisPred
Homo sapiens 15638 1431 9.2% 15 1.05% 10 0.70% 5 0.35% 4 1
Mus musculus 13186 1198 9.1% 12 1.00% 7 0.58% 5 0.42% 2 3
Rattus norvegicus 6043 599 9.9% 18 3.01% 2 0.33% 16 2.67% 14 2
Gallus gallus 1635 194 11.9% 22 11.34% 3 1.55% 19 9.79% 12 7
Danio rerio 1290 64 5.0% 4 6.25% 3 4.69% 1 1.56% 1 0
Caenorhabditis elegans 2999 119 4.0% 9 7.56% 1 0.84% 8 6.72% 0 8
Drosophila melanogaster 2463 147 6.0% 5 3.40% 3 2.04% 2 1.36% 1 1
Conflict 2 Number of proteins Identified as containing an extra- and an intracellular domain Percentage Identified as suspicious by MisPred Percentage* False positives Percentage* True errors Percentage* Annotated as fragment or chimera by UniProt Identified as abnormal only by MisPred
Homo sapiens 15638 43 0.3% 8 18.6% 8 18.6% 0 0.0% 0 0
Mus musculus 13186 42 0.3% 6 14.3% 6 14.3% 0 0.0% 0 0
Rattus norvegicus 6043 19 0.3% 2 10.5% 2 10.5% 0 0.0% 0 0
Gallus gallus 1635 10 0.6% 1 10.0% 1 10.0% 0 0.0% 0 0
Danio rerio 2999 2 0.1% 0 0.0% 0 0.0% 0 0.0% 0 0
Caenorhabditis elegans 1290 5 0.4% 1 20.0% 1 20.0% 0 0.0% 0 0
Drosophila melanogaster 2463 8 0.3% 1 12.5% 1 12.5% 0 0.0% 0 0
Conflict 3 Number of proteins    Identified as suspicious by MisPred Percentage* False positives Percentage* True errors Percentage* Annotated as fragment or chimera by UniProt Identified as abnormal only by MisPred
Homo sapiens 15638    0 0.0% 0 0.0% 0 0.0% 0 0
Mus musculus 13186    0 0.0% 0 0.0% 0 0.0% 0 0
Rattus norvegicus 6043    0 0.0% 0 0.0% 0 0.0% 0 0
Gallus gallus 1635    0 0.0% 0 0.0% 0 0.0% 0 0
Danio rerio 2999    0 0.0% 0 0.0% 0 0.0% 0 0
Caenorhabditis elegans 1290    0 0.0% 0 0.0% 0 0.0% 0 0
Drosophila melanogaster 2463    0 0.0% 0 0.0% 0 0.0% 0 0
Conflict 4 Number of proteins Proteins containing domains suitable for the study of domain integrity Percentage Identified as suspicious by MisPred Percentage* False positives Percentage* True errors Percentage* Annotated as fragment or chimera by UniProt Identified as abnormal only by MisPred
Homo sapiens 15638 6973 44.6% 10 0.14% 6 0.09% 4 0.06% 3 1
Mus musculus 13186 5808 44.0% 3 0.05% 2 0.03% 1 0.02% 1 0
Rattus norvegicus 6043 2756 45.6% 14 0.51% 0 0.00% 14 0.51% 13 1
Gallus gallus 1635 755 46.2% 8 1.06% 0 0.00% 8 1.06% 8 0
Danio rerio 1290 355 27.5% 1 0.28% 0 0.00% 1 0.28% 1 0
Caenorhabditis elegans 2999 1215 40.5% 2 0.16% 0 0.00% 2 0.16% 0 2
Drosophila melanogaster 2463 1203 48.8% 0 0.00% 0 0.00% 0 0.00% 0 0
Conflict 5 Number of proteins    Identified as suspicious by MisPred Percentage* False positives Percentage* True errors Percentage* Annotated as fragment or chimera by UniProt Identified as abnormal only by MisPred
Homo sapiens 15638    5 0.03% 3 0.02% 2 0.01% 0 2
Mus musculus 13186    0 0.00% 0 0.00% 0 0.00% 0 0
Rattus norvegicus 6043    5 0.08% 3 0.05% 2 0.03% 0 2
Gallus gallus 1635    0 0.00% 0 0.00% 0 0.00% 0 0
Danio rerio 1290    18 1.40% 18 1.40% 0 0.00% 0 0
Caenorhabditis elegans 2999    0 0.00% 0 0.00% 0 0.00% 0 0
Drosophila melanogaster 2463    0 0.00% 0 0.00% 0 0.00% 0 0
  1. *Values for suspicious, false positive and true positive sequences are expressed as percentage of the proteins relevant for the given conflict.