Skip to main content

Table 2 Number of protein sequence errors detected in Uniprot primate sequences, for each of the error types

From: Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes

Primate N-terminal extension N-terminal deletion C-terminal extension C-terminal deletion Internal insertion Internal deletion Mismatched segment Total errors
Callithrix jacchus 1427 398 532 274 1315 1593 694 6233
Chlorocebus Sabaeus 914 1870 480 766 1840 2901 992 9763
Gorilla gorilla gorilla 820 1104 389 392 828 3211 1043 7787
Macaca fascicularis 918 667 448 295 964 2016 703 6011
Macaca mulatta 1657 402 661 235 1702 1403 561 6621
Nomascus leucogenys 757 1446 478 554 1186 6744 2641 13,806
Otolemur garnettii 603 1879 271 682 1417 2352 1887 9091
Papio Anubis 1134 434 500 286 1108 2342 1091 6895
Pongo abelii 1183 1673 370 981 1280 4877 802 11,166
Pan troglodytes 867 391 444 227 796 1606 601 4932
TOTAL 10,280 10,264 4573 4692 12,436 29,045 11,015 82,305