Skip to main content

Table 2 Number of protein sequence errors detected in Uniprot primate sequences, for each of the error types

From: Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes

Primate

N-terminal extension

N-terminal deletion

C-terminal extension

C-terminal deletion

Internal insertion

Internal deletion

Mismatched segment

Total errors

Callithrix jacchus

1427

398

532

274

1315

1593

694

6233

Chlorocebus Sabaeus

914

1870

480

766

1840

2901

992

9763

Gorilla gorilla gorilla

820

1104

389

392

828

3211

1043

7787

Macaca fascicularis

918

667

448

295

964

2016

703

6011

Macaca mulatta

1657

402

661

235

1702

1403

561

6621

Nomascus leucogenys

757

1446

478

554

1186

6744

2641

13,806

Otolemur garnettii

603

1879

271

682

1417

2352

1887

9091

Papio Anubis

1134

434

500

286

1108

2342

1091

6895

Pongo abelii

1183

1673

370

981

1280

4877

802

11,166

Pan troglodytes

867

391

444

227

796

1606

601

4932

TOTAL

10,280

10,264

4573

4692

12,436

29,045

11,015

82,305