Skip to main content

Table 4 Number of sequence errors identified before and after correction of 603 primate sequences

From: Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes

Error type

No. of errors before correction

No. of errors after correction

Difference

Internal insertion

340

132

− 208

Internal deletion

816

785

− 31

Mismatch

833

263

− 570

N-terminal insertion

32

32

0

N-terminal deletion

80

90

 + 10

C-terminal insertion

29

26

− 4

C-terminal deletion

54

52

− 2

TOTAL

2184

1379

− 805

Mean % identity

91.9%

95.0%

 + 3.1%

Mean % coverage

89.5%

90.7%

 + 1.2%