Skip to main content

Table 5 Overall performance of homology-based (PSI-BLAST) prediction for the identification of plastid vs. non-plastid proteins and the classification of diverse plastid-types.

From: Identification and characterization of plastid-type proteins from sequence-attributed features using machine learning

 

No. of sequences

H

C

P

(%)

A

(%)

Phase-I:

     

Plastids

2844

2731

1443

52.84

50.74

Non-plastids

2844

2726

2337

85.73

82.17

Phase-II:

     

Chloroplast

542

483

167

34.58

30.81

Chromoplast

177

172

17

9.88

9.61

Etioplast

220

204

4

1.96

1.82

Amyloplast

232

219

42

19.18

18.10

  1. *at e-value = 0.001; H = Number of total hits; C = Number of correct or true hits; P = Percent of correct hits calculated as (C/H*100); A = Percent accuracy calculated as (C/total number of proteins in a particular class * 100).