Table 2 Breakdown of the (a) virus protein dataset and (b) plant protein dataset

From: mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines

(a) Viral protein dataset

| Label | Subcellular location | No. of locative proteins |
|---|---|---|
| 1 | Viral capsid | 8 |
| 2 | Host cell membrane | 33 |
| 3 | Host endoplasmic reticulum | 20 |
| 4 | Host cytoplasm | 87 |
| 5 | Host nucleus | 84 |
| 6 | Secreted | 20 |
| | Total number of locative proteins ($N_{\text{loc}}^{v}$) | 252 |
| | Total number of actual proteins ($N_{\text{act}}^{v}$) | 207 |

(b) Plant protein dataset

| Label | Subcellular location | No. of locative proteins |
|---|---|---|
| 1 | Cell membrane | 56 |
| 2 | Cell wall | 32 |
| 3 | Chloroplast | 286 |
| 4 | Cytoplasm | 182 |
| 5 | Endoplasmic reticulum | 42 |
| 6 | Extracellular | 22 |
| 7 | Golgi apparatus | 21 |
| 8 | Mitochondrion | 150 |
| 9 | Nucleus | 152 |
| 10 | Peroxisome | 21 |
| 11 | Plastid | 39 |
| 12 | Vacuole | 52 |
| | Total number of locative proteins ($N_{\text{loc}}^{p}$) | 1055 |
| | Total number of actual proteins ($N_{\text{act}}^{p}$) | 978 |
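
The totals in Table 2 can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper. It assumes the usual reading of "locative protein", in which a protein found in k locations is counted k times, so the locative total can exceed the number of actual proteins. The variable and function names (`virus_counts`, `plant_counts`, `summarize`) are hypothetical; the numbers are copied from the table.

```python
# Per-location counts of locative proteins, copied from Table 2.
virus_counts = {
    "Viral capsid": 8,
    "Host cell membrane": 33,
    "Host endoplasmic reticulum": 20,
    "Host cytoplasm": 87,
    "Host nucleus": 84,
    "Secreted": 20,
}
plant_counts = {
    "Cell membrane": 56,
    "Cell wall": 32,
    "Chloroplast": 286,
    "Cytoplasm": 182,
    "Endoplasmic reticulum": 42,
    "Extracellular": 22,
    "Golgi apparatus": 21,
    "Mitochondrion": 150,
    "Nucleus": 152,
    "Peroxisome": 21,
    "Plastid": 39,
    "Vacuole": 52,
}

# Totals of actual (distinct) proteins, as reported in Table 2.
virus_actual = 207
plant_actual = 978


def summarize(counts, n_actual):
    """Sum the per-location counts and relate them to the actual-protein total.

    Assumes a protein with k locations contributes k to the locative total.
    """
    n_locative = sum(counts.values())
    return {
        "n_locative": n_locative,
        "n_actual": n_actual,
        # Location annotations beyond one per protein (not a count of proteins).
        "extra_labels": n_locative - n_actual,
        # Average number of location labels per actual protein.
        "labels_per_protein": n_locative / n_actual,
    }


virus_summary = summarize(virus_counts, virus_actual)
plant_summary = summarize(plant_counts, plant_actual)
```

Under that assumption, the per-location counts sum to the reported locative totals (252 and 1055), and the average number of labels per protein comes to roughly 1.22 for the virus dataset and 1.08 for the plant dataset. The `extra_labels` value counts surplus annotations, not multi-location proteins, since a single protein with three locations contributes two to it.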