Skip to main content

Table 3 Datasets of protein subcellular localization*

From: ProtPlat: an efficient pre-training platform for protein classification based on FastText

Dataset

cy

mi

nu

Sp

Total

Animals_train

302

153

803

632

1890

Animals_test

137

35

363

172

707

Fungi_train

181

177

589

72

1019

Fungi_test

30

11

122

16

179

Plants_train

52

57

60

35

204

Plants_test

6

10

61

6

83

  1. *cy denotes cytoplasm, mi denotes mitochondrion, nu denotes nucleus, and sp denotes secretory pathway