Skip to main content

Table 1 Statistics of the datasets that are used in this study

From: SigUNet: signal peptide recognition based on semantic segmentation

Organism

Signal Peptides

Transmembrane

Cytosolic or Nuclear

Total

Train

Comp

Train

Comp

Train

Comp

SignalP

 Eukaryotes

1640

606

987

939

5133

1000

7760

 Gram-positive

208

48

117

117

360

213

685

 Gram-negative

423

104

523

523

912

260

1858

SPDS17

 Eukaryotes

46

323

689

1058

 Gram-positive

9

189

240

438

 Gram-negative

23

89

99

211

  1. The SignalP dataset is from the UniProtKB/Swiss-Prot in accordance with the identity list in Pertersen et al.’s study [12]; The SPDS17 dataset is from the UniProtKB/Swiss-Prot in accordance with the identity list in Savojardo et al.’s study [6].