
Table 1 The statistics of datasets employed in this study

From: SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models

 

| Dataset | Number of positive motifs | Number of negative motifs | Total |
|---|---|---|---|
| Training dataset | 1019 | 7937 | 8956 |
| Independent test dataset | 216 | 1412 | 1628 |
| Total | 1235 | 9349 | 10,584 |