
Table 1 Key statistics of the datasets used during this study

From: Automated annotation of rare-cell types from single-cell RNA-sequencing data through synthetic oversampling

| Dataset | Imbalance ratio | Minority samples | CV folds | Oversampling nbd |
| --- | --- | --- | --- | --- |
| Glial cells | 506.94 | 17 | 3 | 3 |
| Prl cardio | 26.21 | 625 | 10 | 30 |
| Brain atlas | 348.5 | 624 | 10 | 30 |

  1. The column 'Oversampling nbd' shows the number of nearest neighbours considered for each minority-class data point when generating synthetic samples.
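
To illustrate what a neighbourhood size means in practice, the following is a minimal sketch of neighbourhood-based oversampling. It assumes a plain SMOTE-style interpolation between a minority point and one of its k nearest minority neighbours, which may differ from the oversampling algorithm actually used in the study. The function name `oversample_minority`, its parameters, and the random toy data are all hypothetical; only the values 17 (minority samples) and 3 (neighbourhood size) are taken from the Glial cells row.

```python
import numpy as np


def oversample_minority(X_min, k, n_new, seed=0):
    """SMOTE-style sketch (an assumption, not the study's exact method).

    Each synthetic sample is a random convex combination of a minority
    point and one of its k nearest minority-class neighbours.
    """
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = X_min.shape[0]
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < number of minority samples")

    # Pairwise Euclidean distances between minority samples.
    diff = X_min[:, None, :] - X_min[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # a point is not its own neighbour

    # Indices of the k nearest neighbours of every minority sample.
    neighbours = np.argsort(dist, axis=1)[:, :k]

    # Pick a base point and one of its neighbours for each new sample.
    base = rng.integers(0, n, size=n_new)
    pick = neighbours[base, rng.integers(0, k, size=n_new)]

    # Interpolate at a random position on the segment base -> neighbour.
    gap = rng.random((n_new, 1))
    return X_min[base] + gap * (X_min[pick] - X_min[base])


# Toy data: 17 minority samples with 5 features, neighbourhood size 3.
toy_rng = np.random.default_rng(42)
X_min = toy_rng.normal(size=(17, 5))
synthetic = oversample_minority(X_min, k=3, n_new=50)
print(synthetic.shape)  # (50, 5)
```

A smaller neighbourhood keeps synthetic samples close to the few real minority points, which is one plausible reason to use a small value when only 17 minority samples are available.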