BMC Bioinformatics

Table 1 Key statistics of the datasets used during this study

From: Automated annotation of rare-cell types from single-cell RNA-sequencing data through synthetic oversampling

Dataset	Imb. ratio	Minority samples	CV folds	Oversampling nbd
Glial cells	506.94	17	3	3
Prl cardio	26.21	625	10	30
Brain atlas	348.5	624	10	30

The column ’Oversampling nbd’ shows the number of nearest neighbours considered for each minority class data points to generate synthetic samples

Back to article page

ISSN: 1471-2105

Contact us

General enquiries: journalsubmissions@springernature.com