Table 1 Median prediction accuracy in the 15 × 5,000 curated Helianthus datasets

From: Reliable genomic strategies for species classification of plant genetic resources

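| Accessions per species | Misclassification rate (%) | RF | NB | NJ | 1-NN | 3-NN |
|---|---|---|---|---|---|---|
| 2 | 6.25 | 0.63 | 0.88 | 0.81 | 0.81 | 0.75 |
| 2 | 12.50 | 0.56 | 0.75 | 0.75 | 0.75 | 0.63 |
| 2 | 18.75 | 0.50 | 0.69 | 0.69 | 0.69 | 0.63 |
| 4 | 6.25 | 0.84 | 0.97 | 0.91 | 0.78 | 0.97 |
| 4 | 12.50 | 0.81 | 0.88 | 0.84 | 0.75 | 0.94 |
| 4 | 18.75 | 0.78 | 0.81 | 0.78 | 0.69 | 0.88 |
| 6 | 6.25 | 0.96 | 0.96 | 0.96 | 0.94 | 0.96 |
| 6 | 12.50 | 0.94 | 0.85 | 0.92 | 0.88 | 0.94 |
| 6 | 18.75 | 0.90 | 0.77 | 0.88 | 0.81 | 0.88 |
| 8 | 6.25 | 0.95 | 0.88 | 0.94 | 0.83 | 0.98 |
| 8 | 12.50 | 0.94 | 0.81 | 0.89 | 0.78 | 0.95 |
| 8 | 18.75 | 0.92 | 0.70 | 0.84 | 0.72 | 0.91 |
| 10 | 6.25 | 0.98 | 0.88 | 0.93 | 0.91 | 0.98 |
| 10 | 12.50 | 0.96 | 0.75 | 0.89 | 0.85 | 0.94 |
| 10 | 18.75 | 0.96 | 0.65 | 0.85 | 0.79 | 0.89 |
| Median | | 0.92 | 0.81 | 0.88 | 0.79 | 0.94 |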

Classifiers are Random Forest (RF), Naive Bayes (NB), Neighbour-Joining (NJ), 1-Nearest Neighbour (1-NN), and 3-Nearest Neighbours (3-NN). For each parameter combination, the highest median score is presented in bold.
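
The Median row appears to be the column-wise median over the 15 parameter combinations (5 accession counts × 3 misclassification rates). The following minimal Python sketch, with values transcribed from Table 1, recomputes it under that assumption; the names `accuracy` and `column_medians` are illustrative and not taken from the article.

```python
from statistics import median

# Median accuracy per classifier, transcribed from Table 1.
# Row order: 2, 4, 6, 8, 10 accessions per species, each at
# misclassification rates of 6.25, 12.50 and 18.75 %.
accuracy = {
    "RF":   [0.63, 0.56, 0.50, 0.84, 0.81, 0.78, 0.96, 0.94, 0.90,
             0.95, 0.94, 0.92, 0.98, 0.96, 0.96],
    "NB":   [0.88, 0.75, 0.69, 0.97, 0.88, 0.81, 0.96, 0.85, 0.77,
             0.88, 0.81, 0.70, 0.88, 0.75, 0.65],
    "NJ":   [0.81, 0.75, 0.69, 0.91, 0.84, 0.78, 0.96, 0.92, 0.88,
             0.94, 0.89, 0.84, 0.93, 0.89, 0.85],
    "1-NN": [0.81, 0.75, 0.69, 0.78, 0.75, 0.69, 0.94, 0.88, 0.81,
             0.83, 0.78, 0.72, 0.91, 0.85, 0.79],
    "3-NN": [0.75, 0.63, 0.63, 0.97, 0.94, 0.88, 0.96, 0.94, 0.88,
             0.98, 0.95, 0.91, 0.98, 0.94, 0.89],
}

# Assumption: the table's Median row is the median of each column.
column_medians = {name: round(median(values), 2)
                  for name, values in accuracy.items()}

for name, value in column_medians.items():
    print(f"{name}: {value:.2f}")
```

With 15 values per classifier, the median is simply the eighth value in sorted order, so no interpolation is involved.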