Fig. 4 | BMC Bioinformatics

Fig. 4

From: Random forest versus logistic regression: a large-scale benchmark experiment

Influence of n and p: subsampling experiment based on dataset ID=310. Top: Boxplot of the performance (acc) of RF (dark) and LR (white) for N=50 sub-datasets extracted from the OpenML dataset with ID=310 by randomly picking n′ of its n observations and p′ < p features. Bottom: Boxplot of the differences in performance Δacc = Acc_RF − Acc_LR between RF and LR. p′ ∈ {1,2,3,4,5,6}. n′ ∈ {5e2,1e3,5e3,1e4}. Performance is evaluated through 5-fold cross-validation repeated 2 times.
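
The subsampling design described in the caption can be sketched roughly as follows. This is a minimal illustration in standard-library Python, not the authors' code. The function names (subsample, repeated_kfold, cv_accuracy, delta_acc) and the fit_predict(X_train, y_train, X_test) learner interface are hypothetical, and evaluating both learners on the same folds is an assumption.

```python
import random


def subsample(X, y, n_sub, p_sub, rng):
    """Draw n_sub observations and p_sub features without replacement."""
    rows = rng.sample(range(len(X)), n_sub)
    cols = rng.sample(range(len(X[0])), p_sub)
    X_sub = [[X[i][j] for j in cols] for i in rows]
    y_sub = [y[i] for i in rows]
    return X_sub, y_sub


def repeated_kfold(n, k, repeats, rng):
    """Return (train_idx, test_idx) pairs for k-fold CV repeated `repeats` times."""
    splits = []
    for _ in range(repeats):
        idx = list(range(n))
        rng.shuffle(idx)
        folds = [idx[f::k] for f in range(k)]
        for f in range(k):
            train = [i for g in range(k) if g != f for i in folds[g]]
            splits.append((train, folds[f]))
    return splits


def cv_accuracy(fit_predict, X, y, splits):
    """Mean accuracy of a learner over the given train/test splits."""
    accs = []
    for train, test in splits:
        preds = fit_predict([X[i] for i in train],
                            [y[i] for i in train],
                            [X[i] for i in test])
        correct = sum(p == y[i] for p, i in zip(preds, test))
        accs.append(correct / len(test))
    return sum(accs) / len(accs)


def delta_acc(learner_a, learner_b, X, y, n_sub, p_sub, n_datasets=50, seed=0):
    """Accuracy differences (a minus b) over n_datasets random sub-datasets."""
    rng = random.Random(seed)
    deltas = []
    for _ in range(n_datasets):
        X_sub, y_sub = subsample(X, y, n_sub, p_sub, rng)
        # Assumption: both learners share the same 5-fold x 2 splits.
        splits = repeated_kfold(len(X_sub), 5, 2, rng)
        deltas.append(cv_accuracy(learner_a, X_sub, y_sub, splits)
                      - cv_accuracy(learner_b, X_sub, y_sub, splits))
    return deltas
```

Calling delta_acc once per (n′, p′) pair, with RF and LR wrapped as fit_predict callables, would yield the N=50 differences summarised by one boxplot in the bottom panel.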
