Table 4 p-values for the Wilcoxon rank-sum test with Bonferroni correction, reported in NEJM format, and Cohen’s d effect sizes with their interpretation, for hill climbing versus simulated annealing run on the Table 3 dataset

From: Euler diagrams drawn with ellipses area-proportionally (Edeap)

                        Area difference   Objective function   Evaluated solutions   Time
p-values                0.71              0.63                 < .001 (***)          < .001 (***)
Cohen’s d effect size                                          1 (large)             1.03 (large)
  1. Following the New England Journal of Medicine (NEJM) practice [28], we treat p-values below 0.05 as statistically significant and mark them with one asterisk (*), p-values below 0.01 with two asterisks (**), and p-values below 0.001 with three asterisks (***).
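
The reporting conventions used in this table can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' analysis code: the function names are hypothetical, the Bonferroni step is the standard multiply-by-the-number-of-tests adjustment, Cohen's d is computed with a pooled standard deviation, and the interpretation labels assume Cohen's conventional cutoffs of 0.2, 0.5 and 0.8. The source does not state which variant of any of these was used.

```python
from statistics import mean, variance


def bonferroni(p_values):
    """Bonferroni adjustment: multiply each p-value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def format_p(p):
    """Format a p-value with the asterisk marks described in the footnote."""
    if p < 0.001:
        return "< .001 (***)"
    if p < 0.01:
        return f"{p:.3f} (**)"
    if p < 0.05:
        return f"{p:.2f} (*)"
    return f"{p:.2f}"  # not significant, no asterisk


def cohens_d(a, b):
    """Cohen's d with a pooled standard deviation (assumed variant)."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / pooled_var ** 0.5


def interpret_d(d):
    """Label an effect size using Cohen's conventional cutoffs (assumed here)."""
    size = abs(d)
    if size >= 0.8:
        return "large"
    if size >= 0.5:
        return "medium"
    if size >= 0.2:
        return "small"
    return "negligible"
```

Applied to the values in the table, `format_p(0.71)` gives `0.71` with no asterisk, and `interpret_d(1.03)` gives `large`, matching the entries shown.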