Skip to main content

Table 4 Precision, recall and F 1-scores of predicted clean base pairs for both ProDeGe and acdc on the simulated and benchmark data sets

From: acdc – Automated Contamination Detection and Confidence estimation for single-cell genome data

Data set

Precision

Recall

F 1

Precision

Recall

F 1

 

ProDeGe

ProDeGe

ProDeGe

acdc

acdc

acdc

simulated (kingdom)

No result

No result

No result

1.00

1.00

1.00

simulated (phylum)

No result

No result

No result

0.99

0.98

0.99

simulated (class)

No result

No result

No result

1.00

0.99

0.99

simulated (order)

No result

No result

No result

0.99

0.98

0.99

simulated (family)

No result

No result

No result

1.00

1.00

1.00

simulated (genus)

0.22

0.32

0.22

0.95

0.97

0.96

simulated (species)

0.50

0.33

0.36

0.38

0.77

0.46

benchmark (E.coli)

1.00

0.88

0.93

0.97

0.99

0.98

benchmark (M.ruber)

1.00

0.73

0.83

0.99

0.99

0.99

benchmark (P.heparinus)

1.00

0.70

0.81

1.00

1.00

1.00

  1. Each row contains average values of the given sub data set. Bold values depict the best performing entry. Entries marked as “no result” either produced an empty clean fasta file or did not finish computation