acdc – Automated Contamination Detection and Confidence estimation for single-cell genome data

BMC Bioinformatics

Table 4 Precision, recall and F ₁-scores of predicted clean base pairs for both ProDeGe and acdc on the simulated and benchmark data sets

Data set	Precision	Recall	F ₁	Precision	Recall	F ₁
	ProDeGe	ProDeGe	ProDeGe	acdc	acdc	acdc
simulated (kingdom)	No result	No result	No result	1.00	1.00	1.00
simulated (phylum)	No result	No result	No result	0.99	0.98	0.99
simulated (class)	No result	No result	No result	1.00	0.99	0.99
simulated (order)	No result	No result	No result	0.99	0.98	0.99
simulated (family)	No result	No result	No result	1.00	1.00	1.00
simulated (genus)	0.22	0.32	0.22	0.95	0.97	0.96
simulated (species)	0.50	0.33	0.36	0.38	0.77	0.46
benchmark (E.coli)	1.00	0.88	0.93	0.97	0.99	0.98
benchmark (M.ruber)	1.00	0.73	0.83	0.99	0.99	0.99
benchmark (P.heparinus)	1.00	0.70	0.81	1.00	1.00	1.00

Each row contains average values of the given sub data set. Bold values depict the best performing entry. Entries marked as “no result” either produced an empty clean fasta file or did not finish computation

ISSN: 1471-2105