Fig. 1From: Profile hidden Markov model sequence analysis can help remove putative pseudogenes from DNA barcoding and metabarcoding datasetsOverview of methods to determine COI nuMT characteristics and test methods for nuMT removal. Dataflow for our a artificial DNA barcode dataset, b perturbed community datasets, and c real freshwater COI metabarcode dataset. Abbreviations: BOLD = Barcode of Life Data System; COI = cytochrome c oxidase subunit I mtDNA gene; HMM = hidden Markov model; NCBI = National Centre for Biotechnology Information; nt = nucleotide; ORF = open reading frameBack to article page