Figure 1 | BMC Bioinformatics

Figure 1

From: TagDust2: a generic method to extract reads from sequencing data

Overview of the TagDust2 workflow. 1) A user specifies the expected read architecture as a sequence of pre-defined blocks; here there are four such blocks. 2) An HMM is constructed by concatenating the pre-defined blocks in the order given by the -1 … command-line options. For example, -2 B:GTA,AAC is translated into the second (red) part of the HMM and models the presence of two mutually exclusive barcode sequences. 3) Reads are scanned with the HMM, and each nucleotide is labelled with the block it belongs to. In the example shown, the three-letter barcode GTA is recognised in the raw sequence. 4) Based on this labelling, a barcode is assigned to each read and the remaining sequences are trimmed.
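The barcode step in the caption can be illustrated with a minimal Python sketch. It is not TagDust2's implementation: exact string matching at a fixed offset stands in for the HMM labelling, and the helper names `parse_block` and `assign_barcode`, the `offset` parameter, and the example read are all hypothetical. Only the block specification `B:GTA,AAC` is taken from the caption.

```python
def parse_block(spec):
    """Split a block specification such as 'B:GTA,AAC' into (type, sequences)."""
    block_type, _, payload = spec.partition(":")
    return block_type, payload.split(",") if payload else []


def assign_barcode(read, barcodes, offset=0):
    """Return (barcode, remaining_read), or (None, read) if no barcode matches.

    Assumption: exact matching at a fixed offset replaces the HMM labelling,
    so sequencing errors and variable-length blocks are not handled.
    """
    for barcode in barcodes:
        if read.startswith(barcode, offset):
            # Trim everything up to and including the barcode.
            return barcode, read[offset + len(barcode):]
    return None, read


# Hypothetical read that begins with the barcode GTA from the caption.
block_type, barcodes = parse_block("B:GTA,AAC")
print(block_type, barcodes)
print(assign_barcode("GTAACGTTAGC", barcodes))
```

Because the two barcodes are mutually exclusive, the first match decides the assignment. A read that matches neither barcode is returned untrimmed with `None` as its barcode.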
