Skip to main content
Fig. 1 | BMC Bioinformatics

Fig. 1

From: BiSpark: a Spark-based highly scalable aligner for bisulfite sequencing data

Fig. 1

Analysis workflow within BiSpark consists of 4 processing phases: (1) Distributing the reads into key-value pairs, (2) Transforming reads into ‘three-letter’ reads and mapping to transformed reference genome, (3) Aggregating mapping results and filtering ambiguous reads, and (4) Profiling the methylation information for each read. The figure depicts the case when library of input data is a non-directional

Back to article page