Figure 5 | BMC Bioinformatics

Figure 5

From: FIGG: Simulating populations of whole genome sequences for heterogeneous data analyses

MapReduce framework. MapReduce provides a general framework for processing partitionable data. The Map phase either gathers metadata statistics on a sequence fragment and writes them to HBase (Job 1) or applies the variation frequencies and rules to a fragment (Job 2). The Reduce phase, if one is specified, either assembles the mutated fragments into FASTA-formatted chromosome files (Job 3) or simply outputs additional metadata to HBase for use in other processing tasks.
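
The mutate-then-assemble flow described in the caption (Jobs 2 and 3) can be illustrated with a minimal, in-memory sketch in plain Python. It is not FIGG's implementation and uses no Hadoop or HBase APIs. The fragment layout `(chromosome, start, sequence)`, the `snps` lookup table, and the function names `map_fragment`, `reduce_chromosome`, and `run_job` are all assumptions made for illustration, and simple point substitutions stand in for the paper's variation frequencies and rules.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical record layout: (chromosome name, start offset, sequence).
# This is an assumption for illustration, not FIGG's actual format.
Fragment = Tuple[str, int, str]
# Hypothetical variation table: (chromosome, absolute position) -> new base.
SnpTable = Dict[Tuple[str, int], str]


def map_fragment(fragment: Fragment, snps: SnpTable) -> Tuple[str, Tuple[int, str]]:
    """Map step: apply point substitutions that fall inside one fragment."""
    chrom, start, seq = fragment
    bases = list(seq)
    for i in range(len(bases)):
        new_base = snps.get((chrom, start + i))
        if new_base is not None:
            bases[i] = new_base
    # Key by chromosome so all fragments of one chromosome reach one reducer.
    return chrom, (start, "".join(bases))


def reduce_chromosome(chrom: str, parts: List[Tuple[int, str]], width: int = 60) -> str:
    """Reduce step: order fragments by offset and emit one FASTA record."""
    sequence = "".join(seq for _, seq in sorted(parts))
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + chrom + "\n" + "\n".join(lines) + "\n"


def run_job(fragments: Iterable[Fragment], snps: SnpTable) -> Dict[str, str]:
    """Run map, group by key (the shuffle), then reduce, all in memory."""
    grouped: Dict[str, List[Tuple[int, str]]] = defaultdict(list)
    for fragment in fragments:
        chrom, value = map_fragment(fragment, snps)
        grouped[chrom].append(value)
    return {chrom: reduce_chromosome(chrom, parts) for chrom, parts in grouped.items()}


if __name__ == "__main__":
    fragments = [("chr1", 4, "GGTT"), ("chr1", 0, "ACGT"), ("chr2", 0, "AAAA")]
    snps = {("chr1", 1): "T", ("chr1", 5): "A"}
    for fasta in run_job(fragments, snps).values():
        print(fasta, end="")
```

Because each fragment is mutated independently in the map step, fragments may arrive in any order; the reduce step restores the original order by sorting on the start offset before writing the FASTA record.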
