Figure 3 | BMC Bioinformatics

Figure 3

From: FIGG: Simulating populations of whole genome sequences for heterogeneous data analyses

FIGG MapReduce jobs. Three discrete MapReduce jobs generate unique whole genome sequences. The first job fragments the reference, or "parent", genome and loads the fragments into the distributed database, HBase. The second job reads all of the parent genome's fragments from the database, mutates them using the provided frequency information, and saves the mutated fragments back to the database to ensure reproducibility. The final job generates FASTA-formatted files, per chromosome, for the mutated genomes.
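
The three stages in the caption can be illustrated with a small in-memory sketch. This is not FIGG's code and uses neither Hadoop nor HBase: a plain Python dict stands in for the database, a single per-base substitution rate stands in for the "provided frequency information", and only point substitutions are modelled. All names here (`fragment_genome`, `mutate_genome`, `to_fasta`, `FRAGMENT_SIZE`, `snv_rate`, `seed`) are hypothetical and chosen for illustration.

```python
import random

FRAGMENT_SIZE = 8  # hypothetical fragment length, tiny for demonstration


def fragment_genome(genome_id, chromosomes, store, size=FRAGMENT_SIZE):
    """Stage 1: split each chromosome into fixed-size fragments and store them."""
    for chrom, seq in chromosomes.items():
        for start in range(0, len(seq), size):
            store[(genome_id, chrom, start)] = seq[start:start + size]


def mutate_genome(parent_id, child_id, store, snv_rate, seed):
    """Stage 2: read parent fragments, substitute bases, save under child_id."""
    rng = random.Random(seed)  # fixed seed so a rerun yields the same genome
    parent_keys = sorted(k for k in store if k[0] == parent_id)
    for _, chrom, start in parent_keys:
        bases = list(store[(parent_id, chrom, start)])
        for i, base in enumerate(bases):
            if rng.random() < snv_rate:
                bases[i] = rng.choice([b for b in "ACGT" if b != base])
        store[(child_id, chrom, start)] = "".join(bases)


def to_fasta(genome_id, store, width=60):
    """Stage 3: reassemble fragments into one FASTA record per chromosome."""
    by_chrom = {}
    for key in sorted(k for k in store if k[0] == genome_id):
        by_chrom.setdefault(key[1], []).append(store[key])
    records = {}
    for chrom, parts in by_chrom.items():
        seq = "".join(parts)
        lines = [seq[i:i + width] for i in range(0, len(seq), width)]
        records[chrom] = f">{genome_id}|{chrom}\n" + "\n".join(lines) + "\n"
    return records


# Toy reference genome with two short "chromosomes" (made-up sequences).
store = {}
reference = {"chr1": "ACGTACGTACGTACGTACGT", "chr2": "TTGACCAGTA"}
fragment_genome("parent", reference, store)
mutate_genome("parent", "child1", store, snv_rate=0.1, seed=42)
fasta = to_fasta("child1", store)
```

Because the mutated fragments are written back to the store under their own genome identifier, the FASTA stage can be rerun at any time without repeating the mutation step, which mirrors the reproducibility argument in the caption.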
