From: A hybrid computational strategy to address WGS variant analysis in >5000 samples
Stage | # of core hours | Time (days) | Data (TB) generated | Data (TB) (upload/ download) | Median execution time /unit | # of parallel execution threads | Optimal instance (core/mem) |
---|---|---|---|---|---|---|---|
Slicing /Repack | ~48Â k/~26Â k | 5 | 360 | 180/0 | ~1.15Â h/sample (slicing)/~22Â h/bin (repacking) | 5297 (slicing) 300 (repacking) | 8 cores, 16GB/ 4 cores, 4GB |
Calling | ~1.4 million | 14 | 120 | 0/2 | ~60Â hrs/bin | 2797 | 8 cores, 16GB |
Genotype Likelihood | ~15.6Â k | 1 | 6 | 0/2 | ~1.5Â h/BAM | 5297 | 2 cores, 8GB |
Imputation and Phasing | ~3.7 million | 30 | 2 | 2/2 | ~30Â h/bin @ Rhea ~13Â h/bin @ Blue BioU | 265Â k | 32 cores |
Total | ~5.2 million | 50 | 488 | 182/6 | -- | -- | -- |