From: Critical assessment of on-premise approaches to scalable genome analysis
Storage use (GB) | Annotations | Notes | |
---|---|---|---|
Original VCF | 93.77 | No | – |
BCFtools(VCF) | 16.75 | No | bgzip + csi file |
BCFtools(BCF) | 19.60 | No | bgzip + csi file |
SnpSift | 16.75 | No | Same as BCFtools(VCF) |
GEMINI | 119.48 | Yes | SQLite file size |
Hail | 17.55 | No | Matrix table folder size |
OpenCGA | 103.27 | Yes | MongoDB collection size |