Fig. 1From: Critical assessment of on-premise approaches to scalable genome analysisThe general workflow of a genomics data science solution. The input is a VCF file after a variant calling pipeline which could undergo transformation into a storage system. Variants are then annotated with a variety of sources and fed back into the storage. The contents of the VCF file can be queried via a client or a program for later analysisBack to article page