Table 2 Relevant sequence features obtained in the preprocessing stage

From: ADS-HCSpark: A scalable HaplotypeCaller leveraging adaptive data segmentation to accelerate variant calling on Spark

| Sequence feature | Comment |
| --- | --- |
| Index ID | Index number of the data block |
| Interval | Interval length spanned by all alignment sequences in the data block |
| Record Num | Number of alignment sequences in the data block |
| CIGAR_I | Sum of the insertion lengths of all alignment sequences in the data block |
| CIGAR_D | Sum of the deletion lengths of all alignment sequences in the data block |
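
As an illustration of how the features in this table could be derived, the following is a minimal Python sketch and not the ADS-HCSpark implementation. It assumes each alignment record is given as a hypothetical `(start_position, cigar_string)` pair, and it assumes that Interval means the distance from the smallest start position to the largest end position in the block, which the table does not spell out. The names `BlockFeatures` and `block_features` are invented for this example.

```python
import re
from dataclasses import dataclass
from typing import Iterable, Tuple

# One CIGAR operation: a length followed by an operation code (SAM format).
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

# CIGAR operations that consume reference bases.
REFERENCE_OPS = "MDN=X"


@dataclass
class BlockFeatures:
    index_id: int    # Index ID: index number of the data block
    interval: int    # Interval: assumed here to be max end minus min start
    record_num: int  # Record Num: number of alignment records
    cigar_i: int     # CIGAR_I: summed insertion lengths
    cigar_d: int     # CIGAR_D: summed deletion lengths


def block_features(index_id: int,
                   records: Iterable[Tuple[int, str]]) -> BlockFeatures:
    """Compute the table's features for one block of (start, CIGAR) records."""
    min_start = None
    max_end = None
    record_num = cigar_i = cigar_d = 0

    for start, cigar in records:
        ops = CIGAR_OP.findall(cigar)
        # Reference span of this record (end position is exclusive).
        span = sum(int(n) for n, op in ops if op in REFERENCE_OPS)
        end = start + span

        min_start = start if min_start is None else min(min_start, start)
        max_end = end if max_end is None else max(max_end, end)

        record_num += 1
        cigar_i += sum(int(n) for n, op in ops if op == "I")
        cigar_d += sum(int(n) for n, op in ops if op == "D")

    # An empty block has no interval.
    interval = 0 if min_start is None else max_end - min_start
    return BlockFeatures(index_id, interval, record_num, cigar_i, cigar_d)


# Hypothetical example block with three records.
example = [(100, "50M"), (120, "20M2I28M"), (150, "10M5D40M")]
print(block_features(0, example))
# BlockFeatures(index_id=0, interval=105, record_num=3, cigar_i=2, cigar_d=5)
```

All five features come from a single pass over the records, so in this sketch they can be gathered while a block is being read, without a second scan.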