Fig. 2

From: Evaluating the necessity of PCR duplicate removal from next-generation sequencing data and a comparison of approaches

We constructed a Venn diagram using the variant datasets. The datasets correspond to the three pipelines: removing duplicates using SAMTools, removing duplicates using Picard, and ignoring duplicates. The blue circle is the Picard dataset, the red circle is the dataset with no duplicates removed, and the green circle is the SAMTools dataset.