Skip to main content


Figure 2 | BMC Bioinformatics

Figure 2

From: Information compression exploits patterns of genome composition to discriminate populations and highlight regions of evolutionary interest

Figure 2

Compression efficiency against heterozygosity resolves human populations. Compression efficiency (y-axis) against heterozygosity (x-axis) for the three human data sets: (A) HapMap 3 (1,184 individuals across 11 populations), (B) HGDP (1,043 individuals representing 51 populations from 14 geographical regions) and (C) Pan-Asia (1,928 individuals across 75 populations). We find that plotting the genome-wide compression efficiency data versus genome-wide heterozygosity reinforces the strength of the population discrimination in all cases. Although the broad relationship is negative, some populations clearly have similar genome-wide heterozygosity but very different compression efficiency. The spatial resolution is consistent with previous phylogeographic reconstructions based on Fixation Index and Principal Components Analysis, illustrated by the coloured boxes in the first panel representing Asian and Africans as extreme with the Europeans intermediate.

Back to article page