Skip to main content

Table 5 D (h, ) for smaller samples. Measure of discrepancy between phase-known and phase-unknown frequency predictions for smaller samples randomly selected from larger data sets.

From: Haplotype frequency estimation error analysis in the presence of missing genotype data

Data set

# individuals

% missing alleles

D (h, )

% increase from complete data

7 loci SNP

300

0

0.049354

0

7 loci SNP

300

10

0.067410

37

7 loci SNP

100

0

0.104105

0

7 loci SNP

100

10

0.147034

41

7 loci SNP

50

0

0.155912

0

7 loci SNP

50

10

0.229097

47

multiallelic

300

0

0.153865

0

multiallelic

300

10

0.202678

32

multiallelic

100

0

0.227170

0

multiallelic

100

10

0.238658

5

multiallelic

50

0

0.320917

0

multiallelic

50

10

0.372827

16