Skip to main content
Figure 1 | BMC Bioinformatics

Figure 1

From: Copy number variation signature to predict human ancestry

Figure 1

Overview of Method. Affymetrix SNP6.0 probe signal intensity data are normalized and summarized for copy number analysis. The mean log2ratio of each probe is compared between two populations of different ancestries using the t-test. The resulting t-statistic for each probe is formatted with chromosome position and imported into GADA to identify common ancestry CNVs (caCNVs). The t-statistics follow a normal distribution, with the t-statistic values in the tails representing the common ancestry probes. Finally, the sum of the log2ratios for each CNV is calculated and used as features in linear discriminant analysis to identify a minimum set of caCNVs required to classify the populations.

Back to article page