Skip to main content
Figure 2 | BMC Bioinformatics

Figure 2

From: Very Important Pool (VIP) genes – an application for microarray-based molecular signatures

Figure 2

The detailed process for identifying a very important pool (VIP) of genes. X1 and X2 are, respectively, the gene expression profiles for class 1 samples and class 2 samples in the training set. X1mand X2mare samples randomly selected from X1 and X2 in the mthbagging step. X 1 m ' MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeiwaG1aa0baaSqaaiabigdaXiabd2gaTbqaaiabcEcaNaaaaaa@3062@ and X 2 m ' MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeiwaG1aa0baaSqaaiabikdaYiabd2gaTbqaaiabcEcaNaaaaaa@3064@ are the genes remaining after filtering genes from X1mand X2m, respectively. Malinowski's factor indicator function (IND) is calculated with equations R E k = i = k + 1 g λ i / p ( n k ) MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOuaiLaemyrau0aaSbaaSqaaiabdUgaRbqabaGccqGH9aqpdaGcaaqcfayaamaalyaabaWaaabCaeaacqaH7oaBdaWgaaqaaiabdMgaPbqabaaabaGaemyAaKMaeyypa0Jaem4AaSMaey4kaSIaeGymaedabaGaem4zaCgacqGHris5aaqaaiabdchaWjabcIcaOiabd6gaUjabgkHiTiabdUgaRjabcMcaPaaaaSqabaaaaa@447E@ and IND k = RE k /(n - k)2, where λ i is the ith eigenvalue of the total g eigenvalues; n is the number of samples and p is the number of genes. The optimum number (k) of components corresponds to the IND minimum. E11 and E21 are the residue matrices after projecting X1mand X2minto the PCA space for class 1, respectively, while E22 and E12 are the residue matrices after projecting X2mand X1minto the PCA space for class 2, respectively. The discrimination power (DP) of a gene j is calculated with the equation: D P j = [ ( e j 12 ) T ( e j 12 ) + ( e j 21 ) T ( e j 21 ) ] / [ ( e j 11 ) T ( e j 11 ) + ( e j 22 ) T ( e j 22 ) ] MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiraqKaemiuaa1aaSbaaSqaaiabdQgaQbqabaGccqGH9aqpcqGGBbWwdaWcgaqaaiabcIcaOiabhwgaLnaaDaaaleaacqWGQbGAaeaacqaIXaqmcqaIYaGmaaGccqGGPaqkdaahaaWcbeqaaiabbsfaubaakiabcIcaOiabhwgaLnaaDaaaleaacqWGQbGAaeaacqaIXaqmcqaIYaGmaaGccqGGPaqkcqGHRaWkcqGGOaakcqWHLbqzdaqhaaWcbaGaemOAaOgabaGaeGOmaiJaeGymaedaaOGaeiykaKYaaWbaaSqabeaacqqGubavaaGccqGGOaakcqWHLbqzdaqhaaWcbaGaemOAaOgabaGaeGOmaiJaeGymaedaaOGaeiykaKIaeiyxa0fabaGaei4waSLaeiikaGIaeCyzau2aa0baaSqaaiabdQgaQbqaaiabigdaXiabigdaXaaakiabcMcaPmaaCaaaleqabaGaeeivaqfaaOGaeiikaGIaeCyzau2aa0baaSqaaiabdQgaQbqaaiabigdaXiabigdaXaaakiabcMcaPiabgUcaRiabcIcaOiabhwgaLnaaDaaaleaacqWGQbGAaeaacqaIYaGmcqaIYaGmaaGccqGGPaqkdaahaaWcbeqaaiabbsfaubaakiabcIcaOiabhwgaLnaaDaaaleaacqWGQbGAaeaacqaIYaGmcqaIYaGmaaGccqGGPaqkcqGGDbqxaaaaaa@7112@ , where e j 11 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeCyzau2aa0baaSqaaiabdQgaQbqaaiabigdaXiabigdaXaaaaaa@3096@ , e j 12 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeCyzau2aa0baaSqaaiabdQgaQbqaaiabigdaXiabikdaYaaaaaa@3098@ , e j 22 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeCyzau2aa0baaSqaaiabdQgaQbqaaiabikdaYiabikdaYaaaaaa@309A@ , and e j 21 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeCyzau2aa0baaSqaaiabdQgaQbqaaiabikdaYiabigdaXaaaaaa@3098@ are the j columns of residue matrices E11, E12, E22, and E21, respectively.

Back to article page