Skip to main content
Fig. 5 | BMC Bioinformatics

Fig. 5

From: Classification based on extensions of LS-PLS using logistic regression: application to clinical and multiple genomic data

Fig. 5

Distribution of misclassification rates and AUCs for the somatic CNA data estimated based on 100 samples using the six methods. GLM and R-PLS denote the misclassification rates and AUCs obtained from applying the GLM to the clinical data alone and PLS to the CNA data alone, respectively. LS-PCR denotes the approach derived from PCR, where CNA data are analyzed using PCA and IRLS can thus be applied to the merged data set of PCA scores and clinical data. LS-PLS-IRLS, R-LS-PLS, and IR-LS-PLS denote the misclassification rates and AUCs obtained from the newly proposed LS-PLS approaches combining CNA and clinical data from the brest cancer data set. The number of gene expression variables to pre-select pred is set to 500 in the SIS procedure. The color code for the methods is similar to that used in Fig. 1

Back to article page