Skip to main content
Fig. 3 | BMC Bioinformatics

Fig. 3

From: Classification based on extensions of LS-PLS using logistic regression: application to clinical and multiple genomic data

Fig. 3

Distribution of misclassification rates and AUCs for central nervous system data, estimated from 100 samples using the six methods. GLM and R-PLS denote the misclassification rates and AUCs obtained from applying the GLM to the clinical data alone and PLS to the gene expression data alone, respectively. LS-PCR denotes the approach derived from PCR, where gene expression data are analyzed using PCA and IRLS can thus be applied to the merged data set of PCA scores and clinical data. LS-PLS-IRLS, R-LS-PLS, and IR-LS-PLS denote the misclassification rates and AUCs obtained from the newly proposed LS-PLS approaches combining expression and clinical data from the central nervous system data set. The number of gene expression variables to pre-select pred is set to 500 in the SIS procedure. The color code for the methods is similar to that in Fig. 1

Back to article page