Table 7 Variable selection for prostate cancer data (Gleason 5 vs. Gleason 7)

From: Greedy feature selection for glycan chromatography data with the generalized Dirichlet distribution

  GDFS CFS rpart Predominant glycans (GDFS method)
Peaks 1-14  
Peak 15 FA2BG2S1, A3G3
Peak 16  
Peak 17  
Peak 18  
Peak 19 A3G3S2
Peak 20  
Peak 21 A3G3S3
Peak 22  
Peak 23 A4G4S4
Peak 24* A4F1G4S4
Features selected from the prostate cancer dataset (Gleason 5 vs. Gleason 7 cases) by the proposed GDFS method (GDFS), correlation-based feature selection (CFS), and recursive partitioning (rpart). Features that were selected in 90% or more of the cross-validation models are marked. Also listed are the predominant glycan structures corresponding to each selected peak. The detailed N-glycan composition of human serum was described in Royle et al. [9], and these peaks were also assigned in Saldova et al. [24]. *Peak 24 was the peak most frequently selected by rpart, but it was selected less than 80% of the time.