Table 3 Pseudocode for a Monte Carlo wrapper-based feature selection algorithm

From: Win percentage: a novel measure for assessing the suitability of machine classifiers for biological problems

   1. xout = -∞
   2. For i = 1 to N
        S_i = randomSubset(S)
        (x_i, C_i) = performance(S_i)
        If x_i > xout,
            Sout = S_i, xout = x_i, cout = randomElement(C_i)
   3. output Sout, xout, cout
  1. The input, S, is the set of all features, and N is the total number of feature subsets to draw at random. The variable x_i is the performance of the top classifier for subset S_i, and C_i is the label of the top classifier. The outputs Sout, xout, and cout are the top-performing feature set, the top estimated performance, and the top classifier, respectively.
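
The pseudocode above can be sketched in Python roughly as follows. This is a minimal illustration, not the authors' implementation. The table does not specify how randomSubset samples or how performance evaluates classifiers, so several details here are assumptions: each feature is kept with probability 0.5, `performance` is a caller-supplied function returning a score and a non-empty list of top classifier labels, and `toy_performance` is a hypothetical stand-in scorer used only to make the example runnable.

```python
import random


def random_subset(features, rng):
    """Draw a non-empty random subset of features.

    Assumption: each feature is kept independently with probability 0.5;
    the source table does not say how subsets are sampled.
    """
    subset = [f for f in features if rng.random() < 0.5]
    return subset or [rng.choice(features)]


def monte_carlo_feature_selection(features, n_draws, performance, seed=None):
    """Keep the best of n_draws randomly drawn feature subsets.

    performance(subset) must return (score, labels), where labels is a
    non-empty sequence of the top classifier label(s) for that subset.
    """
    rng = random.Random(seed)
    features = list(features)
    x_out = float("-inf")          # step 1: xout = -infinity
    s_out, c_out = None, None
    for _ in range(n_draws):       # step 2: For i = 1 to N
        s_i = random_subset(features, rng)
        x_i, c_i = performance(s_i)
        if x_i > x_out:            # strict improvement only
            s_out, x_out = s_i, x_i
            c_out = rng.choice(list(c_i))  # randomElement(C_i)
    return s_out, x_out, c_out     # step 3: output Sout, xout, cout


def toy_performance(subset):
    """Hypothetical scorer: rewards f1 and f2, penalises subset size."""
    score = len({"f1", "f2"} & set(subset)) - 0.1 * len(subset)
    return score, ["knn", "svm"]   # pretend two classifiers tie for the top


if __name__ == "__main__":
    best = monte_carlo_feature_selection(
        ["f1", "f2", "f3", "f4"], n_draws=50, performance=toy_performance, seed=0
    )
    print(best)
```

In a real wrapper, `performance` would train and cross-validate each candidate classifier on the selected features. Because the comparison is strict, a later subset that only ties the current best does not replace it.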