Table 1. Q2 and P2 values for the amine data set. Means and standard deviations of the P2 and Q2 values obtained with the amine data set using two different variable selection methods (corrfilter and PLSfilter) and two different regression methods (PLS and RR). The 5-fold cross-validation procedure was repeated 100 times, using 100 different random partitions of the data. The N_D and N_L values were selected in an inner 5-fold cross-validation loop by optimizing the Q2 value. Each random partition of the amine data into five cross-validation groups yields one P2 value and five Q2 values, and the mean of the five Q2 values was computed for every partition. The reported means and standard deviations are based on the 100 P2 values and the 100 mean Q2 values.

From: Unbiased descriptor and parameter selection confirms the potential of proteochemometric modelling

| Filter | Regression | P2 (mean ± std) | Mean Q2 (mean ± std) |
|---|---|---|---|
| no filter | PLS | 0.52 ± 0.021 | 0.49 ± 0.011 |
| no filter | RR | 0.53 ± 0.022 | 0.49 ± 0.012 |
| corrfilter | PLS | 0.49 ± 0.028 | 0.76 ± 0.0057 |
| corrfilter | RR | 0.44 ± 0.038 | 0.76 ± 0.0085 |
| PLSfilter | PLS | 0.52 ± 0.025 | 0.90 ± 0.0026 |
| PLSfilter | RR | 0.51 ± 0.027 | 0.90 ± 0.0056 |
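
The repeated double cross-validation summarized in the caption can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it uses closed-form ridge regression from NumPy only, tunes a single hypothetical penalty `lam` in the inner loop (standing in for N_D and N_L), and assumes that P2 is computed from the pooled outer-loop predictions of one partition and that the standard deviation is the sample standard deviation. All function names are made up for this example.

```python
import numpy as np

def ridge_fit_predict(X_tr, y_tr, X_te, lam):
    """Closed-form ridge regression on centered data; returns test predictions."""
    x_mean, y_mean = X_tr.mean(axis=0), y_tr.mean()
    Xc = X_tr - x_mean
    w = np.linalg.solve(Xc.T @ Xc + lam * np.eye(X_tr.shape[1]),
                        Xc.T @ (y_tr - y_mean))
    return (X_te - x_mean) @ w + y_mean

def r2_pred(y_true, y_pred):
    """1 - PRESS / total sum of squares (the form used here for both Q2 and P2)."""
    return 1.0 - np.sum((y_true - y_pred) ** 2) / np.sum((y_true - y_true.mean()) ** 2)

def make_folds(n, k, rng):
    """Random partition of n sample indices into k groups."""
    return np.array_split(rng.permutation(n), k)

def cv_predictions(X, y, lam, folds):
    """Predict every sample from a model trained without its own fold."""
    y_hat = np.empty(len(y), dtype=float)
    for test_idx in folds:
        mask = np.ones(len(y), dtype=bool)
        mask[test_idx] = False
        y_hat[test_idx] = ridge_fit_predict(X[mask], y[mask], X[test_idx], lam)
    return y_hat

def one_partition(X, y, lams, rng, k=5):
    """One random outer partition: returns (P2, mean of the k inner Q2 values)."""
    y_hat_outer = np.empty(len(y), dtype=float)
    q2_values = []
    for test_idx in make_folds(len(y), k, rng):
        mask = np.ones(len(y), dtype=bool)
        mask[test_idx] = False
        X_tr, y_tr = X[mask], y[mask]
        # Inner loop: choose the parameter that maximizes Q2 on training data only.
        inner = make_folds(len(y_tr), k, rng)
        q2_by_lam = [r2_pred(y_tr, cv_predictions(X_tr, y_tr, lam, inner))
                     for lam in lams]
        best = int(np.argmax(q2_by_lam))
        q2_values.append(q2_by_lam[best])
        # Outer loop: predict the held-out group with the selected parameter.
        y_hat_outer[test_idx] = ridge_fit_predict(X_tr, y_tr, X[test_idx], lams[best])
    return r2_pred(y, y_hat_outer), float(np.mean(q2_values))

def repeated_double_cv(X, y, lams, n_repeats=100, seed=0):
    """Mean and sample std of P2 and of mean Q2 over n_repeats random partitions."""
    rng = np.random.default_rng(seed)
    res = np.array([one_partition(X, y, lams, rng) for _ in range(n_repeats)])
    p2, mean_q2 = res[:, 0], res[:, 1]
    return (p2.mean(), p2.std(ddof=1)), (mean_q2.mean(), mean_q2.std(ddof=1))
```

Calling `repeated_double_cv(X, y, lams=[0.1, 1.0, 10.0])` on a descriptor matrix `X` and a float response vector `y` returns two `(mean, std)` pairs in the same layout as the last two table columns. Any variable selection step would have to be performed inside the inner loop as well for P2 to remain an unbiased estimate.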