Fig. 3 | BMC Bioinformatics

Fig. 3

From: A new method for exploring gene–gene and gene–environment interactions in GWAS with tree ensemble methods and SHAP values

All available data are divided into three subsets: ranking data, fitting data, and evaluation data. The ranking data are used to rank features by importance in order to remove noise. The fitting data are used to fit models on the features selected by the ranking derived from the ranking data. Finally, the evaluation data are used to explain what the models trained on the fitting data consider important for their predictions.
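
The three-way division described in the caption can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function name `three_way_split`, the split fractions, and the random seed are all assumptions, since the caption does not state how large each subset is or how samples are assigned.

```python
import random


def three_way_split(n_samples, fractions=(0.3, 0.4, 0.3), seed=0):
    """Shuffle sample indices and cut them into three disjoint subsets.

    The fractions are hypothetical placeholders; the caption gives no sizes.
    Returns (ranking, fitting, evaluation) lists of sample indices.
    """
    if len(fractions) != 3 or abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must be three values summing to 1")

    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # reproducible shuffle

    n_rank = int(n_samples * fractions[0])
    n_fit = int(n_samples * fractions[1])

    ranking = indices[:n_rank]                # used to rank features
    fitting = indices[n_rank:n_rank + n_fit]  # used to fit the models
    evaluation = indices[n_rank + n_fit:]     # used to explain predictions
    return ranking, fitting, evaluation


ranking_idx, fitting_idx, evaluation_idx = three_way_split(1000)
```

Keeping the subsets disjoint means the samples used to explain the models were never seen during feature ranking or model fitting.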