Skip to main content

Table 2 Top species included in the GA selected 1, 2, 3, 4, 5 or 6 PCs produced with different data sets

From: Selection of microbial biomarkers with genetic algorithm and principal component analysis

Dataset for creating PCAHigh contribution variables (high coefficients in the corresponding PC) included in the most selected components
Comp1Comp2Comp3Comp4Comp5Comp6
Whole (PC1, PC7, PC2, PC27, PC11, PC15)Prausnitzii–aGnavus+Eutactus+aMoorei–Eggerthii–aZeae+a
Eutactus–aFaecis–aPrausnitzii+aObeum–Dispar–aGnavus–
Formicigenerans–aCopri+Aerofaciens–Lenta+aAdolescentis+Stutzeri+a
Catus–aMuciniphila–aCatus–Animalis–Mucilaginosa–aBromii+a
Faecis–aAdolescentis–aAdolescentis–`Torques–Aerofaciens+Fragilis+a
Obesity (PC14, PC18, PC2, PC4, PC19, PC16)Eutactus–aUniformis+Dolichum–Producta–Caccae+aFormicigenerans+a
Bromii+Catus–aLenta–Prausnitzii+aParainfluenzae+aBromii–
Adolescents–aDispar+Aerofaciens+aAerofaciens–Formicigenerans+aDistasonis–
Formicigenerans+Faecis+Producta–Fragilis–Adolescentis–Eutactus+a
Producta–aDistasonis–aGnavus–Faecis+aDispar–Perfringens+a
Healthy (PC1, PC34, PC23, PC28, PC3, PC5)Prausnitzii–aStutzeri–aCallidus–aOvatus–Copri+Copri+a
Eutactus–aZeae+Moorei+Longum+aMuciniphila–aMuciniphila+a
Catus–aGnavus+Formigenes+Distasonis+aFormigenes–aPrausnitzii–
Formicigenerans–aDispar+Prausnitzii+Fragilis–Catus+Formigenes+a
Faecis–aLenta–aCatus–aAerofaciens–Biforme+Eutactus+a
  1. Comp1, Comp2, Comp3, Comp4, Comp5 and Comp6 represent the 6 PCs selected by GA. For experiment with whole dataset they are PC1, PC7, PC2, PC27, PC11 and PC15 respectively; for experiment with obesity sample, they are PC14, PC18, PC2, PC4, PC19 and PC16; for experiment with healthy sample, they are PC1, PC34, PC23, PC28, PC3 and PC5
  2. aSpecies has a positive correlation with the probability of having healthy body mass
  3. + Positive correlation with the corresponding PC
  4. – Negative correlation with the corresponding PC