Skip to main content

A permutable MLP-like architecture for disease prediction from gut metagenomic data

Abstract

Metagenomic data plays a crucial role in analyzing the relationship between microbes and diseases. However, the limited number of samples, high dimensionality, and sparsity of metagenomic data pose significant challenges for the application of deep learning in data classification and prediction. Previous studies have shown that utilizing the phylogenetic tree structure to transform metagenomic abundance data into a 2D matrix input for convolutional neural networks (CNNs) improves classification performance. Inspired by the success of a Permutable MLP-like architecture in visual recognition, we propose Metagenomic Permutator (MetaP), which applied the Permutable MLP-like network structure to capture the phylogenetic information of microbes within the 2D matrix formed by phylogenetic tree. Our experiments demonstrate that our model achieved competitive performance compared to other deep neural networks and traditional machine learning, and has good prospects for multi-classification and large sample sizes. Furthermore, we utilize the SHAP (SHapley Additive exPlanations) method to interpret our model predictions, identifying the microbial features that are associated with diseases.

Peer Review reports

Introduction

Metagenomics has made significant advances in the fields of microbial ecology, evolution, and diversity by applying a suite of genomic technologies and bioinformatics tools to directly access the genetic content of entire biological communities over the past two decades [1]. The development of next-generation sequencing technologies has enabled researchers to study the human microbiome, and has made significant progress in the characterization of the microbiome associated with healthy and diseased individuals [2]. In particular, shotgun metagenomic sequencing technology has enabled researchers to obtain a higher resolution profile of the microbial community at the species- and strain-levels [3, 4]. Extensive metagenomic studies have shown that the gut microbiota is closely associated with various host diseases, including obesity [5], type 2 diabetes [6], cirrhosis [7], etc. [8].

There are trillions of microbes inhabiting the human intestine, forming a complex and stable ecosystem that is closely related to human health [9]. The characteristics of a metagenomic sample are commonly described by the relative abundance of microbial taxa at one of the taxonomic levels(i.e. Phylum, Class, Order, Family, Genus, and Species) [10]. Although next-generation sequencing technologies have greatly enhanced our capacity to characterize the metagenomic data, we still face challenges related to high dimensionality, limited sample sizes, and sparse features in metagenomic data.

In the past few years, disease prediction models based on metagenomic data can be broadly categorized into two major types: 1) traditional machine learning-based prediction models, and 2) deep learning-based prediction models. Traditional machine learning models are commonly used for metagenomic data include Random Forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO), and Support Vector Machines (SVM) [11,12,13]. These methods utilize unordered vectors of species-level relative abundance or strain-specific markers for performing supervised classification learning. However, this high-dimensional and sparse tabular data is not the optimal data format for achieving efficient deep learning [14].

One distinctive characteristic of metagenomic data is the phylogenetic tree that establishes connections among microbial species, with closely related species exhibiting similar relationships in the environment and disease [15]. Therefore, the input features of deep learning models typically integrate phylogenetic information to transform raw data into “synthetic images” or 2D matrices. Recent studies have demonstrated that transforming raw metagenomic features into feature representations that incorporate phylogenetic tree information can significantly enhance the classification performance of deep learning models [10, 15,16,17]. Marculescu et al. [16] and Nguyen et al. [17] introduced a phylogenetic-sorting method that enables effective exploration of convolutional neural networks for disease classification. The phylogenetic-sorting approach involves extracting phylogenetic information by sorting the microbial features based on their taxonomic annotation in alphabetical order, by concatenating the strings of their taxonomy [16, 17]. However, this approach disregards the relationships between species sharing the same ancestor but having greater alphabetical distances in their names. Reiman et al. [10] and Zhu et al. [15] respectively developed PopPhy-CNN and Cascade Deep Forest, which were trained using 2D matrices incorporating embedded phylogenetic tree information as input data. However, their embedding approach [10, 15] uses the 2D matrix that includes all nodes of the phylogenetic tree, resulting in sparsity in the upper part of the matrix due to the branching and hierarchical structure of the tree.

In this study, we addressed the challenge of data sparsity in the matrix when transforming raw metagenomic features into matrix representations that incorporate phylogenetic tree information, achieved by reducing the matrix height to solely focus on phylogenetic structure information derived from leaf nodes. This strategy aims to preserve the integrity of the phylogenetic tree structure while enhancing computational efficiency, resulting in a denser representation in the form of the 2D matrix. Moreover, by drawing inspiration from the successful utilization of a Permutable MLP-like architecture in the field of image classification [18], we introduced MetaP, the Permutable MLP-like architecture for the classification of metagenomic data that incorporates the embedded phylogenetic information. We demonstrated the competitive performance and robustness of MetaP in both binary and multiclass classification tasks through benchmark testing with several well-established machine learning methods (RF, SVM) [11] and other deep learning methods (MetaNN [16], PopPhy-CNN [10]) on three publicly available datasets. Finally, we incorporated the SHAP [19] method for interpretable analysis of our model predictions, aiming to uncover the associations between the identified crucial microbes and specific diseases.

Materials and methods

In this section, we outline the key components as depicted in Fig. 1. Additionally, we discuss the application of the SHAP method in interpreting model outputs and provide an overview of the dataset source.

Fig. 1
figure 1

Overview of MetaP model. A Example of populating and embedding phylogenetic tree. We only consider embedding the phylogenetic structure information derived from the leaf nodes of the phylogenetic tree into the matrix, thereby avoiding data sparsity within the matrix. B Basic architecture of the proposed MetaP. We split the matrix into fixed-size patches, linearly embed each of them, and then feed them into a sequence of standard Permutators [18] for feature encoding. A global average pooling layer followed by a fully-connected layer is finally used to predict the class

Representing metagenomic data using 2D matrices

The process of representing metagenomic abundance data using 2D matrices based on the phylogenetic tree, similar to PopPhy [20], is depicted in Fig. 1A. We utilized PhyloT [21] to generate phylogenetic trees, assuming a constant distance of one between nodes in the tree. Each feature in the metagenomic samples is referred to as an Operational Taxonomic Unit (OTU), and we use species-level relative abundances obtained from metagenomic sequencing as the values for these OTUs. For each raw metagenomic dataset, we first filter out OTUs that exhibit a prevalence lower than the threshold of 10% in all classes, and then generate a corresponding pruned phylogenetic tree based on the observed OTUs. This tree serves as a template to construct a populated tree for each sample in the dataset. The leaf nodes are then populated with their corresponding OTU abundance values, while the abundance of each non-leaf node is determined by summing the abundances of its child nodes. The 2D matrix representation is generated by performing a level-order traversal of the phylogenetic tree, populating the matrix in a top-to-bottom and left-to-right order, and assigning zeros to empty positions without any nodes. However, since the OTU value represents the species-level relative abundance within the sample, ranging from 0 to 100, and with the cumulative sum of each sample equating to 100, employing the aforementioned method to populate the node values of parent nodes as the sum of their respective child node values leads to diminished discriminability in the upper portion of the tree. Additionally, the branching and hierarchical structure of the tree also result in sparsity in the upper part of the matrix. This prompts us to reduce the matrix height to include only the phylogenetic structure information derived from the leaf nodes, allowing us to obtain a more compact and efficient matrix representation of the data.

A permutable MLP-like architecture

The Permutable MLP-like architecture [18], proposed by Hou et al., is a conceptually simple and data efficient MLP-like architecture designed for visual recognition. In this study, we explore the application of this architecture in the classification of metagenomic data. The basic architecture of the Permutable MLP-like networks can be found in Fig. 1B. Our network takes a 2D matrix as input and uniformly splits it into a sequence of non-overlapping matrix patches. Each patch is mapped into a linear embedding (or called token), using the same projection matrix. These tokens are subsequently fed into a sequence of Permutators to encode both spatial and channel information. The resulting tokens are finally processed using global average pooling followed by a linear classifier to make class predictions.

As the bottom-right corner of Fig. 1B illustrates, in order to learn the spatial and channel information of the matrix, Hou et al. [18] introduces the Permutator block, which consists of two components: Permute-MLP and Channel-MLP. As shown in Fig. 2, The Permute-MLP module accepts 3-dimensional token representations and consists of three branches, each responsible for encoding information along the height, width, or channel dimension [18]. The Channel-MLP module shares a similar structure to the feed forward layer in Transformers [22] that comprises two fully-connected layers with a GELU activation in the middle [18]. Mathematically, given an input C-dim tokens \(X \in \mathbb {R}^{H \times W \times C}\), the formulation of Permutator can be written as follows:

$$\begin{aligned} \textbf{Y}= & {} {\text {Permute-MLP}}({\text {LN}}(\textbf{X})) + \textbf{X}, \end{aligned}$$
(1)
$$\begin{aligned} \textbf{Z}= & {} {\text {Channel-MLP}}({\text {LN}}(\textbf{Y})) + \textbf{Y}, \end{aligned}$$
(2)

where, LN refers to LayerNorm [18]. The output Z will serve as the input to the next Permutator block until the last one [18].

Fig. 2
figure 2

Basic structure of the Permute-MLP layer. The Permute-MLP layer contains three branches dedicated to encoding features across the dimensions of height, width, and channel. Subsequently, the outputs from these branches are combined through element-wise addition, followed by a fully-connected layer for feature fusion

Interpreting model predictions using SHAP

Shapley Additive Explanations (SHAP) [19], proposed by Lundberg and Lee in 2017, is a method for interpreting the outputs of any complex models. This method is grounded in a solid theoretical framework derived from game theory, aiming to estimate SHAP values, which represent the contribution of each input feature to the final model prediction. In our experiment, we employed the Kernel SHAP Explainer module(available at https://github.com/slundberg/shap), which utilizes a specially-weighted local linear regression approach to estimate SHAP values. This method relies on a background dataset for training and simulates feature absence by substituting the feature with the values it takes in the background dataset. The importance of each feature in the kernel Shapley value is determined by comparing the prediction value obtained with the feature to the prediction value obtained without it. Positive SHAP values suggest that a feature has a positive impact on the model’s prediction, while negative values suggest a negative impact. The magnitude of the value indicates the strength of the contribution, with larger magnitudes indicating greater influence. In our experiment, to obtain the SHAP values corresponding to each feature, we performed a 10-fold cross-validation and computed the average of the SHAP values obtained from each fold. During each fold of training, due to computational resource limitations, we utilized the k-means [23] function to summarize the training dataset as the background dataset for Kernel SHAP, and then calculated the average SHAP value for all features of the samples in the test set.

Datasets

In this study, we select publicly available datasets from the MetAML package [11]: Cirrhosis, Type 2 Diabetes (T2D), and Obesity datasets. Table 1 provides a summary of these datasets. The Cirrhosis dataset comprises 114 cirrhotic patients and 118 healthy individuals. The T2D dataset consists of 170 patients with type 2 diabetes and 174 instances of control samples. The Obesity dataset consists of 253 samples, where 89 samples have a BMI lower than \(25\,{\text {kg/m}}^2\), while the remaining 164 samples have a BMI greater than \(30\,{\text {kg/m}}^2\). In the MetAML study [11], all data were obtained through metagenomic shotgun sequencing [24], and the OTUs for each sample were obtained using MetaPhlAn2 [25] with default parameters. In our study, the OTUs are aggregated at the species level within each dataset. For any OTU that is classified as “unclassified” at the species level, the label of the next highest taxonomic level is assigned to that OTU.

Table 1 Summary of the datasets used in the experiments

Results and discussion

In this section, we will provide an exposition of the experimental setup and subsequently delve into an analysis and discussion of the experimental results.

Table 2 Classification performance comparison of MetaP with existing methods

Experimental setup

The Permutable MLP-like network structure parameter settings for our model applied to metagenomic classification are described as follows. Due to the requirement of ensuring divisibility between the patch dimensions and the 2D matrix, we implemented specific parameter settings. Specifically, the patch width was set to 3, and during the construction of the 2D matrix, we ensured that its width was divisible by 3. Furthermore, the patch height was set to match the reduced matrix height, which typically does not exceed 8. For example, in the T2D dataset, the matrix height was only 5.

There are several hyper parameters in MetaP, including the number of Permutator blocks \(P_n\), the channel dimension split segments \(S_n\), the hidden units in the neural network \(h_u\), the initial learning rate lr of the Adam [26] optimizer, the batch size bs, the number of iterations epoch, and two parameters (step size \(\alpha\) and decay factor \(\gamma\)) associated with the Step learning rate scheduler (StepLR) provided by PyTorch for learning rate scheduling. These parameters are explored over different combinations from the following ranges: \(P_n \in \{1, 2, 3\}\), \(S_n \in \{2, 4, 6, 8, 10\}\), \(h_u \in \{32, 48, 64, 128\}\), \(lr \in \{0.05, 0.005, 0.0005, 0.00005\}\), \(bs \in \{16, 32, 64, 128\}\), \(epoch \in \{10, 20, 30, 40, 50\}\), \(\alpha \in \{5, 10\}\), \(\gamma \in \{0.1, 0.3, 0.5, 0.7\}\). After adjustments, we identified the optimal parameters for MetaP in subsequent experiments as \(P_n=1, S_n=8, h_u=48, lr=0.0005, bs=32, epoch=20, \alpha =5, \gamma =0.5\).

In order to verify the effectiveness of our proposed method, we compared it with the MetAML [11], MetaNN [16], and PopPhy-CNN [10] methods proposed in previous studies. Besides, for the other comparison models, we followed the recommended hyperparameter settings and model architectures outlined in their respective original papers or the provided open-source code. In our experiment, we performed 10-fold cross-validation 10 times and then computed the average to ensure the accuracy of our results. We employed five metrics, namely Area under curve-receiver operating characteristic (AUC), Matthews correlation coefficient (MCC), Precision, Recall, and F1 score, to comprehensively evaluate the model from different perspectives.

We separately evaluated the performance of our model in binary and multiclass classification tasks. For binary classification task, we conducted experiments on the three publicly available datasets mentioned in previous sections. For multiclass classification task, we created a Multi-Disease dataset consisting of 829 samples with multiple diseases by combining these three datasets based on species-level abundance features present in each sample.

Fig. 3
figure 3

Comparing the ROC curves of MetaP and other five methods based on 10-fold cross-validation on the three disease datasets

Table 3 Comparison of PopPhy-CNN and MetaP on dense versus sparse matrix classification performance

The classification performance of models

The classification performance results of different models on the dataset are presented in Table 2, and for better visual comparison, the corresponding ROC curves of RF, SVM, MLPNN, 1DCNN, PopPhy-CNN, and MetaP are shown in Fig. 3. Among the machine learning algorithms, Random Forest(RF) exhibits the best classification performance in the metagenomic datasets. Our experimental results demonstrate that our proposed MetaP model achieved the best classification performance on the Obesity, T2D, and Multi-Disease datasets, while exhibiting slightly lower performance compared to RF on the Cirrhosis dataset. We attribute the superior classification performance of our MetaP model on these datasets to their larger sample size compared to the Cirrhosis dataset. Previous studies [27] have also demonstrated that deep learning models generally outperform traditional machine learning methods when applied to datasets with a larger number of samples.

Additionally, we trained the PopPhy-CNN and MetaP models on both sparse and dense matrices. Our experimental results, as shown in Table 3, demonstrate that in most cases, reducing the matrix height to mitigate data sparsity does not have an adverse impact on the model’s classification performance. On the contrary, the model’s classification performance exhibits a slight improvement, which may be attributed to the enhanced focus on important features and the reduction of interference from redundant information. However, the PopPhy-CNN model using convolutional neural network experiences a slight decline in classification performance for the Obesity dataset with fewer features, whereas our proposed MetaP model remains unaffected. Overall, we believe that reducing the matrix height in this manner is effective in improving computational efficiency without compromising classification performance.

Identification of disease-associated microbial features

To interpret the black box nature of the MetaP model and identify the microbial features that play a significant role in classification, we utilized the Kernel SHAP method on our model to obtain the SHAP values corresponding to each feature from a 10-fold cross-validation. In Fig. 4, we present the top 20 important microbial features based on the SHAP values in each of the three disease datasets, along with their average relative abundances in the samples. Similarly to previous studies [11], We found that the importance of each microbial feature was not strongly correlated with its average relative abundance across samples, indicating the complexity of the microbial system.

Fig. 4
figure 4

The top 20 microbial features with the highest SHAP values in each of the three disease datasets. For each species, the SHAP values on the vertical axis decrease from top to bottom, while the two horizontal bars represent the average relative abundance observed in the healthy samples (depicted in green) and the diseased samples (depicted in red). The positive or negative signs inside the parentheses indicate the positive or negative values of the SHAP values

In the Cirrhosis dataset, which exhibited the best performance in terms of model classification, our model identified the following important microbial features: the Veillonella, Lactobacillus, and Streptococcus genera. These important features were also identified in the original study [7, 28]. In the T2D and Obesity datasets, which exhibited lower classification accuracy, the important microbes identified by our model can serve as candidate sets for future experimental studies investigating the association between microbes and diseases. And our model also detected important species that have been previously reported in the literature. For example, associations between specific species such as Lactobacillus mucosae [29] and Olsenella spp. [30] with T2D patients, as well as associations between species like Ruminococcus bromii [31] and Eubacterium siraeum [32] with Obesity patients were identified. Additionally, we observed a correlation between the positive/negative SHAP values and the differences in the average abundance of microbes between the patients and healthy individuals, particularly in datasets with good classification performance. Microbes that have a higher relative abundance in the diseased population often exhibit positive SHAP values. For instance, in the Cirrhosis dataset, among the top 20 important microbes, only Megamonas spp. and Adlercreutzia equolifaciens showed higher abundance in the healthy population with negative SHAP values, while the remaining microbes exhibited higher relative abundance in the diseased population. In conclusion, we believe that improving the classification performance of models on metagenomic data can unveil potential associations between microbes and diseases, warranting further investigation.

Conclusion

We applied a Permutable MLP-like architecture for disease prediction from metagenomic data. This network structure, compared to deep neural networks, allowed us to effectively capture the phylogenetic information of microbes within the 2D matrix formed by embedding the phylogenetic tree. Additionally, we improved the embedding method of the phylogenetic tree based on PopPhy into the matrix by reducing the matrix height to accommodate only the phylogenetic structure information derived from the leaf nodes, thereby mitigating data sparsity and redundant information. We evaluated the performance of our model on three publicly available datasets and compared its effectiveness with established methods such as MetAML, MetaNN, and PopPhy-CNN, for both binary and multi-class classification tasks. Lastly, we presented the interpretability of our model’s predictions by incorporating the SHAP method.

We believe that our proposed MetaP model has contributed to the advancement of deep learning in the field of gut microbiome research, enabling the exploration of potential associations between gut microbiome and human diseases. Moreover, we found that deep learning models are better suited for large-scale sample datasets, indicating that data augmentation for metagenomic data could be a promising research direction. In recent years, both metagenomics research and deep learning techniques have been rapidly advancing, it is imperative for us to actively explore the rational integration of these two fields, leveraging the research achievements in computer science to propel the development of metagenomic data analysis.

Availability of data and materials

The datasets and the code used in this study can be found at https://github.com/ansjiang/MetaP.

References

  1. Thomas T, Gilbert J, Meyer F. Metagenomics—a guide from sampling to data analysis. Microb Inform Exp. 2012;2(1):3.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Qin J, Li R, et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464(7285):59–65. https://doi.org/10.1038/nature08821.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Quince C, Walker AW, et al. Shotgun metagenomics, from sampling to analysis. Nat Biotechnol. 2017;35(9):833–44. https://doi.org/10.1038/nbt.3935.

    Article  CAS  PubMed  Google Scholar 

  4. Scholz M, Ward DV, et al. Strain-level microbial epidemiology and population genomics from shotgun metagenomics. Nat Methods. 2016;13(5):435–8. https://doi.org/10.1038/nmeth.3802.

    Article  CAS  PubMed  Google Scholar 

  5. Harley ITW, Karp CL. Obesity and the gut microbiome: striving for causality. Mol Metab. 2012;1(1–2):21–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Qin J, Li Y, et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature. 2012;490(7418):55–60. https://doi.org/10.1038/nature11450.

    Article  CAS  PubMed  Google Scholar 

  7. Qin N, Yang F, et al. Alterations of the human gut microbiome in liver cirrhosis. Nature. 2014;513(7516):59–64. https://doi.org/10.1038/nature13568.

    Article  CAS  PubMed  Google Scholar 

  8. Marchesi JR, Adams DH, Fava F, et al. The gut microbiota and host health: a new clinical frontier. Gut. 2016;65(2):330–9. https://doi.org/10.1136/gutjnl-2015-309990.

    Article  PubMed  Google Scholar 

  9. Lozupone CA, Stombaugh JI, et al. Diversity, stability and resilience of the human gut microbiota. Nature. 2012;489(7415):220–30. https://doi.org/10.1038/nature11550.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Reiman D, Metwally AA, Sun J, Dai Y. Popphy-cnn: a phylogenetic tree embedded architecture for convolutional neural networks to predict host phenotype from metagenomic data. IEEE J Biomed Health Inform. 2020;24(10):2993–3001. https://doi.org/10.1109/JBHI.2020.2993761.

    Article  PubMed  Google Scholar 

  11. Pasolli E, Truong DT, et al. Machine learning meta-analysis of large metagenomic datasets: tools and biological insights. PLoS Comput Biol. 2016;12(7):1–26. https://doi.org/10.1371/journal.pcbi.1004977.

    Article  CAS  Google Scholar 

  12. Wu H, Cai L, et al. Metagenomics biomarkers selected for prediction of three different diseases in Chinese population. BioMed Res Int. 2018;2018:2936257. https://doi.org/10.1155/2018/2936257.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Zhu Z, Ren J, et al. Micropro: using metagenomic unmapped reads to provide insights into human microbiota and disease associations. Genome Biol. 2019;20(1):154. https://doi.org/10.1186/s13059-019-1773-5.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Shwartz-Ziv R, Armon A. Tabular data: deep learning is not all you need. Inf Fusion. 2022;81:84–90.

    Article  Google Scholar 

  15. Zhu Q, Zhu Q. et al.: The phylogenetic tree based deep forest for metagenomic data classification. In: 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 279–282, 2018. https://doi.org/10.1109/BIBM.2018.8621463

  16. Lo C, Marculescu R. Metann: accurate classification of host phenotypes from metagenomic data using neural networks. BMC Bioinform. 2019;20(12):314. https://doi.org/10.1186/s12859-019-2833-2.

    Article  Google Scholar 

  17. Nguyen TH, Prifti E, Chevaleyre Y, Sokolovska N, Zucker J-D. Disease Classification in Metagenomics with 2D Embeddings and Deep Learning 2018.

  18. Hou Q, Jiang Z, et al. Vision permutator: a permutable mlp-like architecture for visual recognition. IEEE Trans Pattern Anal Mach Intell. 2023;45(1):1328–34. https://doi.org/10.1109/TPAMI.2022.3145427.

    Article  PubMed  Google Scholar 

  19. Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Advances in neural information processing systems, vol 30, 2017.

  20. Reiman D, Metwally A, Dai Y. Using convolutional neural networks to explore the microbiome. In: 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 4269–4272, 2017. https://doi.org/10.1109/EMBC.2017.8037799

  21. PhyloT: Phylot: A tree generator. https://phylot.biobyte.de/ (2019)

  22. Vaswani A, Shazeer N et al. Attention is all you need. Advances in neural information processing systems, vol 30, 2017.

  23. Likas A, Vlassis N, Verbeek JJ. The global k-means clustering algorithm. Pattern Recognit. 2003;36(2):451–61.

    Article  Google Scholar 

  24. Segata N, Boernigen D, Tickle TL, et al. Computational meta’omics for microbial community studies. Mol Syst Biol. 2013;9(1):666. https://doi.org/10.1038/msb.2013.22.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Truong DT, Franzosa EA, et al. Metaphlan2 for enhanced metagenomic taxonomic profiling. Nat Methods. 2015;12(10):902–3. https://doi.org/10.1038/nmeth.3589.

    Article  CAS  PubMed  Google Scholar 

  26. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 2014.

  27. Li Y, Huang C, Ding L, et al. Deep learning in bioinformatics: Introduction, application, and perspective in the big data era. Methods. 2019;166:4–21. https://doi.org/10.1016/j.ymeth.2019.04.008

    Article  CAS  PubMed  Google Scholar 

  28. Sung CM, Lin Y-F, et al. Predicting clinical outcomes of cirrhosis patients with hepatic encephalopathy from the fecal microbiome. Cell Mol Gastroenterol Hepatol. 2019;8(2):301–18.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Tsai C-Y, Lu H-C, et al. Gut microbial signatures for glycemic responses of glp-1 receptor agonists in type 2 diabetic patients: a pilot study. Front Endocrinol. 2022;12:1850.

    Article  Google Scholar 

  30. Shuai M, Zhang G, et al. Human gut antibiotic resistome and progression of diabetes. Adv Sci. 2022;9(11):2104965.

    Article  Google Scholar 

  31. Kasai C, Sugimoto K, Moritani I, Tanaka J, Oya Y, Inoue H, Tameda M, Shiraki K, Ito M, Takei Y, et al. Comparison of the gut microbiota composition between obese and non-obese individuals in a Japanese population, as analyzed by terminal restriction fragment length polymorphism and next-generation sequencing. BMC Gastroenterol. 2015;15:1–10.

    Article  CAS  Google Scholar 

  32. Chen H, Tang N, Ye Q, Yu X, Yang R, Cheng H, Zhang G, Zhou X. Alternation of the gut microbiota in metabolically healthy obesity: an integrated multiomics analysis. Front Cell Infect Microbiol. 2022;12:1648.

    Article  Google Scholar 

Download references

Acknowledgements

The authors thank Shenzhen University Big Data Research Center for computing hardware facilities.

Funding

The National Natural Science Foundation Project of China (82171526), Beijing Talents Project (2020A38) and Open Fund of National Engineering Laboratory for Big Data System Computing Technology (Grant No. SZU-BDSC-OF2024-19).

Author information

Authors and Affiliations

Authors

Contributions

CJ contributed primarily to code implementation and manuscript writing. JY critically revised the manuscript. XL conceived and supervised the project and guided the writing of the manuscript. XP provided guidance on experimental design and methodology.

Corresponding authors

Correspondence to Xiaogang Peng or Xiaozheng Li.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All authors have read the manuscript and consented for publication.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jiang, C., Yang, J., Peng, X. et al. A permutable MLP-like architecture for disease prediction from gut metagenomic data. BMC Bioinformatics 25, 246 (2024). https://doi.org/10.1186/s12859-024-05856-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12859-024-05856-w

Keywords