Skip to main content

A novel microbe-drug association prediction model based on graph attention networks and bilayer random forest

Abstract

Background

In recent years, the extensive use of drugs and antibiotics has led to increasing microbial resistance. Therefore, it becomes crucial to explore deep connections between drugs and microbes. However, traditional biological experiments are very expensive and time-consuming. Therefore, it is meaningful to develop efficient computational models to forecast potential microbe-drug associations.

Results

In this manuscript, we proposed a novel prediction model called GARFMDA by combining graph attention networks and bilayer random forest to infer probable microbe-drug correlations. In GARFMDA, through integrating different microbe-drug-disease correlation indices, we constructed two different microbe-drug networks first. And then, based on multiple measures of similarity, we constructed a unique feature matrix for drugs and microbes respectively. Next, we fed these newly-obtained microbe-drug networks together with feature matrices into the graph attention network to extract the low-dimensional feature representations for drugs and microbes separately. Thereafter, these low-dimensional feature representations, along with the feature matrices, would be further inputted into the first layer of the Bilayer random forest model to obtain the contribution values of all features. And then, after removing features with low contribution values, these contribution values would be fed into the second layer of the Bilayer random forest to detect potential links between microbes and drugs.

Conclusions

Experimental results and case studies show that GARFMDA can achieve better prediction performance than state-of-the-art approaches, which means that GARFMDA may be a useful tool in the field of microbe-drug association prediction in the future. Besides, the source code of GARFMDA is available at https://github.com/KuangHaiYue/GARFMDA.git

Peer Review reports

Background

A multitude of microbial communities, including bacteria, fungi, viruses, and other microbes, have been found in the human body, which are intimately linked to human health and are crucial to numerous physiological processes, including immune regulation, vitamin production, and the maintenance of digestive function [1, 2]. However, some microorganisms may be associated with the development of disease under specific circumstances. For instance, an imbalance of human gut bacteria can lead to the risk of high blood pressure [3].

In recent years, the misuse and irrational use of antibiotics, mutation and horizontal gene transfer of microbial genes, and the spread of microorganisms in the medical and social environments have led to microbial resistance to antibiotics, which makes effective antibiotic treatment ineffective and poses a serious challenge to clinical treatment [4]. Therefore, in order to address the problem of microbial resistance, it is meaningful to develop efficient computational models to detect microbial resistance and find new antibiotics, because these computational models can infer latent microbe-drug associations and thus provide a simple and efficient way to address microbial resistance.

For the last few years, a number of databases of microbial-drug associations, including MDAD [5], aBiofilm [6], and Drugvirus [7], have been adopted by researchers to construct an abundance of calculation models to identify possible microbe-drug associations. For example, in 2019, Zhu et al. [8] created a prediction model named HMDAKATZ based on the KATZ measure. In 2021, Deng et al. [9] devised a method called Graph2MDA by constructing multimodal attribute graphs as inputs of variogram autoencoders to discover details about every node and the complete graph. Long et al. [10] introduced the metapath2vec scheme for learning low-dimensional embedded representations of microorganisms and drugs and designed a partial dichotomous network projection recommendation algorithm and proposed a novel calculation method named HNERMDA. In 2023, Ma et al. [11] combined graph attention networks and CNN-based classifiers to construct a model called GACNNMDA. Huang et al. [12] designed a model named GNAEMDA based on graph normalized convolutional networks. Cheng et al. [13] designed a model called NIRBMMDA based on the neighbourhood-based inference and the restricted Boltzmann machine. Li et al. [14] combined matrix decomposition and a three-layer heterogeneous network to create a model called MFTLHNMDA to infer microbe-drug associations.

In this article, in order to improve the performance of prediction models, we designed a new prediction model named GARFMDA by combining graph attention network (GAT) and two-layer random forest (RF). In GARFMDA, a two-layer GAT was adopted first to learn the low-dimensional feature representations of microbes and drugs. And then, a two-layer random forest model was introduced to obtain the contribution values of all features as well as predict possible associations between microorganisms and drugs after eliminating those low-contribution features. Additionally, we conducted extensive case studies and comparison experiments to assess the prediction performance of GARFMDA. And as a result, GARFMDA achieved satisfactory results in the field of possible microbe-drug relationship prediction and outperformed existing representative competing methods.

Data sources

In this section, we will first download known microbe-drug associations from the MDAD database (https://figshare.com/search?q=10.6084%2Fm9.figshare.24798456), which consists of 2470 validated microbe-drug associations, including 1373 drugs and 173 microbes. Subsequently, we will download additional data on microbe, drug and disease associations from the database proposed by Wang et al. [14], which contains 70,315 reported drug-disease connections and 15,633 reported microbe-disease connections. Following a rigorous screening procedure to eliminate disease-related correlations for which there is no known association between medications or microorganisms in the MDAD database, we finally obtain 109 unique drug-disease connections covering 1,121 drugs and 233 diseases, and 109 unique microbe-disease connections covering 402 microbes and 73 diseases from the database proposed by Wang et al. Furthermore, we have also gathered 138 known microbe-microbe interactions, encompassing 123 microbe in MDAD, and 5586 known drug-drug relationships, from the data collection created by Deng et al. [9], which covers 1228 drugs in MDAD. Additional files 1, 2, 3, 4, 5, 6, 7, 8 and Table 1 below provides information on the aforementioned facts.

Table 1 Specifics of the newly-downloaded dataset

Methods

As shown in Fig. 1, GARFMDA is composed of the following three main parts:

Fig.1
figure 1

Flowchart of GARFMDA

Part 1: Firstly, based on the newly-downloaded datasets on microbes, drugs and diseases, two different heterogeneous microbe-drug networks \(H{N}_{1}\) and \(H{N}_{2}\) will be constructed.

Part 2: And then, based on multiple similarity metrics of microbe and drug, a feature matrix will be created for microbes and drugs separately, which will be then fed into the GAT along with \(H{N}_{1}\) and \(H{N}_{2}\) to learn the low-dimensional feature representations for microbes and drugs respectively.

Part 3: Finally, these two newly-obtained low-dimensional feature representations, along with two feature matrices, will be inputted into a two-layer random forest model to compute the probability scores of drug-microbe relationships.

Construction of two heterogeneous microbe-drug networks

For any given database D, let \({n}_{r}\) and \({n}_{m}\) stand for the numbers of drugs and microorganisms newly downloaded from D respectively, then we can construct a adjacency matrix \({D}^{1}\in {R}^{{n}_{r}*{n}_{m}}\) between microbes and drugs as follows: for any given microbe \({m}_{j}\) and drug \({r}_{i}\), if there is a known relationship between them in D, there is \({D}^{1}\left(i,j\right)=1\), otherwise there is \({D}^{1}\left(i,j\right)=0\).

Similarly, based on the newly-downloaded datasets of known connections between microbes and drugs, microbes and diseases, and drugs and diseases, we may create another microbe-drug adjacency matrix \({D}^{2}\in {R}^{{n}_{r}*{n}_{m}}\) as follows: for a given microbe \({m}_{j}\), drug \({r}_{i}\) and disease \({d}_{k}\), if there exist a known relationship between \({m}_{j}\) and \({d}_{k}\), as well as a known association between \({r}_{i}\) and \({d}_{k}\), then there is \({D}^{2}\left(i,j\right)=1\), otherwise there is \({D}^{2}\left(i,j\right)=0\).

Hence, based on above two adjacency matrices \({D}^{1}\) and \({D}^{2}\), it is simple to build two heterogeneous microbe-drug networks \(H{N}_{1}\) and \(H{N}_{2}\) according to the following way:

Firstly, in \({D}^{v}\left(v=\mathrm{1,2}\right)\), let \({D}^{v}\left({r}_{i}\right)\) and \({D}^{v}\left({m}_{j}\right)\) denote the i-th row and j-th column of \({D}^{v}\) separately, then for any two given drugs \({r}_{i}\) and \({r}_{j}\), we will calculate the Gaussian Interaction Profile (GIP) kernel similarity between them as follows:

$${A}_{rg}^{v}\left({r}_{i},{r}_{j}\right)= exp\left(-{\gamma }^{1}\| {D}^{v}\left({r}_{i}\right)- {D}^{v}\left({r}_{j}\right){\| }^{2}\right)$$
(1)
$${\gamma }^{1}=1/\left(\frac{1}{{n}_{r}}\sum_{i=1}^{{n}_{r}}\| {D}^{v}\left({r}_{i}\right){\| }^{2}\right)$$
(2)

where ‖·‖ denotes the Frobenius norm.

Obviously, based on above Eq. (1), we can obtain a GIP kernel similarity matrix \({A}_{rg}^{v}\in {R}^{{n}_{r}*{n}_{r}}\) for drugs.

In a similar way, for any two given microbes \({m}_{i}\) and \({m}_{j}\), we can also calculate the GIP kernel similarity between them as follows:

$${A}_{mg}^{v}\left({m}_{i},{m}_{j}\right)= exp\left(-{\gamma }^{2}\| {D}^{v}\left({m}_{i}\right)- {D}^{v}\left({m}_{j}\right){\| }^{2}\right)$$
(3)
$${\gamma }^{2}=1/\left(\frac{1}{{n}_{m}}\sum_{i=1}^{{n}_{m}}\| {D}^{v}\left({m}_{i}\right){\| }^{2}\right)$$
(4)

Obviously, based on above Eq. (3), we can obtain a GIP kernel similarity matrix \({A}_{mg}^{v}\in {R}^{{n}_{m}*{n}_{m}}\) for microbes as well.

Next, based on the assumption that when two nodes have highly dissimilar interaction characteristics, they are less comparable to each other [15], for any two given drugs \({r}_{i}\) and \({r}_{j}\), we will calculate the Hamming Interaction Profile (HIP) similarity between them as follows:

$${A}_{rh}^{v}\left({r}_{i},{r}_{j}\right)=1-\frac{|{D}^{v}\left({r}_{i}\right)!={D}^{v}\left({r}_{j}\right) |}{|{D}^{v}\left({r}_{i}\right)|}$$
(5)

Here, |\({D}^{v}\left({r}_{i}\right)\)| represents the number of elements in \({D}^{v}\left({r}_{i}\right)\), and \(|{D}^{v}\left({r}_{i}\right)!={D}^{v}\left({r}_{j}\right) |\) indicates the number of distinct elements between \({D}^{v}\left({r}_{i}\right)\) and \({D}^{v}\left({r}_{j}\right)\).

Similarly, for any two given microbe \({m}_{i}\) and \({m}_{j}\), the HIP similarity between them can be determined as follows:

$${A}_{mh}^{v}\left({m}_{i},{m}_{j}\right)=1-\frac{|{D}^{v}\left({m}_{i}\right)!={D}^{v}\left({m}_{j}\right) |}{|{D}^{v}\left(m\right)|}$$
(6)

Here, \(|{D}^{v}\left({m}_{i}\right)!={D}^{v}\left({m}_{j}\right) |\) indicates the number of distinct elements between \({D}^{v}\left({m}_{i}\right)\) and \({D}^{v}\left({m}_{j}\right)\), and \(|{D}^{v}\left(m\right)|\) denotes the number of elements in \({D}^{v}\left(m\right)\).

Hence, based on above Eqs. (5) and (6), we can obtain two HIP similarity matrices \({A}_{rh}^{v}\in {R}^{{n}_{r}*{n}_{r}}\) and \({A}_{mh}^{v}\in {R}^{{n}_{m}*{n}_{m}}\) for drugs and microbes separately.

Finally, for any two given drugs \({r}_{i}\) and \({r}_{j}\), it is evident that we can construct an integrated similarity between them by integrating \({A}_{rg}^{v}\) and \({A}_{rh}^{v}\) as follows:

$$A_{r}^{v} \left( {r_{i} ,r_{j} } \right) = \left\{ {\begin{array}{*{20}l} {1: } \hfill & {{\text{if there is a known association between}}\; r_{i} \;{\text{and }}\;r_{j} } \hfill \\ {\frac{{A_{rg}^{v} \left( {r_{i} ,r_{j} } \right) + A_{rh}^{v} \left( {r_{i} ,r_{j} } \right)}}{2}: } \hfill & {\text{ therwise}} \hfill \\ \end{array} } \right.$$
(7)

Similarly, for any two given microbes \({m}_{i}\) and \({m}_{j}\), we can construct an integrated similarity between them by integrating \({A}_{mg}^{v}\) and \({A}_{mh}^{v}\) as follows:

$$A_{m}^{v} \left( {m_{i} ,m_{j} } \right) = \left\{ {\begin{array}{*{20}l} {1 :} \hfill & { {\text{if there is a known association between}}\; m_{i} \; {\text{and}}\; m_{j} } \hfill \\ {\frac{{A_{mg}^{v} \left( {m_{i} ,m_{j} } \right) + A_{mh}^{v} \left( {m_{i} ,m_{j} } \right)}}{2}:} \hfill & {{\text{otherwise}}} \hfill \\ \end{array} } \right.$$
(8)

Hence, based on above Eqs. (7) and (8), we can finally obtain two new matrices \({H}^{1}\in {R}^{\left({n}_{r}+{n}_{m}\right)*\left({n}_{r}+{n}_{m}\right)}\) and \({H}^{2}\in {R}^{\left({n}_{r}+{n}_{m}\right)*\left({n}_{r}{+n}_{m}\right)}\) as follows:

$${H}^{1}=\left[\begin{array}{cc}{A}_{r}^{1}& {D}^{1}\\ {\left({D}^{1}\right)}^{T}& {A}_{m}^{1}\end{array}\right]$$
(9)
$${H}^{2}=\left[\begin{array}{cc}{A}_{r}^{2}& {D}^{2}\\ {\left({D}^{2}\right)}^{T}& {A}_{m}^{2}\end{array}\right]$$
(10)

Obviously, based on the above two matrices H1 and H2, two heterogeneous microbe-drug networks \({{\text{HN}}}_{1}\) and \({{\text{HN}}}_{2}\) can be constructed respectively.

Extracting low-dimensional feature representations for microbes and drugs by GAT

Constructing unique feature matrix for microbes and drugs

In this section, we will first adopt the SIMCOMP2 [16] to determine the structural similarity between any two given drugs \({r}_{i}\) and \({r}_{j}\), and obtain a new drug structural similarity matrix \({A}_{rc}\). Next, we will utilize the method presented by Kamneva [17] to determine the functional similarity between any two given microorganisms \({m}_{i}\) and \({m}_{j}\), and create a new microbe functional similarity matrix \({A}_{mf}\). And then, we will further perform RWR [39] on \({A}_{r}^{v}\) and \({A}_{m}^{v}\) separately in the following way:

$${q}_{i}^{l+1}=\lambda Q{q}_{i}^{l}+(1-\lambda ){\beta }_{i}$$
(11)

In above equations, Q is the matrix of transition probabilities, \({q}_{i}^{l}\) is the likelihood of node i transferring to the node l, and \({\beta }_{i}\in {R}^{1*n}\) is the starting odds vector for the node i, and the j-th element in \({\beta }_{i}\) is defined as follows:

$$\beta_{i,j} = \left\{ {\begin{array}{*{20}l} {1:} \hfill & {{\text{if}}\; i = j} \hfill \\ {0:} \hfill & {otherwise} \hfill \\ \end{array} } \right.$$
(12)

Obviously, based on above Eqs. (11) and (12), we can obtain two different matrices \({A}_{rr}^{v}\) and \({A}_{mm}^{v}\) based on \({A}_{r}^{v}\) and \({A}_{m}^{v}\) respectively.

Thereafter, based on above newly obtained matrices, we can construct a unique feature matrix to preserve more original features of microbes and drugs as follows:

$${S}^{v}=\left[\begin{array}{c}{F}_{r}^{v}\\ {F}_{m}^{v}\end{array}\right]$$
(13)

where,

$${F}_{r}^{v}= \left[{A}_{rc};{D}^{v};{A}_{rr}^{v};{D}^{v}\right]$$
(14)
$${F}_{m}^{v}=\left[{\left({A}^{v}\right)}^{T};{S}_{mf};{\left({A}^{v}\right)}^{T};{S}_{mm}^{v}\right]$$
(15)

From above Eqs. (13), (14) and (15), it is clear that there is \({S}^{v}\in {R}^{\left({n}_{r}+{n}_{m}\right)*{k}_{1}}\) \(\left(v=\mathrm{1,2}\right)\), where, k1 represents the number of columns in \({S}^{v}\).

The structure of the two-layer GAT

Encoder: To determine the degree of similarity between any given node i and one of its neighboring node j in \({H}^{v}\left(v=\mathrm{1,2}\right)\), we will compute the similarity coefficient \({e}_{ij}\) between them as follows:

$${e}_{ij}= LeakyRelu\left(\alpha \left[{W}^{v}{S}^{v}\left(i\right);{W}^{v}{S}^{v}\left(j\right)\right]\right),j\in {\varphi }_{i}^{v}$$
(16)
$$LeakyRelu\left( x \right) = \left\{ {\begin{array}{*{20}l} x \hfill & {x > 0} \hfill \\ {\mu x} \hfill & {otherwise} \hfill \\ \end{array} } \right.$$
(17)

where \({S}^{v}\left(i\right)\) denotes the i-th row of \({S}^{v}\), \(\alpha\) is an operation for feature mapping, \({W}^{v}\) is a trainable weight matrix, \({\varphi }_{i}^{v}\) is the collection of nodes that are adjacent to i in \({H}^{v}\), and \(\mu\) is a hyper-parameter varying between 0 and 1.

Based on above Eq. (16), for any two given nodes i and j, then the attention score \({\rho }_{ij}\) between them can be calculated as follows:

$${\rho }_{ij}= \frac{exp\left({e}_{ij}\right)}{\sum_{k\in {\varphi }_{i}^{v}}exp\left({e}_{ik}\right)}$$
(18)

Obviously, based on above attention score \({\rho }_{ij}\), a new feature of node i, representing the weighted sum of the features of its neighboring nodes, can be obtained as follows:

$${M}^{v}\left(i\right)=Relu\left(\sum_{j\in {\varphi }_{i}^{v}}{\rho }_{ij}{W}^{v}{S}^{v}\left(j\right)\right)$$
(19)
$$Relu\left( x \right) = \left\{ {\begin{array}{*{20}l} x \hfill & {x > 0} \hfill \\ 0 \hfill & {otherwise} \hfill \\ \end{array} } \right.$$
(20)

Hence, we can construct a new feature representation matrix \({M}^{v}\) as follows:

$${M}^{v}=\left[\begin{array}{c}{R}_{r}^{v}\\ {R}_{m}^{v}\end{array}\right]\in {R}^{\left({n}_{r}+{n}_{m}\right)*{k}_{2}}$$
(21)

Here, \({k}_{2}\) represents the nunber of columns in \({M}^{v}\).

Decoder: Te decoder adopts the same structure as the encoder, and is defined as follows::

$$M^{\prime v} = sigmoid\left( {M^{v} \cdot \left( {M^{v} } \right)^{T} } \right)$$
(22)
$$sigmoid\left(x\right)= \frac{1}{1+{e}^{-x}}$$
(23)

Optimization: Taking into account the fact that the reconstructed matrix differs from the raw matrix, we adopt the MSE loss factor to determine the average of the sum of differences squared between \(M^{\prime v}\) and \(H^{v}\). The MSE loss function is defined as follows:

$$Loss = \frac{1}{{n_{r} + n_{m} }}\mathop \sum \limits_{i = 1}^{{n_{r} + n_{m} }} M^{\prime v} \left( i \right) - H^{v} \left( i \right)^{2}$$
(24)

where \(M^{\prime v} \left( i \right)\) and \(H^{v} \left( i \right)\) denote the i-th row of \(M^{\prime v}\) and \({H}^{v}\) respectively.

Finally, Finally, the Adam optimizer [40] will be further used to optimize the loss function in the model training process.

Furthermore, we present the workflow of the two-layer GAT in the following Fig. 2 for better understanding the implementation of the above two-layer GAT.

Fig. 2
figure 2

workflow of the two-layer GAT in GARFMDA

The structure of the two-layer random forest

Traditional machine learning, when faced with complex nonlinear patterns, may suffer from drawbacks such as overfitting problems and the inability to provide uncertainty estimates of the predicted outcomes [18]. In order to calculate the potential scores of unknown drug-microbe relationships, we will create a two-layer random forest model in this section and treat the drug-microbe problem as a binary classification problem, which can improve the model effect and reduces the risk of overfitting through the selection of features in the first layer of the random forest. For the input of the first layer of the two-layer random forest, we will respectively construct two feature matrices \({B}_{r}^{v}\) and \({B}_{m}^{v}\) according to the following equations:

$${B}_{r}^{v}= \left[{R}_{r}^{v}; {F}_{r}^{v}\right]$$
(25)
$${B}_{m}^{v}= \left[{R}_{m,}^{v};{F}_{m}^{v}\right]$$
(26)

And then, for any given drug \({r}_{i}\) and microbe \({m}_{j}\), let \({B}_{r}^{v}\left(i\right)\) and \({B}_{m}^{v}\left(j\right)\) represent the i-th row of \({B}_{r}^{v}\) and the j-th column of \({B}_{m}^{v}\) respectively, and \({F}^{v}\left(i,j\right)= \left[\begin{array}{c}{B}_{r}^{v}\left(i\right)\\ {B}_{m}^{v}\left(j\right)\end{array}\right]\in {R}^{\left({n}_{r}\times {n}_{m}\right)*2*{k}_{3}}\), where k3 represents the number of columns in \({F}^{v}\), then we will feed \({F}^{v}\) into the first layer of the bilayer random forest.

Moreover, in the first layer of the bilayer random forest, we will assume that the number of decision trees is p and the maximum depth is s. And after training, we will compare the magnitude of the contribution made by each feature during the growth of each decision tree in the bilayer random forest by calculating the sum of the Gini index [19] changes of each feature over all the decision trees in the forest \(G\left(tr\right)\) to represent the contribution made by the feature \(C\left(tr\right)\), which is defined as follows:

$$G\left(tr\right)=\sum Gini\left({F}^{v}\left(tr\right)\right)-Gini\left({F}_{h}^{v}\left(tr\right)\right)$$
(27)
$$C\left( {tr} \right) = \left( {G\left( {tr} \right)/\sum G\left( k \right)} \right)*100\% , \;\;where \;\;k \in \left( {1,m} \right)$$
(28)

where tr denotes the feature index, h represents the decision tree index, and m is the total number of features. \(Gini\left({F}_{h}^{v}\left(tr\right)\right)\) denotes the Gini index on the decision tree h conditional on the feature tr.

After that, we will eliminate the features with contribution value less than L, and obtain a new feature matrix \({F}^{{\prime}v}\), which will be fed into the second layer of the bilayer random forest for training and prediction. Hence, we can obtain a score matrix finally.

Obviously, based on the matrices \({H}^{1}\) and \({H}^{2}\), we can obtain two different score matrices \({Score}^{1}\) and \({Score}^{2}\) respectively. Therefore, we can construct an integrated score matrix \(S\in {R}^{{n}_{r}*{n}_{m}}\) as follows:

$$S\left(i,j\right)= \frac{{Score}^{1}\left(i,j\right)+{Score}^{2}\left(i,j\right)}{2}$$
(29)

Results

In this section, we will first examine the impact of parameters on the prediction performance of GARFMDA. And then, we will compare GARFMDA with five cutting-edge competitive prediction techniques. Finally, in order to illustrate the efficiency of GARFMDA, we will introduce some well-known drugs and microbes for case studies.

Sensitivity analysis of hyperparameters

From above descriptions, it is clear that there are some important parameters in GARFMDA, including the GAT learning rate, the GAT dropout rate, the maximum depth of the decision tree in the bilayer random forest, and the contribution value of these chosen features. In this section, we will execute 10 times of fivefold Cross Validation (CV) on MDAD to assess impact of these parameters on the effectiveness of GARFMDA for determining the best values of these parameters.

For simplicity, in experiments, we will use the abbreviations lr, dp, s and l to stand for the learning rate and the dropout rate of GAT, the maximum depth of the first and second layers of the decision tree in the bilayer random forest, and the contribution value of these chosen features, respectively. Firstly, we will evaluate the impact of lr on the prediction performance of GARFMDA while it varies in the range of {0.0001, 0.001, 0.01, 0.05, 0.1}. From observing the following Fig. 3a, it is clear that when lr is set to 0,01, GARFMDA can achieve the highest value of AUC. Next, we will limit the value of dp to a range of {0.2, 0.4, 0.5, 0.7}, and as shown in Fig. 3b, it is obvious that when dp is set to 0.4, GARFMDA can achieve the highest value of AUC. Additionally, we will restrict the value of s to the range of {1, 3, 5, 7, 9} and as illustrated in Fig. 3c, it is evident that when s is set to 7, GARFMDA can achieve the highest value of AUC. Finally, we will limit the value of l to a range of {0.0001, 0.0005, 0.001, 0.0012, 0.0015}, and as shown in Fig. 3d, the performance of GARFMDA will reach to the best when l is set to 0.0012.

Fig. 3
figure 3

Effects of parameters on performance of GARFMDA. a and b show the AUC values achieved by GARFMDA with different learning and abandonment rates of GAT, respectively. c and d illustrate the AUC values achieved by GARFMDA under different maximum depths of decision trees and contribution values of selected features in the bilayer random forest, separately

As for the parameter pf of the number of random forest trees in the bilayer random forest, we found through comparative experiments that the effect of the value of pf on the prediction performance of GARFMDA is not significant, but the computational efficiency of GARFMDA will be reduced when pf is set to a large number, therefore, we will set the size of decision trees in both layers of the bilayer random forest to 250 during experiments. Similarly, for the parameter of the number of training rounds of GAT, we found through experiments that its numerical size has little effect on the prediction performance of GARFMDA, so we will set it to 10. Furthermore, to make our model better, we will use these parameters that work best to evaluate GARFMDA, i.e., we will set lr to 0.01, dp to 0.4, s to 7 and l to 0.0012 in subsequent comparison experiments.

Comparison with state-of-the-art methods

To validate the predictive performance of GARFMDA, we will compare it with the following five representative approaches separately:

  1. (1)

    LAGCN [20]: which is a computational model for inferring unknown drug-disease associations based on graph convolutional networks and attention mechanisms

  2. (2)

    GSAMDA [21]: which is a microbe-drug association prediction model based on graph attention networks and sparse autoencoders

  3. (3)

    SCSMDA [22]: which aims to predict microbe-drug associations based on the structure-enhanced contrast learning and self-paced negative sampling strategies.

  4. (4)

    MDASAE [23]: which is a calculation method based on fusing multi-attention mechanisms with stacked autoencoders to detect possible microbial drug associations.

  5. (5)

    LRLSHMDA [24]: which is a computational scheme by exploiting Laplace Regularised Least Squares to predict microbe-disease associations.

During experiments, we will adopt the AUC values, the Accuracy values and the F1-score values as performance indicators and compare all of these rival approaches under the framework of tenfold cross validation. Experimental results are shown in the following Table 2 and Fig. 4 respectively. From observing the Table 2, it is easy to see that GARFMDA can reach to the highest AUC value of 0.9794 ± 0.0012, while MDASAE comes in second with an AUC value of 0.9701 ± 0.0023, and LAGCN has the lowest AUC value of 0.8544 ± 0.0042. As For the Accuracy values and F1-score values, GARFMDA can as well obtain the highest values of 0.9955 and 0.7106 respectively. Therefore, It is obvious that GARFMDA can achieve the best prediction performance among all these five competing models.

Table 2 AUC values, Accuracy values and F1-score values obtained by GARFMDA and five competing methods under the framework of tenfold CV on MDAD
Fig. 4
figure 4

ROC curves achieved by competing techniques on MDAD

Case study

In this section, we will undertake case studies of two well-known medications and one well-known microbe to better illustrate the efficacy of GARFMDA. In experiments, we will choose the top 20 candidate microbes or drugs predicted by GARFMDA and search in PubMed (https://pubmed.ncbi.nlm.nih.gov) for these candidate microbes or drugs to see if any publications had reported about them. Among them, the first drug we have chosen is ciprofloxacin, which is a synthetic second-generation quinolone antimicrobial drug with broad-spectrum antimicrobial activity and bactericidal efficacy, and can be used to treat illnesses caused by mycobacterium influenzae, escherichia coli, and pneumococcus specific polysaccharide [25]. In both vitro and vivo studies of ciprofloxacin, a very low incidence of resistant microorganisms has been reported [26].

In addition, Alhajj et al. [27] developed a dry powder of ciprofloxacin for inhalation for treating cystic fibrosis lung infections. Golapudi et al. demonstrated that ciprofloxacin inhibits TNF-(α)-induced HIV secretion in U1 cells [28]. Table 3 illustrates that there are 19 out of those top 20 predicted potential bacteria having been confirmed by published journals to be related to ciprofloxacin.

Table 3 The top 20 predicted candidate ciprofloxacin-associated bacteria. In this table, the first column lists the top 10 predicted microbes, while the third column lists the top 11 to 20 predicted microbes

The second drug we have selected is moxifloxacin, a quinolone broad-spectrum antimicrobial that treats adults (≥ 18 years of age) suffering from respiratory tract infections, both upper and lower [29], as well as acute sinusitis [30], acute exacerbations of chronic bronchitis [31], community-acquired pneumonia [32], and skin and soft tissue infections [33]. Januel et al. [34] studied the use of moxifloxacin to treat the genetic disorder spinal muscular atrophy (SMA). However, Inada et al. [35] found that moxifloxacin can induce aortic aneurysms and clips by increasing bone bridging proteins in mice.

Table 4 shows that there are 15 out of the top 20 predicted candidate microorganisms have been confirmed by published journals to be associated with moxifloxacin, demonstrating the value of GARFMDA for clinical drug application and the identification of possible drug-related bacteria.

Table 4 The top 20 predicted candidate moxifloxacin-associated bacteria. In this table, the first column lists the top10 predicted microbes, while the third column lists the top 11 to 20 predicted microbes

The microorganism that we have selected is E. coli, a conditionally pathogenic bacterium that under certain conditions can cause gastrointestinal infections or a variety of localised tissue and organ infections such as urogenital infections in humans and a wide range of animals [36]. Pathogenic E. coli can cause more than 16.01 billion cases of dysentery [37] and 1 million deaths annually, whereas non-pathogenic E. coli are part of the normal gut flora of healthy mammals and birds. For example, it is anticipated that the E. coli strain nissle will be utilized to cure human illnesses in addition to being utilized as a probiotic and therapeutic agent [38]. As shown in Table 5, 15 out of the top 20 predicted drugs have been confirmed by published journals to be associated with the E. coli.

Table 5 The top 20 forecasted drugs linked to E. coli. In this table, the first column lists the top 10 predicted drugs, while the third column lists the top 11 to 20 predicted drugs

Conclusion and discussion

In this paper, we developed a new prediction model called GARFMDA by combining a two-layer GAT with a two-layer random forest to detect possible drug-microbe correlations. Results of both comparison experiments and case studies showed that GARFMDA exceeded these state-of-the-art competitive prediction models. Naturally, GARDFMDA can also be adopted to solve other problems involving the association prediction of biological entities, such as the prediction of associations between diseases and circRNA and microbes. Of course, GARFMDA can yet be improved. For instance, we can add more biological data, like microbial sequencing information, to the feature selection section [9]. Additionally, because the dataset is sparse, the model frequently results in the overfitting phenomena. To address this issue, we can also think about data augmentation. Moreover, the public database is not updated in real time, which may affect the way that the model is used in practice, therefore, we might consider to reconstruct an extensive database in the future.

Availability of data and materials

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding authors.

References

  1. Human Microbiome Project Consortium. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486(7402):207–14. https://doi.org/10.1038/nature11234.

    Article  ADS  CAS  Google Scholar 

  2. Thiele I, Heinken A, Fleming RM. A systems biology approach to studying the role of microbes in human health. Curr Opin Biotechnol. 2013;24(1):4–12. https://doi.org/10.1016/j.copbio.2012.10.001.

    Article  CAS  PubMed  Google Scholar 

  3. Young VB. The role of the microbiome in human health and disease: an introduction for clinicians. BMJ. 2017;356:j831. https://doi.org/10.1136/bmj.j831.

    Article  PubMed  Google Scholar 

  4. Hughes D, Andersson DI. Evolutionary trajectories to antibiotic resistance. Annu Rev Microbiol. 2017;71:579–96. https://doi.org/10.1146/annurev-micro-090816-093813.

    Article  CAS  PubMed  Google Scholar 

  5. Sun YZ, Zhang DH, Cai SB, Ming Z, Li JQ, Chen X. MDAD: a special resource for microbe-drug associations. Front Cell Infect Microbiol. 2018;8:424. https://doi.org/10.3389/fcimb.2018.00424.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Rajput A, Thakur A, Sharma S, Kumar M. aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res. 2018;46(D1):D894–900. https://doi.org/10.1093/nar/gkx1157.

    Article  CAS  PubMed  Google Scholar 

  7. Andersen PI, Ianevski A, Lysvand H, et al. Discovery and development of safe-in-man broad-spectrum antiviral agents. Int J Infect Dis. 2020;93:268–76. https://doi.org/10.1016/j.ijid.2020.02.018.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Zhu L, Duan G, Yan C, Wang J. Prediction of microbe-drug associations based on KATZ measure. In: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), San Diego, CA, USA; 2019. pp. 183–187. https://doi.org/10.1109/BIBM47256.2019.8983209.

  9. Deng L, Huang Y, Liu X, Liu H. Graph2MDA: a multi-modal variational graph embedding model for predicting microbe-drug associations. Bioinformatics. 2022;38(4):1118–25. https://doi.org/10.1093/bioinformatics/btab792.

    Article  CAS  PubMed  Google Scholar 

  10. Long Y, Luo J. Association mining to identify microbe drug interactions based on heterogeneous network embedding representation. IEEE J Biomed Health Inform. 2021;25(1):266–75. https://doi.org/10.1109/JBHI.2020.2998906.

    Article  PubMed  Google Scholar 

  11. Ma Q, Tan Y, Wang L. GACNNMDA: a computational model for predicting potential human microbe-drug associations based on graph attention network and CNN-based classifier. BMC Bioinform. 2023;24:35. https://doi.org/10.1186/s12859-023-05158-7.

    Article  Google Scholar 

  12. Huang H, Sun Y, Lan M, Zhang H, Xie G. GNAEMDA: microbe-drug associations prediction on graph normalized convolutional network. IEEE J Biomed Health Inform. 2023. https://doi.org/10.1109/JBHI.2022.3233711.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Cheng X, Qu J, Song S, Bian Z. Neighborhood-based inference and restricted Boltzmann machine for microbe and drug associations prediction. PeerJ. 2022;10:e13848. https://doi.org/10.7717/peerj.13848.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Li H, Hou ZJ, Zhang WG, Qu J, Yao HB, Chen Y. Prediction of potential drug-microbe associations based on matrix factorization and a three-layer heterogeneous network. Comput Biol Chem. 2023;104:107857. https://doi.org/10.1016/j.compbiolchem.2023.107857.

    Article  CAS  PubMed  Google Scholar 

  15. Xu D, Xu H, Zhang Y, Wang M, Chen W, Gao R. MDAKRLS: Predicting human microbe-disease association based on Kronecker regularized least squares and similarities. J Transl Med. 2021;19(1):66. https://doi.org/10.1186/s12967-021-02732-6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Hattori M, Tanaka N, Kanehisa M, et al. SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res. 2010;38(2):W652–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Kamneva OK. Genome composition and phylogeny of microbes predict their co-occurrence in the environment. PLoS Comput Biol. 2017;13(2):e1005366.

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  18. Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science. 2006;313(5786):504–7. https://doi.org/10.1126/science.1127647.

    Article  ADS  MathSciNet  CAS  PubMed  Google Scholar 

  19. Ceriani L, Verme P. The origins of the Gini index: extracts from Variabilità e Mutabilità (1912) by Corrado Gini. J Econ Inequal. 2012;10:421–43. https://doi.org/10.1007/s10888-011-9188-x.

    Article  Google Scholar 

  20. Yu Z, Huang F, Zhao X, Xiao W, Zhang W. Predicting drug-disease associations through layer attention graph convolutional network. Brief Bioinform. 2021;22(4):bbaa243. https://doi.org/10.1093/bib/bbaa243.

    Article  CAS  PubMed  Google Scholar 

  21. Tan Y, Zou J, Kuang L, et al. GSAMDA: a computational model for predicting potential microbe-drug associations based on graph attention network and sparse autoencoder. BMC Bioinform. 2022;23(1):492. https://doi.org/10.1186/s12859-022-05053-7.

    Article  Google Scholar 

  22. Tian Z, Yu Y, Fang H, Xie W, Guo M. Predicting microbe-drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy. Brief Bioinform. 2023;24(2):bbac634. https://doi.org/10.1093/bib/bbac634.

    Article  PubMed  Google Scholar 

  23. Fan L, Wang L, Zhu X. A novel microbe-drug association prediction model based on stacked autoencoder with multi-head attention mechanism. Sci Rep. 2023;13:7396. https://doi.org/10.1038/s41598-023-34438-8.

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  24. Wang F, Huang ZA, Chen X, et al. LRLSHMDA: Laplacian regularized least squares for human microbe-disease association prediction. Sci Rep. 2017;7:7601. https://doi.org/10.1038/s41598-017-08127-2.

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  25. Campoli-Richards DM, Monk JP, Price A, Benfield P, Todd PA, Ward A. Ciprofloxacin: a review of its antibacterial activity, pharmacokinetic properties and therapeutic use. Drugs. 1988;35(4):373–447. https://doi.org/10.2165/00003495-198835040-00003.

    Article  CAS  PubMed  Google Scholar 

  26. Zhang GF, Liu X, Zhang S, Pan B, Liu ML. Ciprofloxacin derivatives and their antibacterial activities. Eur J Med Chem. 2018;25(146):599–612. https://doi.org/10.1016/j.ejmech.2018.01.078.

    Article  CAS  Google Scholar 

  27. Alhajj N, O’Reilly NJ, Cathcart H. Developing ciprofloxacin dry powder for inhalation: a story of challenges and rational design in the treatment of cystic fibrosis lung infection. Int J Pharm. 2022;613: 121388. https://doi.org/10.1016/j.ijpharm.2021.121388.

    Article  CAS  PubMed  Google Scholar 

  28. Gollapudi S, Kim CH, Roshanravan B, Gupta S. Ciprofloxacin inhibits activation of latent human immunodeficiency virus type 1 in chronically infected promonocytic U1 cells. AIDS Re Hum Retrovir. 1998;14:499–504. https://doi.org/10.1089/aid.1998.14.499.

    Article  CAS  Google Scholar 

  29. Nightingale CH. Moxifloxacin, a new antibiotic designed to treat community-acquired respiratory tract infections: a review of microbiologic and pharmacokinetic-pharmacodynamic characteristics. Pharmacotherapy. 2000;20(3):245–56. https://doi.org/10.1592/phco.20.4.245.34880.

    Article  MathSciNet  CAS  PubMed  Google Scholar 

  30. Johnson P, Cihon C, Herrington J, Choudhri S. Efficacy and tolerability of moxifloxacin in the treatment of acute bacterial sinusitis caused by penicillin-resistant Streptococcus pneumoniae: a pooled analysis. Clin Ther. 2004;26(2):224–31. https://doi.org/10.1016/s0149-2918(04)90021-5.

    Article  CAS  PubMed  Google Scholar 

  31. Wilson R, Macklin-Doherty A. The use of moxifloxacin for acute exacerbations of chronic obstructive pulmonary disease and chronic bronchitis. Expert Rev Respir Med. 2012;6(5):481–92. https://doi.org/10.1586/ers.12.50.

    Article  CAS  PubMed  Google Scholar 

  32. Torres A, Garrity-Ryan L, Kirsch C, et al. Omadacycline vs moxifloxacin in adults with community-acquired bacterial pneumonia. Int J Infect Dis. 2021;104:501–9. https://doi.org/10.1016/j.ijid.2021.01.032.

    Article  CAS  PubMed  Google Scholar 

  33. Fluoroquinolones. In LiverTox: Clinical and Research Information on Drug-Induced Liver Injury. Bethesda (MD): National Institute of Diabetes and Digestive and Kidney Diseases; March 10, 2020.

  34. Januel C, Menduti G, Mamchaoui K, et al. Moxifloxacin rescues SMA phenotypes in patient-derived cells and animal model. Cell Mol Life Sci. 2022;79(8):441. https://doi.org/10.1007/s00018-022-04450-8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Inada K, Koga M, Yamada A, Dohgu S, Yamauchi A. Moxifloxacin induces aortic aneurysm and dissection by increasing osteopontin in mice. Biochem Biophys Res Commun. 2022;629:1–5. https://doi.org/10.1016/j.bbrc.2022.08.080.

    Article  CAS  PubMed  Google Scholar 

  36. Leimbach A, Hacker J, Dobrindt U. E. coli as an all-rounder: the thin line between commensalism and pathogenicity. Curr Top Microbiol Immunol. 2013;358:3–32. https://doi.org/10.1007/82_2012_303.

    Article  PubMed  Google Scholar 

  37. Wirth T, Falush D, Lan R, et al. Sex and virulence in Escherichia coli: an evolutionary perspective. Mol Microbiol. 2006;60(5):1136–51. https://doi.org/10.1111/j.1365-2958.2006.05172.x.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Pradhan S, Weiss AA. Probiotic properties of Escherichia coli Nissle in Human Intestinal Organoids. MBio. 2020;11(4):e01470-e1520. https://doi.org/10.1128/mBio.01470-20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Köhler S, Bauer S, Horn D, et al. Walking the interactome for prioritization of candidate disease genes. Am J Hum Genet. 2008;82(4):949–58.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Kingma D, Ba J. Adam: a method for stochastic optimization. Comput Sci. 2014;10(22):1–15.

    ADS  Google Scholar 

Download references

Acknowledgements

The authors thank the referees for suggestions that helped improve the paper substantially.

Funding

This work was partly sponsored by the National Natural Science Foundation of China (No.62272064), and the Natural Science Foundation of Hunan Province (No.2023JJ60185).

Author information

Authors and Affiliations

Authors

Contributions

HK, HZ and XX produced the main ideas, and did the modeling, computation and analysis and also wrote the manuscript. LW, ZZ, XL and BZ provided supervision and effective scientific advice and related ideas, research design guidance, and added value to the article through editing and contributing completions. All authors contributed to the article and approved the submitted version.

Corresponding authors

Correspondence to Zhen Zhang, Bin Zeng, Xin Liu or Lei Wang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. The ID Numbers and Names of Newly-Downloaded Diseases.

Additional file 2

. The ID Numbers and Names of Newly-Downloaded Drugs.

Additional file 3

. The ID Numbers and Names of Newly-Downloaded Diseases.

Additional file 4

. Newly-Downloaded Known Associations between Drugs and Diseases.

Additional file 5

. Newly-Downloaded Known Associations between Drugs.

Additional file 6

. Newly-Downloaded Known Associations between Drugs and mcrobes.

Additional file 7

. Newly-Downloaded Known Associations between Microbes and Diseases.

Additional file 8

. Newly-Downloaded Known Associations between Microbes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kuang, H., Zhang, Z., Zeng, B. et al. A novel microbe-drug association prediction model based on graph attention networks and bilayer random forest. BMC Bioinformatics 25, 78 (2024). https://doi.org/10.1186/s12859-024-05687-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12859-024-05687-9

Keywords