Quaternary structure predictions of transmembrane proteins starting from the monomer: a docking-based approach

Casciari, D; Seeber, M; Fanelli, F

doi:10.1186/1471-2105-7-340

Methodology article
Open access
Published: 12 July 2006

Quaternary structure predictions of transmembrane proteins starting from the monomer: a docking-based approach

D Casciari¹,
M Seeber¹ &
F Fanelli¹

BMC Bioinformatics volume 7, Article number: 340 (2006) Cite this article

6082 Accesses
40 Citations
Metrics details

Abstract

Background

We introduce a computational protocol for effective predictions of the supramolecular organization of integral transmembrane proteins, starting from the monomer. Despite the demonstrated constitutive and functional importance of supramolecular assemblies of transmembrane subunits or proteins, effective tools for structure predictions of such assemblies are still lacking. Our computational approach consists in rigid-body docking samplings, starting from the docking of two identical copies of a given monomer. Each docking run is followed by membrane topology filtering and cluster analysis. Prediction of the native oligomer is therefore accomplished by a number of progressive growing steps, each made of one docking run, filtering and cluster analysis. With this approach, knowledge about the oligomerization status of the protein is required neither for improving sampling nor for the filtering step. Furthermore, there are no size-limitations in the systems under study, which are not limited to the transmembrane domains but include also the water-soluble portions.

Results

Benchmarks of the approach were done on ten homo-oligomeric membrane proteins with known quaternary structure. For all these systems, predictions led to native-like quaternary structures, i.e. with C_α-RMSDs lower than 2.5 Å from the native oligomer, regardless of the resolution of the structural models.

Conclusion

Collectively, the results of this study emphasize the effectiveness of the prediction protocol that will be extensively challenged in quaternary structure predictions of other integral membrane proteins.

Background

A number of α-helical transmembrane (TM) proteins organize themselves in supramolecular assemblies, which constitute the functional units (i.e. biological units) of ion/water channels, electron or proton transfer proteins as well as transporters [1]. Also the 7-TM bacteriorhodopsin (BRD) forms trimers, which, in turn, form multimolecular assemblies in the native purple membrane [2, 3]. G-protein-coupled receptors (GPCRs), which constitute the largest superfamily of membrane proteins, are not exceptions to this rule. Although they have been classically assumed to exist and function as monomeric entities, the concept that they exist as constitutive dimers/oligomers is now substantiated by ever increasing evidences from in vitro studies (reviewed in refs. [4–7]). Predictions of the likely interfaces in GPCR dimers done so far essentially relied on sequence-based methods [8–13] (reviewed also in Ref[14]).

Despite the demonstrated constitutive and functional importance of supramolecular assemblies of TM subunits or proteins, effective tools other than sequence-based methods for structure predictions of such assemblies are still lacking. Indeed, almost all the docking algorithms and approaches to quaternary structure predictions developed so far work with water-soluble proteins and most of them employ geometrical constraints, including symmetry information [15–20]. The very few approaches intended for quaternary structure predictions of TM-helical proteins include a method based on Monte Carlo sampling of single TM helices [21]. The method is limited to very simple homo-oligomers and requires knowledge about the oligomerization state of the protein, since symmetry information is used to filter the most realistic solutions. The information from cryo-electron microscopy and evolutionary data is employed by an alternative automated method for orienting TM helices [22]. The method has been probed on a number of membrane proteins, including the multimeric TM portion of the acetylcholine receptor [22]. Another docking method, i.e. GRAMM [23], can deal with predictions of helix-helix packing interactions involving either water-soluble or TM-systems. In fact, GRAMM can run in a "helix docking mode" that discards configurations with large displacements along the helix axes and angles between helices larger than indicated cutoffs [24]. This is instrumental in saving computational time and facilitating the analysis of the docking results.

Herein, we present an approach based upon rigid body docking simulations, membrane topology-based filtering, and cluster analysis to predict the quaternary structure of homo-oligomeric integral membrane proteins. This approach does not employ symmetry constraints either for improving sampling or in the filtering step. Furthermore, there are no size-limitations in the systems under study, which are not limited to the TM domains but include also the water-soluble portions.

Benchmarks of the approach were first carried out on the tetrameric KcsA potassium channel H⁺ gated (384 amino acids, PDB code: 1BL8; Figure 2) [25], pentameric MscL and eptameric MscS mechanosensitive channels (540 amino acids, PDB code: 1MSL, Figure 3[26], and 1771 amino acids, PDB code: 1MXM, Figure 4) [27], respectively), and trimeric BRD (698 amino acids, PDB code: 1BRR; Figure 5) [2]. Selection of these supramolecular systems followed an accurate inspection of the database of Membrane Protein structures from White's laboratory [1]. The selected proteins, indeed, fulfilled at best the following requirements: (a) the biological unit is a homo-oligomer; (b) the asymmetric crystallographic unit contains the biological unit; (c) the monomers in the biological unit are α-helical TM proteins; and (d) significant structural diversity exists among the selected oligomers concerning oligomeric order, architecture and extension of the intracellular and extracellular domains. Successively, benchmarks have been extended to a number of homo-oligomeric transmembrane proteins, selected from the same database, for which the asymmetric unit is constituted by the monomer. They include a dimer, i.e. the membrane spanning region of the BtuCD Vitamin B12 Transporter (648 amino acids, PDB code: 1L7V) [28]; trimers like the AmtB ammonia channel (1146 amino acids, PDB code: 1U77) [29] and the AcrB bacterial multi-drug efflux transporter (3108 amino acids, PDB code: 1IWG); and tetramers like the AQP1 aquaporin water channel (996 amino acids, PDB code: 1J4N) [30], the GlpF glycerol facilitator channel (1016 amino acids, PDB code: 1FX8) [31], and the KirBac1 Inward-Rectifier Potassium channel (1032 amino acids, PDB code: 1P7B) [32].

The structure of the biological unit from the PDB, obtained by symmetry operations, was used as a native complex for each of these proteins.

For all the systems under study, the approach led to native-like quaternary structures, i.e. with a Root Mean Square Deviation of the C_α-atoms (Cα-RMSD) lower than 2.5 Å from the native oligomer, regardless of the resolution of the structural models.

The effectiveness of the approach makes it suitable for predictions of the supramolecular architecture of other integral membrane proteins.

Results

The computational approach developed in this study consists of a number of dense docking samplings, starting from the docking of two identical copies of a given monomer. Each docking run is followed by membrane topology filtering and cluster analysis. Thus, prediction of the native oligomer is accomplished by a number of progressive growing steps, each made of one docking run, filtering and cluster analysis (schematized in Figure 1)).

The only requirement with this approach is the structural model of the monomer and the knowledge of a set of C_α-atoms, which lie at the two lipid/water interfaces, defining two parallel planes. In the target monomer, these two planes must be parallel to the xy plane and, hence, perpendicular to the z-axis. If these planes are parallel to the xy plane and, hence, perpendicular to the z-axis, the orientation of the monomer is considered good and no reorientation is needed. In contrast, if such planes are not parallel to the xy plane, the monomer needs a reorientation. This reorientation is done by computing the rotations necessary to bring the two planes parallel to the xy plane and then using an average between the two rotations to reorient the whole monomer. The correct membrane topology of the target monomer is, indeed, necessary for the membrane topology filter to work properly (see Methods).

The following paragraphs summarize the results of benchmarks done on ten selected oligomers.

Quaternary structure predictions of oligomeric channels: KcsA, MscL and MscS

The A subunit of KcsA is constituted by 96 amino acids (i.e. 23–119 sequence) organized in two TM α-helices connected by the 30 amino acid pore region [25]. The A subunit extracted from the crystal structure was used both as a target and a probe in the first step, and as a probe in the following docking simulations. Prior to the first docking run, a reorientation of the KcsA A subunit was needed to put it in the right orientation with respect to the putative membrane. Two different ways for reorienting the subunit were probed, i.e. a dipole moment-based approach and a membrane topology-based approach. The first consisted in rotating the native oligomer so as to put the dipole moment of the tetramer parallel to the z-axis and then extracting the A subunit. The second consisted in the following steps: (a) prediction of two sets of C_α-atoms of the monomer, which lie at the two membrane-water interfaces; (b) determination of the planes that fit at best these sets of atoms; and (c) orientation of these planes parallel to the xy plane. The first reorientation approach, applied to the native oligomer, was instrumental in designing and benchmarking the prediction protocol, whereas the second approach, applied to the monomer, is instrumental in blind predictions.

Quaternary structure predictions of KcsA consisted of two alternative two-step growing paths differing for the fact that one employed a trimer (Figure 6a), whereas the other employed a dimer (Figure 6b and 6c) as a target at step 2. The two different paths shared in common step 1, i.e. A vs A docking. A vs A docking gave 231 realistic solutions, i.e. those solutions, which passed the membrane topology filter. These solutions essentially grouped into two clusters, which also showed similar indices of membrane topology goodness (i.e. MemTop indices of 0.495 and 0.494; see the Methods section for the MemTop definition). The best scored solutions from each of these two clusters, s2 and s1 (the letter "s" followed by a number indicates the solution number, or rank number, in the ZDOCK output list), were selected for the following growing step. These solutions contributed to the growth of a cyclic oligomer, likely to represent subunits D and B, respectively, i.e. labeled in a clockwise manner as seen from the extracellular side (Figure 6a). As for the path passing through an intermediate trimer, A vs A-Bs1-Ds2 (the upper case letter indicates the subunit) docking led to 41 realistic solutions, which essentially grouped in four clusters. The best scored solution from the third cluster (characterized by the best MemTop score: 0.375), s500, was selected to produce the A-Bs1-Cs500-Ds2 tetramer. Such tetramer was characterized by a C_α-RMSD of 1.79 Å from the native structure (Figures 2b, 2c and 6a, Table 1).

Table 1 Summary of the benchmark results.

Full size table

The alternative path passing through an intermediate dimer consisted of two parallel ways, depending on whether the A-Bs1 or the A-Ds2 dimer was employed as a target (Figure 6b and 6c, respectively). In the first case, 242 realistic solutions were filtered, which essentially grouped in two clusters. The best solutions from each of these two clusters, s2 and s1, were, respectively, characterized by contacts with the A and Bs1 monomers in the target dimer, leading to the A-Bs1-Cs1-Ds2 tetramer, which showed a C_α-RMSD of 2.45 Å from the native structure (Figure 6b, Table 1). The parallel pathway, which used the A-Ds2 dimer as a target at step 2, led to 199 realistic solutions, which essentially grouped in two clusters characterized also by similar MemTop indices (i.e. 0.567 and 0.490). The best solutions from each of these two clusters, s2 and s1, led to the A-Bs2-Cs1-Ds2 tetramer, showing a C_α-RMSD of 2.27 Å from the native structure (Figure 6c, Table 1).

In summary, the three different growing paths shared in common the first step, i.e. A vs A docking, and all led to native-like tetramers (Figure 6). Among these tetramers, the one obtained by employing a trimer as a target at step 2 showed the lowest C_α-RMSD from the native complex (i.e. 1.79 Å, Table 1 and Figures 2b, 2c and 2a). The intermediate and final oligomers, with only one exception, were made of solutions falling amongst the top four out of 4000 in the respective output list (Figure 6). The results presented above were achieved by employing a dipole moment-based criterion for the initial reorientation of the target A subunit. However, employing a membrane topology-based approach for subunit reorientation equally led to native-like solutions. In fact, the path employing a trimer as a target at step 2 led to the A-Bs2-Cs12-Ds4 tetramer, showing a C_α-RMSD of 0.94 Å from the native structure, whereas one of the two paths employing a dimer as a target at step 2 led to the A-Bs1-Cs52-Ds4 tetramer, showing a C_α-RMSD of 1.94 Å from the native structure. Also in this case, employing an intermediate trimer as a target at step 2 led to better predictions than employing a dimer.

The same general approaches were employed for quaternary structure predictions of the the pentameric MscL and eptameric MscS mechanosensitive channels, but neither one of the two systems required an initial reorientation of the monomer. In detail, the A subunit of the MscL is constituted by 108 amino acids (i.e. 10–118 sequence), organized in two TM α-helices and a third cytoplasmic helix (Figure 3) [26]. The A subunit of MscS is constituted by 253 amino acids (i.e. 27–280 sequence), organized in three TM α-helices and a huge cytosolic domain (i.e. 113–280 sequence, Figure 4) [27].

For MscL, two-step and three-step growing paths were probed, differing for the fact that the first path utilized a trimer (Figure 7a), whereas the second one utilized a dimer (Figure 7b and 7c) as a target at step 2. Similarly to the KcsA, the two different growing paths shared in common the first step, i.e. A vs A docking. Thus, A vs A docking gave 211 realistic solutions, essentially grouped in two clusters, showing also similar MemTop indices (i.e. 0.527 and 0.588). As for the two-step growing path, the first run was instrumental in obtaining the A-Bs5-Es6 trimer by employing, simultaneously, the best scored solutions from the first two most populated clusters, s5 and s6 (Figure 7a, bold labels). A vs A-Bs5-Es6 docking led to 121 realistic solutions, which essentially grouped in the first two clusters. The best scored solutions from each of these clusters, s1 and s4, were characterized by contacts with the Bs5 and Es6 monomers, respectively, closing the oligomeric cycle. The A-Bs5-Cs1-Ds4-Es6 pentamer showed a C_α-RMSD of 3.45 Å from the native structure (Figure 7a, Table 1).

As for the three-step paths, the best scored solutions from the two most populated clusters, s5 and s6, were used to produce the A-Bs5 and A-Es6 dimers. Each of these two dimers was used in turn as a target in two parallel sets of docking simulations, which finally led to the A-Bs5-Cs3-Ds2-Es1 and A-Bs8-Cs1-Ds1-Es6 pentamers, characterized by almost comparable C_α-RMSDs, i.e. 2.83 Å and 2.47 Å, respectively, from the native oligomer (Figure 7b and 7c, Table 1). Thus, employing a dimer as a target at step 2 was better than using a trimer (Figure 7). Also, all the predicted pentamers were made of solutions falling amongst the top ten out of 4000 in the respective output list (Figure 7, Table 1).

For quaternary structure predictions of MscS, three-step and four-step growing paths were probed, differing for the fact that the first utilized a trimer (Figure 8a), whereas the second utilized a dimer (Figure 8b and 8c) as a target at step 2. Also in this case, the two different growing paths shared in common the first step, i.e. A vs A docking. Thus, A vs A docking gave 111 realistic solutions, essentially grouped in two clusters showing also significantly low MemTop indices (i.e. 0.258 and 0.312).

As for the three-step growing path, the first run was instrumental in obtaining the A-Bs3-Gs1 trimer by using the best scored solutions from the first two most populated clusters, s3 and s1 (Figure 8a). A vs A-Bs3-Gs1 docking led to 79 realistic solutions, which essentially grouped in the first two clusters. The best scored solutions from these clusters, s1 and s3, were, respectively, characterized by contacts with the Gs1 and Bs3 monomers, likely to represent the F and C subunits. A vs A-Bs3-Cs3-Fs1-Gs1 docking produced 89 solutions essentially grouped in the first two clusters, showing also similar MemTop indices (i.e. 0.431 and 0.379). The best scored solution from each of these clusters, s1 and s3, were, respectively, characterized by contacts with the Fs1 and Cs3 monomers and completed the oligomeric cycle, likely to be the E and D subunits, respectively. The thereof obtained A-Bs3-Cs3-Ds3-Es1-Fs1-Gs1 eptamer showed a C_α-RMSD of 2.21 Å from the native oligomer (Figure 8a, Table 1).

As for the four-step paths, the best scored solutions from the first two most populated clusters, s3 and s1, were used to produce the A-Bs3 and A-Gs1 dimers. These dimers were used in turn as a target in two parallel sets of docking simulations, which finally led to the A-Bs3-Cs3-Ds1-Es165-Fs3-Gs1 and A-Bs3-Cs1-Ds1-Es6-Fs1-Gs1 eptamers, characterized by C_α-RMSD of 11.30 Å and 2.06 Å, respectively, from the native oligomer (Figure 8b and 8c, Table 1). In summary, the solutions constituting the three alternative eptamers, which were obtained following different paths, fell amongst the top ten out of 4000 in the respective output list (Table 1 and Figure 8). Also, one of the two four-step growing paths produced the eptamer with the lowest C_α-RMSD from the native oligomer (i.e. 2.06 Å, Table 1 and 4b, 4c and 8c), whereas the other four-step growing path gave the worst eptamer in terms of C_α-RMSD (i.e. 11.30 Å, Figure 8b). Since the two predicted quaternary structures are significantly different from each other (i.e. the C_α-RMSD between them was 10.88 Å), and contain large water-soluble domains, they were subjected to a quality check by means of the 3D-Profile program [33], to help selection of the best one. Interestingly, for the native-like A-Bs3-Cs1-Ds1-Es6-Fs1-Gs1 eptamer, the 3D-Profile score was higher than that of the alternative eptamer (i.e. 438.0 vs 392.4, respectively), quite indicative of a more suitable packing of the seven subunits.

Quaternary structure predictions of BRD

For BRD, A vs A docking, followed by the application of the membrane topology filter, led to 77 solutions (Figure 9). These solutions essentially grouped in two clusters. One of these two clusters showed a better MemTop index than the other cluster (i.e. 0.537 vs 0.732, respectively) and was, thus, used to select the best scored solution, leading to the A-Bs162 dimer. A vs A-Bs162 docking led to 45 solutions, grouped in seven clusters. The solution selected based upon the criteria described above, s14, led to the A-Bs162-Cs14 trimer, characterized by a C_α-RMSD of 1.04 Å from the native trimer (Table 1 and Figures 5b, 5c and 9 top). This is a striking result, considering also that the lipid molecules, which are present at the interfaces between the monomers in the crystal structure of the trimer, were completely neglected in docking simulations. Furthermore, the predicted trimer is made of identical monomers, whereas, in the crystal structure of the oligomer, slight differences exist between the monomers, concerning their internal coordinates and the main-chain length.

We probed also a different crystal structure of BRD, 1KME [34]. This structure differs from the one extracted from the native oligomer both in the backbone (i.e. 1.0 Å C_α-RMSD in the 5–231 sequence, and 1.5 Å C_α-RMSD in the intracellular and extracellular domains) and side-chain conformations. Following exactly the same path and approach as those used for the monomer extracted from the native trimer, a trimer was achieved characterized by a C_α-RMSD of 4.0 Å from the native oligomer. It is worth noting that such C_α-RMSD value comprises also the structural differences between the monomers in the native and predicted trimers.

Quaternary structure predictions of the set of membrane proteins for which the asymmetric unit did not correspond to the biological unit

The subset of membrane proteins, KcsA, MscL and MscS and BRD, for which the asymmetric units corresponded to the biological unit, were employed for setting the prediction protocol. Benchmarks were then extended to a number of homo-oligomers, for which the asymmetric unit did not contain the biologic unit that was, hence, obtained by symmetry operations. These oligomers included: the tetrameric AQP1, GlpF and KirBac1, the trimeric AmtB and AcrB, as well as the dimeric BtuCD.

In detail, for the tetrameric AQP1, GlpF and KirBac1, the same growing paths as those employed for quaternary structure predictions of KcsA were probed. Similarly to KcsA, for these proteins, the two-step growing paths (a), (b) and (c) always resulted into native-like tetramers. In detail, for AQP1, the predicted tetramers (i.e. A-Bs2-Cs1-Ds1, A-Bs1-Cs2-Ds1 and A-Bs2-Cs4-Ds1) showed, respectively, C_α-RMSDs equal to 1.31 Å, 1.71 Å and 1.37 Å from the native oligomer (Table 1 and Figure 10a). For GlpF, the predicted tetramers (i.e. A-Bs2-Cs1-Ds1, A-Bs1-Cs2-Ds1 and A-Bs2-Cs2-Ds1) showed, respectively, C_α-RMSDs equal to 1.11 Å, 1.16 Å, and 1.39 Å from the native oligomer (Table 1 and Figure 10b). Finally, for KirBac1, the predicted tetramers (i.e. A-Bs3-Cs1-Ds1, A-Bs3-Cs1-Ds3 and A-Bs2-Cs1-Ds1) showed, respectively, C_α-RMSDs equal to 1.53 Å, 1.47 Å and 1.8 Å from the native oligomer (Table 1 and Figure 10c). It is worth noting that, for all the predicted structures, the docking solutions contributing to each tetramer fell among the top 3 solutions out of 4000 in the output list. Furthermore, similarly to KcsA, for AQP1 and GlpF, the growing path, which employs a trimer as a target at step 2 (i.e. path (a)), produced the best native-like tetramer (Table 1).

Differently from BRD, for AmtB and AcrB, the MemTop indices of the two most populated clusters from the first docking step were similar. This made it possible to achieve trimers following either path (a) or (b) or (c) (Figure 9 bottom). In detail, for AmtB, A vs A docking led to 38 solutions essentially grouped in two equi-populated clusters showing also good and similar MemTop indices (i.e. 0.317 and 0.319). The best scored solutions from these two clusters, s1 and s7, were likely to form the C and B subunits, respectively. The A-Bs7-Cs1 trimer was characterized by a C_α-RMSD of 1.17 Å from the native trimer. A vs A-Bs7 and A vs A-Cs1 docking, led to 53 and 62 reliable solutions, respectively, which essentially grouped in one cluster. The best solution from this cluster, s1, in both cases, led to the A-Bs7-Cs1 and A-Bs1-Cs1 native-like trimers characterized, respectively, by C_α-RMSDs of 0.83 Å and 0.75 Å from the native trimer (Table 1 and Figures 9 and 10d). For AcrB, A vs A docking led to 123 solutions, essentially grouped in two clusters showing low and similar MemTop indices (i.e. 0.220 and 0.135). The best scored solutions from these two clusters led to the A-Bs4-Cs1 trimer showing a C_α-RMSD equal to 2.78 Å from the native trimer. A vs A-Bs4 and A vs A-Cs1 docking, led to 48 and 26 reliable solutions, respectively, which essentially grouped in one cluster. The best solution from this cluster, s1 in both cases, led to the A-Bs4-Cs1 and A-Bs1-Cs1 native-like trimers characterized, respectively, by C_α-RMSDs of 1.56 Å and 1.64 Å from the native trimer (Table 1 and Figures 9 and 10e).

For BtuCD, whose membrane-spanning domain is a dimer, A vs A docking led to 47 solutions distributed in 11 small clusters. Since the cluster population in this case is meaningless, we considered only the cluster characterized by the best MemTop index (i.e.0.384). The lowest scored solution from this cluster, s178, led to a dimer characterized by a C_α-RMSD of 0.63 Å from the native complex (Table 1 and Figure 10f). These results were achieved following a dipole moment-based approach for subunit reorientation (see the Experimental Procedure section). The membrane topology-based reorientation approach, led to 48 reliable solutions, 30 of which were divided into 7 clusters, whereas the remaining 18 couldn't be clusterized. The 25 unique solutions contained the native-like dimer, characterized by one of the best MemTop index and a C_α-RMSD of 0.48 Å from the native complex. However, in this case, retrieving the native-like solution couldn't be unequivocally done, as it didn't hold either the best ZDOCK score or the best MemTop index.

Discussion

Following benchmarks, we have defined an effective computational protocol to predict the supramolecular structure of integral α-helical TM proteins, starting from the monomer. The approach was first tested on the trimeric BRD, tetrameric KcsA, pentameric MscL and eptameric MscS. These systems were selected following an accurate database search for high resolution structures of oligomeric membrane proteins, significantly different from each other and whose asymmetric crystallographic unit contains the biological unit. Docking samplings on these proteins were instrumental in setting the structure prediction protocol. Benchmarks were then extended to a number of homo-oligomers, for which the asymmetric unit did not contain the biologic unit that was, hence, obtained by symmetry operations. These oligomers included: the dimeric BtuCD, the trimeric AmtB and AcrB, and the tetrameric AQP1, GlpF and KirBac1.

For all the considered systems, native-like quaternary structures were achieved regardless of the resolution of the structural models and the extensions of the TM and water-exposed domains (Figures 2, 3, 4, 5, 6, 7, 8, 9, 10 and Table 1), thus, proving the effectiveness of the prediction approach. The latter consists in a number of dense docking samplings, starting from the docking of two identical copies of a given monomer. Each docking run is followed by membrane topology filtering and cluster analysis. Thus, prediction of the native oligomer is accomplished by a number of progressive growing steps, each made of one docking run, filtering and cluster analysis (Figure 1).

The only requirement with this approach is the structural model of the monomer that must hold the proper membrane topology. The correct membrane topology of the monomer, which in the first docking run of each growing path is employed in two identical copies (one for the target and the other for the probe), is, indeed, necessary for the membrane topology filter to work properly. This filter generally discards more than 94% of the total solutions provided by each docking run (i.e. 4000 solutions), thus allowing for an easy individuation of the clusters holding the native-like solutions. These clusters are, in fact, the one or two most populated, and hold also the best membrane topology indices, i.e a MemTop index close to zero. For almost all the docking runs, the best hit from the selected clusters fall within the first 10 out of the 4000 output solutions provided by the docking algorithm.

A significant number of trials on the four highly different systems BRD, KcsA, MscL and MscS allowed us to define the best criteria for selecting the proper docking solutions at each growing step and the best growing path. These criteria proved validity in quaternary structure predictions of the second set of membrane proteins.

Selection of the proper solution/s at each growing step should be based on the docking score as well as on the population and membrane topology of the solution clusters. In detail, if the realistic solutions group essentially in two clusters characterized also by similar and significantly low MemTop indices, the best scored solutions from both clusters should be selected to grow the oligomer. In contrast, if the two most populated clusters have significantly different MemTop indices, only the one showing the lowest index must be chosen for extracting the best scored solution. Finally, if one cluster is significantly most populated than the others, only that cluster must be considered for extracting the best scored solution.

As for the growing paths, in general, those should be pursued, which employ a dimeric target in the second step (Figures 6, 7, 8, 9). For cyclic oligomers, our approach is able to predict the oligomeric order of the functional unit, since completion of the cycle determines the end of the growth. Once established the oligomeric order, for systems like KcsA, AQP1 and GlpF, which are characterized by an even number of monomers, a growing path, which employs a trimer as a target at step 2, is worth attempting. In fact, in our benchmarks, this path led to the best tetramer, in terms of C_α-RMSD from the native structure (Table 1, path (a)). Based on these results, we infer that, for even order oligomers, growing paths characterized by the employment of a trimeric target at step 2 may be preferred, whereas, for odd order oligomers, the preferential growing paths are those employing a dimeric target at step 2. Interestingly, both these paths share the characteristic that, in the final step, only one monomer is needed to complete the oligomer, i.e. to close the cycle. For odd order oligomers, this would imply pursuing two parallel growing paths, finally leading to two predicted quaternary structures. When the two predicted oligomers differ significantly from each other, quality checks are needed to select the final structure. In our study, this was the case of the MscS oligomer (Figure 8b and 8c). In fact, the two parallel growing paths passing through a dimeric target led to two significantly different oligomers (i.e. the C_α-RMSD between them was 10.88 Å). In this case, given the significant extension of the water-exposed domains compared to the TM ones, the 3D-Profile score proved effectiveness in individuating the native-like oligomer.

For non-cyclic oligomers, the growth of the supramolecular assembly must stop when no more realistic solutions can be found. Low resolution information from Atomic Force Microscopy (AFM) or cryo-electron microscopy, if available, should be combined with the results of docking analysis. We couldn't do benchmarks on non-cyclic oligomers characterized by oligomeric orders higher than two or three due to the lack of high resolution structures for such systems. As an example, for BRD, we stopped the growth at the trimer, since the native structure, required for benchmarks, is available only for the trimer. However, we know from AFM images that BRD trimers organize in higher order oligomers [3]. Preliminary results of trimer vs trimer docking, reconstructed a BRD esamer similar to that inferred from AFM images (results not shown) [3]. We observed that the number of filtered solutions decreases significantly ongoing from monomer vs monomer to trimer vs trimer docking (i.e. from 77 to 15, respectively). This is in line with the fact that intra-trimer contacts should be stronger than the inter-trimer ones, which are mediated also by lipid molecules.

The effectiveness of predictions is independent of the content of water-exposed domains in the monomeric units, suggestive of the fundamental role of the TM domains in driving the supramolecular organization.

An important result is that predictions were excellent also for those systems, for which the asymmetric unit consists of the monomer and not of the biologic unit. These may be considered as unbound-unbound docking cases. Moreover, quaternary structure predictions can not be considered as bound-bound docking cases neither for those cases, in which the crystallographic asymmetric unit contains the biologic unit, and, hence, the monomer used for simulations was extracted from the X-ray structure of the complex. In fact, in none of the docking steps the situation is such that probe and target are the same components of the native complex. Moreover, for BRD, the predicted trimer is made of identical monomers, whereas, in the crystal structure of the oligomer, slight differences exist between the monomers, concerning their internal coordinates and the main-chain length. Moreover, the lipid molecules, which are present at the interfaces between the monomers in the crystal structure of the trimer, were completely neglected in our docking simulations. The employment of an "unbound" BRD structure (i.e. PDB code 1KME) [34], differing both in the backbone and side-conformations from the monomer extracted from the native trimer did not prevent the approach from leading to a correct quaternary structure.

Conclusion

In conclusion, the benchmarks carried out in this study on ten homo-oligomeric membrane proteins with known quaternary structure validate the proposed structure prediction protocol that, for all the tested cases, led to native-like quaternary structures regardless of the resolution of the structural models.

Quaternary structure predictions will be, hence, extended to other integral membrane proteins, including GPCRs. An attempt in this respect has been already reported, though based on an early and different version of the computational protocol, proving usefulness in aiding the interpretation of biophysical data and the design of novel in vitro experiments[14, 35]

Methods

Overview of the computational approach

Quaternary structure predictions were carried out through a stepwise approach consisting of a combination of rigid-body docking, membrane topology filtering, cluster analysis, and solution selection for the growth of the oligomer. Solution selection was essentially based on the docking score and the quality of the membrane topology (Figure 1).

Rigid body docking was carried out by means of the program ZDOCK 2.1, which utilizes the Pairwise Shape Complementarity (PSC) scoring function, neglecting desolvation [36].

For each protein system, the first docking run consisted of docking two identical copies of the crystal structure of monomer A, keeping one monomer fixed (i.e. target) and allowing the other copy of the monomer to move around the target (i.e. probe). For the membrane topology filter to work properly, the two identical copies of the starting monomer must have the appropriate orientation with respect to the putative membrane. This is due to the fact that ZDOCK expresses its docking solutions in terms of a x,y,z-translation and a RzRxRz-rotation of the probe. If both target and probe are properly oriented in a membrane parallel to the XY plane, the translation along the z-axis can be considered as an offset out of the membrane and the Rx component of the rotation as a deviation from the original orientation in the membrane. The membrane topology filter, indeed, discards all the solutions characterized by a deviation angle from the original z-axis, i.e. tilt angle, and a displacement of the geometrical center along the z-axis, i.e. z-offset, above defined threshold values. For the tilt angle and the z-offset, thresholds of 0.4 radians and 6.0 Å were, respectively, employed. The check of the correct orientation and the eventual reorientation of the monomer are automatically carried out in the following way. Membrane topology predictions allow for the definition of two sets of C_α-atoms, which putatively lie at the two lipid/water interfaces of the membrane. The planes that fit at best these C_α-atoms are found by means of a multiple linear regression approach. If these planes are parallel to the xy plane and, hence, perpendicular to the z-axis, the orientation of the monomer is considered good and no reorientation is needed. In contrast, if the monomer needs a reorientation, the rotations necessary to bring the two planes parallel to the xy plane are computed and then an average between the two rotations is employed to reorient the whole monomer. A number of membrane topology predictors were probed [37]. Overall, the PRODIV-TMHMM_0.91 predictor allowed for the most effective subunit reorientation. A reorientation was needed only for KcsA, BtuCD and the two different monomers of BRD (i.e. the one extracted from the trimer, PDB code 1BRR, and the monomeric structure encoded as 1KME). For KcsA and BtuCD we probed also an approach consisting in orienting the native oligomeric structure so that the dipole moment of the oligomer was parallel to the z-axis.

The codes for monomer reorientation and membrane topology filtering have been included in our FIPD software for the analyses of ZDOCK outputs [see Additional file 1][40].

For each set of docking simulations aimed at reaching the final oligomerization state of the protein, the runs that succeeded the first one were carried out by using the original monomer A as a probe and the intermediate oligomer as a target. For each of these runs, neither the probe monomer nor the target oligomer require a reorientation, since the former molecule is the same as that used in the first step, whereas the oligomeric target inherits the orientation of the original target.

Docking samplings were carried out by using a 128 × 128 × 128 point grid with a spacing of 1.2 Å and a rotational sampling interval of 6°, i.e. dense sampling. From each docking run, the first best 4000 solutions, according to the ZDOCK score, were retained. These solutions were subjected to the "membrane topology" filter described above. The filtered solutions from each run were merged with the target protein, leading to an equivalent number of oligomers that were subjected to cluster analysis. Clustering was carried out by using an algorithm from Dr. M. Schaefer (Michael Schaefer, Syngenta Crop Protection AG, unpublished work). The algorithm first calculates the Cα-RMSD for each superimposed pair of dimers/oligomers and then it computes the number of neighbors for each dimer/oligomer, using a threshold Cα-RMSD. The dimer/oligomer with the highest number of neighbors is considered as the center of the first cluster. All the neighbors of this configuration are removed from the ensemble of configurations to be counted only once. The center of the second cluster is then determined in the same way as for the first cluster, and this procedure is repeated until each structure is assigned to a cluster. The Cα-RMSD threshold value for each superimposed pair of dimers/oligomers was set equal to 3.0 Å. All the amino acid residues in the dimer/oligomer were included in C_α-RMSD calculations.

To identify the cluster of solutions with the best membrane topology, i.e. with the lowest values of both tilt angle and z-offset, we defined the MemTop index, $M e m T o p = \sqrt{{〈 t i l t_{n o r} 〉}^{2} + {〈 Z_{o f f n o r} 〉}^{2}}$ , were <tilt_nor> and <Z_offnor> are, respectively, the normalized tilt angle and the z-offset averaged over all the members of a given cluster. Normalization of each tilt angle and z-offset value was carried out by dividing each value for the respective cutoff value, i.e. 0.4 radians, for the tilt angle, and 6.0 Å, for the z-offset. The optimal value for such index is zero.

The quality of some of the predicted oligomeric structures of MscS was estimated by means of the 3D-Profile program [33] within the Quanta 2000 package.

Molecular visualizations were done by means of the VMD v1.8.2 program [38].

As indicative values, a monomer vs monomer and monomer vs esamer docking for MscS required 87 hours and 147 hours, respectively, of CPU time on a 2.2 GHz Opteron with 2 GB RAM.

References

Membrane proteins of known 3D structure [http://blanco.biomol.uci.edu/Membrane_Proteins_xtal.html]
Essen L, Siegert R, Lehmann WD, Oesterhelt D: Lipid patches in membrane protein oligomers: crystal structure of the bacteriorhodopsin-lipid complex. Proc Natl Acad Sci U S A 1998, 95(20):11673–11678. 10.1073/pnas.95.20.11673
Article PubMed Central CAS PubMed Google Scholar
Muller DJ, Heymann JB, Oesterhelt F, Moller C, Gaub H, Buldt G, Engel A: Atomic force microscopy of native purple membrane. Biochim Biophys Acta 2000, 1460(1):27–38. 10.1016/S0005-2728(00)00127-4
Article CAS PubMed Google Scholar
George SR, O'Dowd BF, Lee SP: G-protein-coupled receptor oligomerization and its potential for drug discovery. Nat Rev Drug Discov 2002, 1(10):808–820. 10.1038/nrd913
Article CAS PubMed Google Scholar
Agnati LF, Ferre S, Lluis C, Franco R, Fuxe K: Molecular mechanisms and therapeutical implications of intramembrane receptor/receptor interactions among heptahelical receptors with examples from the striatopallidal GABA neurons. Pharmacol Rev 2003, 55(3):509–550. 10.1124/pr.55.3.2
Article CAS PubMed Google Scholar
Kroeger KM, Pfleger KD, Eidne KA: G-protein coupled receptor oligomerization in neuroendocrine pathways. Front Neuroendocrinol 2003, 24(4):254–278. 10.1016/j.yfrne.2003.10.002
Article CAS PubMed Google Scholar
Terrillon S, Bouvier M: Roles of G-protein-coupled receptor dimerization. EMBO Rep 2004, 5(1):30–34. 10.1038/sj.embor.7400052
Article PubMed Central CAS PubMed Google Scholar
Dean MK, Higgs C, Smith RE, Bywater RP, Snell CR, Scott PD, Upton GJ, Howe TJ, Reynolds CA: Dimerization of G-protein-coupled receptors. J Med Chem 2001, 44(26):4595–4614. 10.1021/jm010290+
Article CAS PubMed Google Scholar
del Sol Mesa A, Pazos F, Valencia A: Automatic methods for predicting functionally important residues. J Mol Biol 2003, 326(4):1289–1302. 10.1016/S0022-2836(02)01451-1
Article CAS PubMed Google Scholar
Filizola M, Weinstein H: The study of G-protein coupled receptor oligomerization with computational modeling and bioinformatics. Febs J 2005, 272(12):2926–2938. 10.1111/j.1742-4658.2005.04730.x
Article CAS PubMed Google Scholar
Nemoto W, Toh H: Prediction of interfaces for oligomerizations of G-protein coupled receptors. Proteins 2005, 58(3):644–660. 10.1002/prot.20332
Article CAS PubMed Google Scholar
Soyer OS, Dimmic MW, Neubig RR, Goldstein RA: Dimerization in aminergic G-protein-coupled receptors: application of a hidden-site class model of evolution. Biochemistry 2003, 42(49):14522–14531. 10.1021/bi035097r
Article CAS PubMed Google Scholar
Thummer RP, Campbell MP, Dean MK, Frusher MJ, Scott PD, Reynolds CA: Entropy and oligomerization in GPCRs. J Mol Neurosci 2005, 26: 113–122. 10.1385/JMN:26:2-3:113
Article CAS PubMed Google Scholar
Fanelli F, De Benedetti PG: Computational modeling approaches to Structure-Function Analysis of G Protein-Coupled Receptors. Chem Rev 2005, 105: 3297–3351. 10.1021/cr000095n
Article CAS PubMed Google Scholar
Eisenstein M, Shariv I, Koren2 G, Friesem AA, Katchalski-KatziR E: Modeling Supra-molecular Helices: Extension of the Molecular Surface Recognition Algorithm and Application to the Protein Coat of the Tobacco Mosaic Virus. J Mol Biol 1997, 266: 135–143. 10.1006/jmbi.1996.0773
Article CAS PubMed Google Scholar
Gardiner EJ, Willett P, Artymiuk PJ: Protein Docking Using a Genetic Algorithm. PROTEINS 2001, 44: 44–56. 10.1002/prot.1070
Article PubMed Google Scholar
Berchanski A, Eisenstein M: Construction of Molecular Assemblies via Docking: Modeling of Tetramers with D2 Symmetry. PROTEINS 2003, 53: 817–829. 10.1002/prot.10480
Article PubMed Google Scholar
Inbar Y, Benyamini H, Nussinov R, Wolfson HJ: Protein structure prediction via combinatorial assembly of sub-structural units. Bioinformatics 2003, 19 Suppl. 1 : i158–i168.
PubMed Google Scholar
Berchanski A, Segal D, Eisenstein M: Modeling Oligomers with Cn or Dn Symmetry: Application to CAPRI Target 10. PROTEINS 2005, 60: 202–206. 10.1002/prot.20558
Article PubMed Google Scholar
Pierce B, Tong W, Weng Z: M-ZDOCK: a grid-based approach for Cn symmetric multimer docking. Bioinformatics 2005, 21(8):1472–1478. 10.1093/bioinformatics/bti229
Article CAS PubMed Google Scholar
Kim S, Chamberlain AK, Bowie JU: A simple method for modeling transmembrane helix oligomers. J Mol Biol 2003, 329(4):831–840. 10.1016/S0022-2836(03)00521-7
Article CAS PubMed Google Scholar
Fleishman SJ, Harrington S, Friesner RA, Honig B, Ben-Tal N: An Automatic Method for Predicting Transmembrane Protein Structures Using Cryo-EM and Evolutionary Data. Biophys J 2004, 87(5):3448–3459. 10.1529/biophysj.104.046417
Article PubMed Central CAS PubMed Google Scholar
Vakser Lab [http://vakser.bioinformatics.ku.edu/resources/gramm]
Vakser IA, Jiang S: Strategies for modeling the interactions of transmembrane helices of G protein-coupled receptors by geometric complementarity using the GRAMM computer algorithm. Methods in Enzymol 2002, 343: 313–328.
Article Google Scholar
Doyle DA, Morais Cabral J, Pfuetzner RA, Kuo A, Gulbis JM, Cohen SL, Chait BT, MacKinnon R: The structure of the potassium channel: molecular basis of K+ conduction and selectivity. Science 1998, 280(5360):69–77. 10.1126/science.280.5360.69
Article CAS PubMed Google Scholar
Chang G, Spencer RH, Lee AT, Barclay MT, Rees DC: Structure of the MscL homolog from Mycobacterium tuberculosis: a gated mechanosensitive ion channel. Science 1998, 282(5397):2220–2226. 10.1126/science.282.5397.2220
Article CAS PubMed Google Scholar
Bass RB, Strop P, Barclay M, Rees DC: Crystal structure of Escherichia coli MscS, a voltage-modulated and mechanosensitive channel. Science 2002, 298(5598):1582–1587. 10.1126/science.1077945
Article CAS PubMed Google Scholar
Locher KP, Lee AT, Rees DC: The E. coli BtuCD structure: a framework for ABC transporter architecture and mechanism. Science 2002, 296(5570):1091–1098. 10.1126/science.1071142
Article CAS PubMed Google Scholar
Khademi S, O'Connell J, Remis J, Robles-Colmenares Y, Miercke LJ, Stroud RM: Mechanism of ammonia transport by Amt/MEP/Rh: structure of AmtB at 1.35 A. Science 2004, 305(5690):1587–1594. 10.1126/science.1101952
Article CAS PubMed Google Scholar
Sui H, Han BG, Lee JK, Walian P, Jap BK: Structural basis of water-specific transport through the AQP1 water channel. Nature 2001, 414(6866):872–878. 10.1038/414872a
Article CAS PubMed Google Scholar
Fu D, Libson A, Miercke LJ, Weitzman C, Nollert P, Krucinski J, Stroud RM: Structure of a glycerol-conducting channel and the basis for its selectivity. Science 2000, 290(5491):481–486. 10.1126/science.290.5491.481
Article CAS PubMed Google Scholar
Kuo A, Gulbis JM, Antcliff JF, Rahman T, Lowe ED, Zimmer J, Cuthbertson J, Ashcroft FM, Ezaki T, Doyle DA: Crystal structure of the potassium channel KirBac1.1 in the closed state. Science 2003, 300(5627):1922–1926. 10.1126/science.1085028
Article CAS PubMed Google Scholar
Luthy R, Bowie JU, Eisenberg D: Assessment of protein models with three-dimensional profiles. Nature 1992, 356(6364):83–85. 10.1038/356083a0
Article CAS PubMed Google Scholar
Faham S, Bowie JU: Bicelle crystallization: a new method for crystallizing membrane proteins yields a monomeric bacteriorhodopsin structure. J Mol Biol 2002, 316(1):1–6. 10.1006/jmbi.2001.5295
Article CAS PubMed Google Scholar
Canals M, Marcellino D, Fanelli F, Ciruela F, De Benedetti P, Goldberg SR, Neve K, Fuxe K, Agnati LF, Woods AS, Ferre S, Lluis C, Bouvier M, Franco R: Adenosine A2A-dopamine D2 receptor-receptor heteromerization: qualitative and quantitative assessment by fluorescence and bioluminescence energy transfer. J Biol Chem 2003, 278(47):46741–46749. 10.1074/jbc.M306451200
Article CAS PubMed Google Scholar
Chen R, Weng Z: A novel shape complementarity scoring function for protein-protein docking. Proteins 2003, 51(3):397–408. 10.1002/prot.10334
Article CAS PubMed Google Scholar
bdm [http://gpcr.biocomp.unibo.it/biodec/]
Humphrey W, Dalke A, Schulten K: VMD: visual molecular dynamics. J Mol Graph 1996, 14(1):33–8, 27–8. 10.1016/0263-7855(96)00018-5
Article CAS PubMed Google Scholar
PyMOL [http://pymol.sourceforge.net/]
Computational Biochemistry and Biophysics Group - DTI [http://www.cbl.unimo.it/software/software.htm]

Download references

Acknowledgements

This study was supported by a Telethon-Italy grant n. S00068TELA (To FF).

We'd like to thank Daniele Dell'Orco for his fruitful inputs.

Author information

Authors and Affiliations

Department of Chemistry, Dulbecco Telethon Institute (DTI), University of Modena e Reggio Emilia, Via Campi 183, 41100, Modena, Italy
D Casciari, M Seeber & F Fanelli

Authors

D Casciari
View author publications
You can also search for this author in PubMed Google Scholar
M Seeber
View author publications
You can also search for this author in PubMed Google Scholar
F Fanelli
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to F Fanelli.

Additional information

Authors' contributions

DC carried out docking simulations and analyses. MS wrote the codes to allow effective analyses of the docking outputs. FF conceived and coordinated the study and wrote the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

12859_2006_1079_MOESM1_ESM.zip

Additional File 1: fipd. Archive file containing two files: (a) the FIPD software compiled for Linux machines (i.e. fipd_i386.x), and (b) the relative documentation (README). Such software allows for both monomer reorientation and ZDock solution filtering.This software can be also downloaded from the authors' WEB site [40]. (ZIP 246 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Casciari, D., Seeber, M. & Fanelli, F. Quaternary structure predictions of transmembrane proteins starting from the monomer: a docking-based approach. BMC Bioinformatics 7, 340 (2006). https://doi.org/10.1186/1471-2105-7-340

Download citation

Received: 18 January 2006
Accepted: 12 July 2006
Published: 12 July 2006
DOI: https://doi.org/10.1186/1471-2105-7-340

Quaternary structure predictions of transmembrane proteins starting from the monomer: a docking-based approach