Discrete structural features among interface residue-level classes

Sowmya, Gopichandran; Ranganathan, Shoba

doi:10.1186/1471-2105-16-S18-S8

Volume 16 Supplement 18

Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Bioinformatics

Research
Open access
Published: 09 December 2015

Discrete structural features among interface residue-level classes

Gopichandran Sowmya¹ &
Shoba Ranganathan¹

BMC Bioinformatics volume 16, Article number: S8 (2015) Cite this article

1955 Accesses
10 Citations
Metrics details

Abstract

Background

Protein-protein interaction (PPI) is essential for molecular functions in biological cells. Investigation on protein interfaces of known complexes is an important step towards deciphering the driving forces of PPIs. Each PPI complex is specific, sensitive and selective to binding. Therefore, we have estimated the relative difference in percentage of polar residues between surface and the interface for each complex in a non-redundant heterodimer dataset of 278 complexes to understand the predominant forces driving binding.

Results

Our analysis showed ~60% of protein complexes with surface polarity greater than interface polarity (designated as class A). However, a considerable number of complexes (~40%) have interface polarity greater than surface polarity, (designated as class B), with a significantly different p-value of 1.66E-45 from class A. Comprehensive analyses of protein complexes show that interface features such as interface area, interface polarity abundance, solvation free energy gain upon interface formation, binding energy and the percentage of interface charged residue abundance distinguish among class A and class B complexes, while electrostatic visualization maps also help differentiate interface classes among complexes.

Conclusions

Class A complexes are classical with abundant non-polar interactions at the interface; however class B complexes have abundant polar interactions at the interface, similar to protein surface characteristics. Five physicochemical interface features analyzed from the protein heterodimer dataset are discriminatory among the interface residue-level classes. These novel observations find application in developing residue-level models for protein-protein binding prediction, protein-protein docking studies and interface inhibitor design as drugs.

Background

Protein-protein binding is a known phenomenon in complex biological networks. The molecular principle of such binding is often elusive in nature. Understanding its driving forces using known protein complexes is essential. The analysis of existing protein-protein interaction (PPI) complexes from the Protein Data Bank (PDB) [1] is the key to gaining insights into recognition mechanisms and binding principles as reviewed elsewhere [2–6]. Sequence and structural investigations on the existing complexes has been carried out for several decades [3, 7–10]. In these extensive surveys, structural features over diverse datasets of protein-protein complexes were typically averaged, obscuring information on individual proteins' structural integrity. Each individual complex is specific and sensitive to binding. Although, non-polar (or hydrophobic) interactions are known to play a major role in contributing to the driving force for binding, in a considerable number of complexes, polar interactions are also observed to contribute abundantly to the formation of a stable interface [11]. Therefore, it is often essential to study the relative difference in surface and interface polarity of each PPI complex to determine the major binding forces at the interface and determine their discriminatory features.

Interfaces are localized regions of surfaces with different physico-chemical properties compared to the rest of the surfaces, thereby driving binding to other molecules. Both physical and chemical features (including hydrophobicity, electrostatic interactions, binding energy, interface size, hydrogen bonds, salt bridges, disulphide bonds, planarity, sphericity, shape complementarity, amino acid chemical groups, and conserved residue clusters) govern the formation of protein interfaces as described elsewhere [7, 9, 12–18]. The chemical nature of residues forming a protein interface (amino acid residue composition) determines the hydrophobic effect of an interface. Non-polar (or hydrophobic) residues are observed to occur predominantly at the protein interface, playing a major role in contributing to the driving force for binding [7, 13]. Interfaces are observed to be less non-polar (or hydrophobic) than the protein interior [13]. The residue composition of protein-protein interfaces was observed to be more similar to the protein surface than the protein interior [9].

Interfaces were observed to be significantly polar as well as non-polar with few charged groups, similar to the characteristics of the protein surface [12]. Structural analysis also revealed that charged and polar amino acids are involved in protein-protein interactions as reviewed elsewhere [19]. Charged and polar residues contributing to binding specificity and complex formation are demonstrated in a number of complexes such as human IL-4, human CD2 and CD58, barnase-barstar, Colicin E9, integrin αvβ6 membrane protein and in intrinsically disordered proteins [20–25]. Shape complementarity, polar interactions, hydrogen bonding and salt bridges are also known to contribute to binding specificity and free energy of binding [17, 24, 26, 27]. Also, charged and aromatic side chains are crucial for binding, determining the cation-pi, electrostatic and aromatic interactions [8]. The role of electrostatics in binding stability of protein-protein complexes is demonstrated [16]. These observations indicate that although PPIs are driven by non-polar interactions at the interface for a majority of complexes, in some cases polar interactions contribute to binding specificity (characteristic of polar residues) and likewise to complex stabilization. Therefore, a study on the relative percentage difference between surface and interface polarities of each protein complex is often essential. In our previous study, we have identified a class of complexes with more polar residues at the interface than the rest of the surface, where binding is mainly polar with a dataset of 198 complexes [11]. This observation has now been extended for an updated yet non-redundant dataset of 278 protein complexes to verify and identify any discriminatory features among these interface residue-level classes.

In this study, we have carried out a comprehensive structural analysis of 278 non-redundant heterodimeric protein complexes from the PDB. We estimated the relative difference in surface and interface polarities of each complex in the dataset, using percentage values of polar residues. This property divides the dataset into two interface classes as also observed in our previous study with a smaller dataset [11]. Class A has less polar residues at the interface than the rest of the surface (~60%) which is the 'classical' definition of a PPI complex and class B has more polar residues at the interface than the rest of the surface (~40%), is 'non-classical.' Therefore, we have investigated essential PPI structural features such as interface area (ΔASA), the relative abundance of polar and non-polar residues at the interface (interface polarity abundance), hydrogen bonds (H-bonds), salt bridges, percentage of charged residues at the interface (interface charged residues%), solvation free energy gain upon interface formation (ΔⁱG), binding energy (BE), and electrostatics among these interface classes and their gleaned features are documented. We identified five key features (ΔASA, interface polarity abundance, interface charged residues%, ΔⁱG and BE) that are significantly different between the interface classes. These novel observations have implications for residue-level characterization of protein complexes to develop models for protein-protein binding prediction and docking studies.

Methods

Heterodimer dataset

We created a non-redundant heterodimer dataset of protein complexes from the PDB, using the RCSB PDB's advanced search interface. The following criteria were used for filtering: (i) resolution <= 3Å (ii) protein size >50 residues (iii) contains experimental data (iv) number of chains, entities and oligomeric state is set at 2 (v) devoid of DNA or RNA or a hybrid of such molecules with the protein or otherwise (vi) entries with mutations were not included and (vii) sequence identity cut-off is set to 30%, which is the minimum cut-off available in the PDB. As a second step, the USEARCH program [28] was used to further remove the redundancy among heterodimer complexes at sequence identity cut-off of 20%, as this threshold eliminates remote homology up to 25% sequence identity seen in structures as well [29].

Interface analysis

The interface of PPI complex is calculated as the change in solvent accessible surface area (ΔASA) upon complex formation. The Surface Racer 5.0 program [30] was used to calculate ASA with a probe radius of 1.4Å and Lee and Richards implementation [31]. Interface residues with ΔASA > 0.1Å² were considered for this analysis, as defined by Chakrabarti and Janin [32]. ASA was used to determine surface residues of each complex. The amount of polar, non-polar and charged residues at the interface was then estimated for the dataset. The interface polarity abundance (P%-NP%) is measured as the difference in the percentage of polar residues (P%) and percentage of non-polar residues (NP%) at the interface [11].

Classification based on relative interface-surface polarity

Interfaces are part of protein surface formed upon binding of individual subunits. Each protein complex has a specific composition of polar (P) and nonpolar (NP) residues at the surface (S) and at the interface (I). The distribution of polar and nonpolar residues at the interface of a protein complex describes the nature of the interface and the major driving force for binding. We have calculated the percentage of polar and nonpolar residues at the surface and interface for each complex in the dataset. Polar residues considered in the analysis are R, N, D, E, Q, H, K, S, T, and Y and non-polar residues are A, C, G, I, L, M, F, P, V, and W. Complexes were then grouped based on the relative difference in percentage of polar residues between surface (S) and the interface (I). Complexes with interface polarity less than the surface (represented as S>I) are grouped as class A, and those that have interface polarity greater than the surface (represented as S<I) are grouped as class B [11].

Intermolecular H-bonds and salt bridges calculation

We calculated the intermolecular hydrogen bonds for the dataset using HBPLUS program [33] at a distance of < 4Å. The H-bonds were extracted such that the donor and acceptor are from two different chains. Salt bridges were calculated using SBION program [34] within a distance of 4Å. The salt bridges were also extracted such that the oppositely charged atoms are from two different chains.

ΔⁱG and BE calculation

PDBePISA webserver [35] was used to obtain the solvation free energy gain upon interface formation (ΔⁱG, in kcal/mol, with negative ΔⁱG values indicating hydrophobic interface) and for the heterodimer dataset. BE values were calculated using the DCOMPLEX program [36] with the most negative value considered the strongest. The DCOMPLEX program uses DFIRE-based potentials [37] to calculate BE terms, without values for individual components (electrostatic, van der Waals, hydrophobic and entropic terms) contributing to BE. Initially, the program calculates the total atom-atom potential of mean force, G, for each protein structure as follows:

G = \frac{1}{2} \sum_{i, j} (ū_{i, j}, r_{i, j})

(1)

where ū is the atom-atom potential of mean force between two atoms, i and j which are 'r' distance apart. The total is over atomic pairs which are not from the same residue and a K factor is used to avoid double-counting of residue-residue and atom-atom interactions [36]. The binding energy between two interacting proteins A and B can be calculated as follows:

Δ G_{b i n d} = G_{c o m p l e x} (G_{A} + G_{B})

(2)

where A and B are considered as two protein structures whose interface residues contribute most to ΔG_bind. Therefore, DCOMPLEX [36] uses the equation below to calculate BE:

Δ G_{bind} = \frac{1}{2} \sum_{i j}^{i n t e r f a c e} (ū_{i, j}, r_{i, j})

(3)

Electrostatic potential at the interface

The surface electrostatic potential of chain A and chain B of a protein complex was calculated by solving Poisson-Boltzmann equation with dielectric constant (protein) of 4 using DEEPVIEW [38].

Statistical analysis

The Wilcoxon signed-rank test [39], a non-parametric statistical hypothesis test is used to compare the two interface classes to assess whether the mean ranks for the PPI features in the two classes differ (i.e. it is a paired difference test). The discriminatory PPI features among the two classes were thus tested for statistical significance with p < 0.05 (for the Wilcoxon signed-rank test) in RStudio [40].

Results and discussion

We calculated the amount of polar and non-polar residues at the surface and interface of each protein-protein complex and estimated their relative interface-surface polarities for classification into class A and class B (as described in Materials and Methods section), to determine the type of interactions predominantly driving protein-protein binding. Additional File 1: Table S1 shows the heterodimer dataset (278) divided into class A (165) and class B (113). Thus, 59.4% of complexes in our dataset belong to class A (relative surface polarity is greater than interface polarity), where non-polar interactions are predominant at the interface, as previously observed in a number of studies [7, 13]. Nevertheless, 40.6% of complexes belong to class B (relative interface polarity is greater than surface polarity), where polar interactions are predominant at the interface, similar to the surface characteristics as also observed [12]. Class A and class B are significantly different with a p-value of 1.66E-45 (using Wilcoxon rank sum test) as shown in Additional File 2: Figure S1. Examples of class A and class B complexes representing predominant non-polar and predominant polar interfaces (using the PDBsum [41] interaction analysis) respectively are shown in Figure 1.

PPI features among class A and class B complexes

We carried out a statistical analysis of all the structural features (described in Materials and Methods section including ΔASA, interface polarity abundance, interface charged residues%, H-bonds, salt bridges, ΔⁱG, BE) in R program (using Wilcoxon rank sum test), to determine whether structural features discriminate among class A and class B complexes. Interestingly, five structural features showed significant difference among the interface classes as shown in Figure 2, with p-value < 0.05 (Table 1). The q-value in Table 1 is the smallest False Discovery Rate (FDR) at which a particular class (class A or class B) would stay on the discriminatory features table. This is not identical to the p-value, which is the smallest false positive rate (FPR) at which a class appears positive on the discriminatory features table. The p-value is much stricter than the q-value. An FDR of 5% (q-value <0.05) is acceptable, which is accepting 5% of erroneous single results, according to Wilcoxon test [39]. These structural features are presented below, along with sets of other correlated properties and electrostatics among classes.

Table 1 PPI features distinguishing class A and class B (using Wilcoxon rank sum test).

Full size table

Interface polarity abundance among classes

Protein interfaces are composed of both polar and non-polar residues. Some interfaces are abundant in non-polar residues while few others are abundant in polar residues. The interface polarity abundance (P%-NP%) measure is significantly different among the interface classes with p = 7.01E-30 (Table 1 and Figure 2).

ΔⁱG among classes

The solvation free energy gain upon interface formation (ΔⁱG) is a measure of the interface stability in protein complexes [35]. The ΔⁱG values are significantly different among interface classes with p = 7.43E-18 (Table 1) as shown in Figure 2.

BE among classes

The strength of binding among class A and class B complexes is estimated as a measure of BE in kcal/mol. The BE values are relatively stronger for class A complexes (average BE is -33.99 kcal/mol), as compared to class B complexes (average BE is -17.94 kcal/mol). The BE values are significantly different among interface classes with p = 2.63E-14 (Table 1) as shown in Figure 2.

Interface charged residues among classes

The percentage of charged residues at the interface is estimated for both classes. The interface charged residues% is significantly different among interface classes with p = 6.58E-13 (Table 1) as shown in Figure 2.

ΔASA among classes

The interface area (ΔASA) of a complex is an important structural characteristic of PPI. We observed that class A complexes demonstrate comparatively larger interfaces than class B complexes. The ΔASA is significantly different among the classes with p = 1.31E-08 (Table 1 and Figure 2).

Other correlations of interface features among classes

The stability of protein-protein binding depends on the number of hydrogen bonds and salt bridges formed between the two interacting subunits. Class A complexes show high correlation between intermolecular H-bonds and interface area (r = 0.9) as previously observed [7, 42]. However, class B complexes alone show reduced trends (r = 0.73) between intermolecular H-bonds and interface area (Additional File 3: Figure S2), indicating that low quality of intermolecular hydrogen bonds is a characteristic of the large number of polar or charged residues across protein interfaces as previously observed [17]. Although salt bridges showed no distinguishing trends among classes, we observed that class B complexes are rich in salt bridges (average number of salt bridges is 6.5), as compared to class A complexes (average number of salt bridges is 5.8).

The BE values are proportional to interface area in the dataset (r = 0.96, shown in Additional File 4: Figure S3) as previously observed [43]. The ΔⁱG values show relatively less correlation with interface area in class B complexes (r = -0.62) as compared to class A complexes (r = -0.92, Additional File 5: Figure S4). Moreover, the ΔⁱG and BE is highly correlated among the dataset (r = 0.88) and class A (r = 0.91), however shows limited correlation among class B (r = 0.55, Additional File 6: Figure S5).

Electrostatic visualization maps among protein interface classes

We have studied the surface electrostatic potential solving Poisson-Boltzmann equation using DEEPVIEW for a few examples of class A and class B complexes. This shows common surface electrostatics at work amongst the class A and amongst the class B complexes. Interestingly, the class A complexes demonstrate similar distribution of charges at the protein interfaces of both chains, suggesting electrostatic energy may not contribute to binding energy among class A complexes. However, class B complexes show opposite charge distributions at the protein interfaces, suggesting electrostatic energy plays an important role in PPIs among class B complexes as shown in Figure 3. Therefore, the surface electrostatic potential maps give quick visual clues for identifying or distinguishing class A and class B complexes.

Conclusions

Structural analyses of known protein interfaces help in understanding the molecular principals of PPIs. Therefore, a comprehensive analysis of known structural interfaces of 278 complexes was carried out and their gleaned features are documented in this study. It is realized that each complex type is unique, specific and sensitive to binding. Nonetheless, there is a considerable degree of observed pattern among protein interface classes. We report two classes of interfaces, one class with less polar residues and the other class with more polar residues compared to the surfaces in bound state. The surfaces of proteins are quite polar and therefore, it is perhaps not surprising that some interfaces are polar as well and that PPI complex forms due to interactions among charged and polar residues. Thus, the need for a residue-level characterization of the interface is crucial in addition to other structural features. We document five discriminatory features (interface area, interface polarity abundance (P%-NP%), interface charged residues%, solvent free energy gain upon interface formation (ΔⁱG), and binding energy) among the interface residue-level classes. This is a first attempt towards classifying the complexes based on interface residue-level classes for the characterization of PPI features amongst these classes. These observations corroborate the need for classification of complexes in determining their combinatorial features and drawing consensus for common patterns in protein-protein recognition. These results provide molecular insights for protein-protein binding towards the development of residue-level prediction models in future studies. Additionally, mutation experiments using hot spot residue databases [44] and detailed interface residue characterization (cation-pi, electrostatic and aromatic interactions [8]) will further strengthen this study, for individual structures. Furthermore, extending this analysis for a larger dataset with a combined formulation of atomic and residue level features in future studies may improve protein-protein docking.

Abbreviations

PPI:: Protein-Protein Interaction
ΔⁱG:: solvent free energy gain upon interface formation
BE:: Binding Energy
H-bonds:: Hydrogen bonds
PDB:: Protein Data Bank
ΔASA:: interface area
P:: Polar residues
NP:: Non-polar residues
S:: Surface polarity
I:: Interface Polarity.

References

Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28 (1): 235-242.
Article CAS PubMed Central PubMed Google Scholar
Reichmann D, Rahat O, Cohen M, Neuvirth H, Schreiber G: The molecular architecture of protein-protein binding sites. Curr Opin Struct Biol. 2007, 17 (1): 67-76.
Article CAS PubMed Google Scholar
Keskin O, Gursoy A, Ma B, Nussinov R: Principles of protein-protein interactions: what are the preferred ways for proteins to interact?. Chem Rev. 2008, 108 (4): 1225-1244.
Article CAS PubMed Google Scholar
Sudha G, Nussinov R, Srinivasan N: An overview of recent advances in structural bioinformatics of protein-protein interactions and a guide to their principles. Prog Biophys Mol Biol. 2014
Google Scholar
Sowmya G, Ranganathan S: Protein-protein interactions and prediction: a comprehensive overview. Protein Pept Lett. 2014, 21 (8): 779-789.
Article CAS PubMed Google Scholar
Bahadur RP, Zacharias M: The interface of protein-protein complexes: analysis of contacts and prediction of interactions. Cell Mol Life Sci. 2008, 65 (7-8): 1059-1072.
Article CAS PubMed Google Scholar
Jones S, Thornton JM: Protein-protein interactions: a review of protein dimer structures. Prog Biophys Mol Biol. 1995, 63 (1): 31-65.
Article CAS PubMed Google Scholar
Gromiha MM, Yokota K, Fukui K: Energy based approach for understanding the recognition mechanism in protein-protein complexes. Mol Biosyst. 2009, 5 (12): 1779-1786.
Article CAS PubMed Google Scholar
Jones S, Thornton JM: Principles of protein-protein interactions. Proc Natl Acad Sci USA. 1996, 93 (1): 13-20.
Article CAS PubMed Central PubMed Google Scholar
Gilson MK, Zhou HX: Calculation of protein-ligand binding affinities. Annu Rev Biophys Biomol Struct. 2007, 36: 21-42.
Article CAS PubMed Google Scholar
Sowmya G, Anita S, Kangueane P: Insights from the structural analysis of protein heterodimer interfaces. Bioinformation. 2011, 6 (4): 137-143.
Article PubMed Central PubMed Google Scholar
Lo Conte L, Chothia C, Janin J: The atomic structure of protein-protein recognition sites. Journal of molecular biology. 1999, 285 (5): 2177-2198.
Article CAS PubMed Google Scholar
Chothia C, Janin J: Principles of protein-protein recognition. Nature. 1975, 256 (5520): 705-708.
Article CAS PubMed Google Scholar
Miller S, Lesk AM, Janin J, Chothia C: The accessible surface area and stability of oligomeric proteins. Nature. 1987, 328 (6133): 834-836.
Article CAS PubMed Google Scholar
Saha RP, Bahadur RP, Pal A, Mandal S, Chakrabarti P: ProFace: a server for the analysis of the physicochemical features of protein-protein interfaces. BMC Struct Biol. 2006, 6: 11-
Article PubMed Central CAS PubMed Google Scholar
Dong F, Zhou HX: Electrostatic contribution to the binding stability of protein-protein complexes. Proteins. 2006, 65 (1): 87-102.
Article CAS PubMed Google Scholar
Xu D, Tsai CJ, Nussinov R: Hydrogen bonds and salt bridges across protein-protein interfaces. Protein Eng. 1997, 10 (9): 999-1012.
Article CAS PubMed Google Scholar
Guharoy M, Chakrabarti P: Conserved residue clusters at protein-protein interfaces and their use in binding site identification. BMC Bioinformatics. 2010, 11: 286-
Article PubMed Central CAS PubMed Google Scholar
Sheinerman FB, Norel R, Honig B: Electrostatic aspects of protein-protein interactions. Curr Opin Struct Biol. 2000, 10 (2): 153-159.
Article CAS PubMed Google Scholar
Wang T, Tomic S, Gabdoulline RR, Wade RC: How optimal are the binding energetics of barnase and barstar?. Biophys J. 2004, 87 (3): 1618-1630.
Article CAS PubMed Central PubMed Google Scholar
Wang JH, Smolyar A, Tan K, Liu JH, Kim M, Sun ZY, Wagner G, Reinherz EL: Structure of a heterophilic adhesion complex between the human CD2 and CD58 (LFA-3) counterreceptors. Cell. 1999, 97 (6): 791-803.
Article CAS PubMed Google Scholar
Zhang JL, Simeonowa I, Wang Y, Sebald W: The high-affinity interaction of human IL-4 and the receptor alpha chain is constituted by two independent binding clusters. J Mol Biol. 2002, 315 (3): 399-407.
Article CAS PubMed Google Scholar
Kuhlmann UC, Pommer AJ, Moore GR, James R, Kleanthous C: Specificity in protein-protein interactions: the structural basis for dual recognition in endonuclease colicin-immunity protein complexes. J Mol Biol. 2000, 301 (5): 1163-1178.
Article CAS PubMed Google Scholar
Wong ET, Na D, Gsponer J: On the importance of polar interactions for complexes containing intrinsically disordered proteins. PLoS Comput Biol. 2013, 9 (8): e1003192-
Article CAS PubMed Central PubMed Google Scholar
Sowmya G, Khan JM, Anand S, Ahn SB, Baker MS, Ranganathan S: A site for direct integrin alphavbeta6.uPAR interaction from structural modelling and docking. J Struct Biol. 2014, 185 (3): 327-335.
Article CAS PubMed Google Scholar
McCoy AJ, Chandana Epa V, Colman PM: Electrostatic complementarity at protein/protein interfaces. J Mol Biol. 1997, 268 (2): 570-584.
Article CAS PubMed Google Scholar
Kundrotas PJ, Alexov E: Electrostatic properties of protein-protein complexes. Biophys J. 2006, 91 (5): 1724-1736.
Article CAS PubMed Central PubMed Google Scholar
Edgar RC: Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010, 26 (19): 2460-2461.
Article CAS PubMed Google Scholar
Sander C, Schneider R: Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins. 1991, 9 (1): 56-68.
Article CAS PubMed Google Scholar
Tsodikov OV, Record MT, Sergeev YV: Novel computer program for fast exact calculation of accessible and molecular surface areas and average surface curvature. J Comput Chem. 2002, 23 (6): 600-609.
Article CAS PubMed Google Scholar
Lee B, Richards FM: The interpretation of protein structures: estimation of static accessibility. J Mol Biol. 1971, 55 (3): 379-400.
Article CAS PubMed Google Scholar
Chakrabarti P, Janin J: Dissecting protein-protein recognition sites. Proteins. 2002, 47 (3): 334-343.
Article CAS PubMed Google Scholar
McDonald IK, Thornton JM: Satisfying hydrogen bonding potential in proteins. J Mol Biol. 1994, 238 (5): 777-793.
Article CAS PubMed Google Scholar
Gupta PS, Mondal S, Mondal B, Islam RN, Banerjee S, Bandyopadhyay AK: SBION: A Program for Analyses of Salt-Bridges from Multiple Structure Files. Bioinformation. 2014, 10 (3): 164-166.
Article PubMed Central PubMed Google Scholar
Krissinel E, Henrick K: Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007, 372 (3): 774-797.
Article CAS PubMed Google Scholar
Liu S, Zhang C, Zhou H, Zhou Y: A physical reference state unifies the structure-derived potential of mean force for protein folding and binding. Proteins. 2004, 56 (1): 93-101.
Article CAS PubMed Google Scholar
Zhou H, Zhou Y: Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein Sci. 2002, 11 (11): 2714-2726.
Article CAS PubMed Central PubMed Google Scholar
Kaplan W, Littlejohn TG: Swiss-PDB Viewer (Deep View). Brief Bioinform. 2001, 2 (2): 195-197.
Article CAS PubMed Google Scholar
McDonald JH: Handbook of Biological Statistics. 2014, Baltimore, Maryland: Sparky House Publishing, 3
Google Scholar
RStudio: RStudio: Integrated development environment for R. 2012, Boston, v0.98.1091
Google Scholar
Laskowski RA: PDBsum new things. Nucleic Acids Res. 2009, 37 (Database): D355-359.
Article CAS PubMed Central PubMed Google Scholar
Gao Y, Wang R, Lai L: Structure-based method for analyzing protein-protein interfaces. J Mol Model. 2004, 10 (1): 44-54.
Article CAS PubMed Google Scholar
Chen J, Sawyer N, Regan L: Protein-protein interactions: General trends in the relationship between binding affinity and interfacial buried surface area. Protein Sci. 2013, 22 (4): 510-515.
Article CAS PubMed Central PubMed Google Scholar
Ahmad S, Keskin O, Mizuguchi K, Sarai A, Nussinov R: CCRXP: exploring clusters of conserved residues in protein structures. Nucleic Acids Res. 2010, 38 (Web Server): W398-401.
Article CAS PubMed Central PubMed Google Scholar

Download references

Acknowledgements

G.S. is the recipient of an iMQRES PhD scholarship from Macquarie University. We are grateful to Dr. Edmond J. Breen for help with statistical analysis and suggestions.

Declaration

Publication charges for this article have been funded by Macquarie University.

This article has been published as part of BMC Bioinformatics Volume 16 Supplement 18, 2015: Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/16/S18.

Author information

Authors and Affiliations

Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW, 2109, Australia
Gopichandran Sowmya & Shoba Ranganathan

Authors

Gopichandran Sowmya
View author publications
You can also search for this author in PubMed Google Scholar
Shoba Ranganathan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shoba Ranganathan.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

Conceived and designed the experiment: GS. Data collected, analyzed and drafted the manuscript: GS. Interpretation of results and finalizing the manuscript: GS and SR. All authors read and approved the final manuscript.

Electronic supplementary material

12859_2015_7199_MOESM1_ESM.pdf

Additional file 1: Table S1: Heterodimer dataset (278) divided into interface classes based on residue level relative surface-interface polarity. The PDB code is shown along with the specific chains used in this study. (PDF 153 KB)

12859_2015_7199_MOESM2_ESM.pdf

Additional file 2: Figure S1: Class A and Class B are significantly different. The boxplot depicts class A and class B significantly different with a p-value of 1.66E-45 (using Wilcoxon rank sum test). (PDF 276 KB)

12859_2015_7199_MOESM3_ESM.pdf

Additional file 3: Figure S2: Intermolecular H-bonds shows relatively low correlation with interface area in class B. Hydrogen bonds at the protein interface are highly correlated to interface area in the dataset (r = 0.88) and class A (r = 0.9), however shows relatively lower trends (r = 0.73) in class B. (PDF 278 KB)

12859_2015_7199_MOESM4_ESM.pdf

Additional file 4: Figure S3: Binding energy is highly correlated to interface area. BEs at the protein interfaces are highly correlated to interface area with r = -0.96. (PDF 236 KB)

12859_2015_7199_MOESM5_ESM.pdf

Additional file 5: Figure S4: Solvation free energy gain upon interface formation (ΔⁱG) shows limited correlation with interface area in class B complexes. ΔⁱG shows high correlation with interface area in (a) heterodimer dataset (r = -0.88), and (b) class A (r = -0.92), however shows limited correlation in (c) class B complexes (r = -0.62). (PDF 275 KB)

12859_2015_7199_MOESM6_ESM.pdf

Additional file 6: Figure S5: BE shows limited correlated with ΔⁱG in class B. Binding energies at the protein interfaces are highly correlated to solvation free energy gain upon interface formation (ΔⁱG) in the dataset (r = 0.88) and class A (r = 0.91), however shows limited correlation between BE and ΔⁱG in class B (r = 0.55). (PDF 145 KB)

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Sowmya, G., Ranganathan, S. Discrete structural features among interface residue-level classes. BMC Bioinformatics 16 (Suppl 18), S8 (2015). https://doi.org/10.1186/1471-2105-16-S18-S8

Download citation

Published: 09 December 2015
DOI: https://doi.org/10.1186/1471-2105-16-S18-S8

Joint 26th Genome Informatics Workshop and 14th International Conference on Bioinformatics: Bioinformatics

Discrete structural features among interface residue-level classes

Abstract

Background

Results

Conclusions

Background

Methods

Heterodimer dataset

Interface analysis

Classification based on relative interface-surface polarity

Intermolecular H-bonds and salt bridges calculation

ΔiG and BE calculation

Electrostatic potential at the interface

Statistical analysis

Results and discussion

PPI features among class A and class B complexes

Interface polarity abundance among classes

ΔiG among classes

BE among classes

Interface charged residues among classes

ΔASA among classes

Other correlations of interface features among classes

Electrostatic visualization maps among protein interface classes

Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

12859_2015_7199_MOESM1_ESM.pdf

12859_2015_7199_MOESM2_ESM.pdf

12859_2015_7199_MOESM3_ESM.pdf

12859_2015_7199_MOESM4_ESM.pdf

12859_2015_7199_MOESM5_ESM.pdf

12859_2015_7199_MOESM6_ESM.pdf

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us

ΔⁱG and BE calculation

ΔⁱG among classes