No statistical support for correlation between the positions of protein interaction sites and alternatively spliced regions
BMC Bioinformatics volume 5, Article number: 41 (2004)
Alternative splicing is an efficient mechanism for increasing the variety of functions fulfilled by proteins in a living cell. It has been previously demonstrated that alternatively spliced regions often comprise functionally important and conserved sequence motifs. The objective of this work was to test the hypothesis that alternative splicing is correlated with contact regions of protein-protein interactions.
Protein sequence spans involved in contacts with an interaction partner were delineated from atomic structures of transient interaction complexes and juxtaposed with the location of alternatively spliced regions detected by comparative genome analysis and spliced alignment. The total of 42 alternatively spliced isoforms were identified in 21 amino acid chains involved in biomolecular interactions. Using this limited dataset and a variety of sophisticated counting procedures we were not able to establish a statistically significant correlation between the positions of protein interaction sites and alternatively spliced regions.
This finding contradicts a naïve hypothesis that alternatively spliced regions would correlate with points of contact. One possible explanation for that could be that all alternative splicing events change the spatial structure of the interacting domain to a sufficient degree to preclude interaction. This is indirectly supported by the observed lack of difference in the behaviour of relatively short regions affected by alternative splicing and cases when large portions of proteins are removed. More structural data on complexes of interacting proteins, including structures of alternative isoforms, are needed to test this conjecture.
One of the main surprises resulting from the Human Genome Project was the realization that the enormous complexity of the human metabolism and regulation is encoded by a relatively small number of genes. While the estimates of this number still vary significantly in the range of 25000 to 35000  it is clear that we have fewer genes than maize , and only approximately five times more than the unicellular eukaryote S. cerevisiae . Earlier predictions postulated that the human genome should contain between 50.000 and 100.000 genes . These recent findings helped to realize the importance of post-transcriptional gene regulation in generating the proteomic complexity of the human cell.
One of the mechanisms for generating the protein diversity in eukaryotes is alternative splicing (AS). Current estimates of the prevalence of AS in the human genome range from one third of genes [5, 6] to about 60% . In fact, analysis of EST data showed that almost all human genes have possible alternatively spliced forms, although many of them seem to be non-functional . AS is also widely observed in genomes of other multicellular eukaryotes . There are two aspects of the AS phenomenon. Firstly, recent results demonstrate that alternative isoforms, especially tissue-specific ones, are often evolutionary young [10–12]. One possible interpretation for that is that AS provides a convenient evolutionary mechanism for generating new proteins without sacrificing existing ones . Secondly, AS leads to generation of proteins identical in some domains and different in others. This is a powerful regulatory mechanism [13, 14]. Indeed, it has been shown that AS, compared to a random expectation, tends to avoid disrupting protein domains, to shuffle entire domains, and to target functional sites in proteins . AS is frequent in genes involved in signal transduction and regulatory interactions .
To summarize, AS is a powerful mechanism for modulating the protein mode of action. Perhaps the most important type of functional context in which proteins execute their function is constituted by protein-protein interactions (PPI). It is thus tempting to speculate that AS could influence the structure of protein-protein interaction networks by selectively blocking or activating individual interactions dependent on cellular conditions.
Occasional evidence of the interplay between AS and PPI is scattered in the biomedical literature. For example, the domain composition of the Shank postsynaptic density protein is regulated by AS, defining the spectrum of its interaction partners . The interaction of Annexin XI with calcyclin was shown to be isoform specific, owing to partial deletion of the calcyclin-binding site in one of the splice forms . In Drosophila melanogaster the regulation of dynein targeting to various cellular organelles via binding to dynactin is modulated by structural variations in the N-terminal portion caused by AS . At the same time, other studies show no interdependence between protein interaction patterns and AS. For example, Liu et al.  reported that specific protein-protein interactions of cytoplasmic serine hydroxymethyltransferase are not regulated by AS.
To test the hypothesis that protein-protein interactions can be modulated in an alternative splicing dependent manner we have undertaken a study of AS events occurring in proteins known to be involved in transient interactions (see Figure 1). We were able to identify the total of 42 alternatively spliced isoforms in the amino acid chains participating in structurally characterized interaction complexes, as surveyed recently by Thornton and Nooren . In addition to statistics based on the entire dataset we also considered separately AS events changing more than a half of the protein length, where complete domain disruption is a likely result of AS , as well AS influencing less than 25% that could be expected to have less drastic effect. In all cases, no statistically significant relationship between AS and PPI could be established.
We begin with presenting several examples illustrating the interplay between PPI and AS. In the first case, shown in Figure 2, two alternatively spliced regions were identified in the cyclin-dependent kinase 2 (cdk2). The first (N-terminal) splice event eliminates nearly completely the PSTAIRE region of cdk2 which plays a central role in its interaction with cyclin . The second (C-terminal) sequence span deleted by splicing overlaps partially with the T-loop region of cdk, also involved in cdk2-cyclin interface. In the second example, presented in Figure 3, AS eliminates a large portion of the phosducin N-terminal domain, including one of the regions that contact the βγ-subunits of the retinal G-protein transducin . The corresponding isoform, called phosducin-like orphan protein 1, was reported to be less abundant than phosducin and not to be able to bind transducin, although the functional significance of this and other phosducin orphans remains unclear . Finally, in our third positive example (Figure 4) a significant portion of the interface between the human importin-β and the Ran protein  is spliced out, including the entire tandem repeat 7 as well as parts of repeats 6 and 8. This region also includes the so called "stalk region" with unknown function.
In all three examples above there is a clear overlap between the location of alternatively spliced regions and protein interaction sites. In many other cases, though, no such correlation could be established. For example, as seen in Figure 5, AS does not seem to affect the regions of the RhoA protein with which it binds the effector domain of serine-threonine kinases . Based on an extensive manual analysis of our PPI/AS dataset we were not able to formulate a definitive opinion as to the degree to which the location of AS regions correlates with PPI interfaces.
We subsequently attempted to investigate the strength of the correlation between the positions of AS and PPI regions using statistical analysis. Tables 1 and 2 present the entire body of structural PPI evidence and AS information used for this purpose. Our analysis resulted in delineation of 30 alternatively spliced isoforms for 16 proteins involved in heterodimer interactions and 12 isoforms for 5 proteins involved in homodimer interactions. These data were used as input to different counting procedures described in the Methods section. The overall summary for all types of dimers (with smoothing) is given in Table 3 (for individual positions; [see Additional file 1]) and Table 4 (for entire segments; [see Additional file 2]).
As shown in Table 3, in 15 out of 42 cases, the majority of random windows show less correlation with the contact positions than the real AS segments, in 3 cases the random windows show the same correlation, and in 24 cases the majority of random windows show more correlation with the contact positions than the real AS segment(s). Similarly, for contact segments (Table 4) in 13 cases the majority of windows show less correlation with the contact segments than the real AS segments, in 4 cases the correlation is the same, and in 25 cases most random windows show more correlation. Thus, compared to random control, AS shows a weak tendency to avoid contact regions; however, the chi-squared test shows that in both cases the difference is insignificant. The results of our study are thus negative: the hypothesis that AS might serve as a regulation mechanism for PPI networks could not be confirmed.
This study represents the first attempt to investigate the relationship between two complicated biological phenomena – protein-protein interactions and alternative splicing – based on the scarce experimental evidence currently available in the public databases. It was prompted by results of Kriventseva et al.  that could be interpreted as evidence that AS tends to target functionally important regions in proteins. However, we were not able to obtain convincing evidence of correlation between contact regions in protein-protein interactions and protein segments corresponding to alternatively spliced regions in mRNA. One possible explanation for that could be that our database of AS events was corrupted by aberrant splicing that abounds in EST databases . To guard against this possibility we considered only AS events expected to be reliable by multiple criteria, including presence in multiple ESTs from several clone libraries and conservation in the mouse genome. Another explanation could be that removal by AS of more than a half of the protein sequence would lead to complete disruption of the 3D protein structure, and thus any analysis relying on contact regions in the intact protein becomes meaningless.
However, we obtained essentially the same results on a subset of cases where the influence of AS was relatively modest (less than 25% of the protein sequence removed or substituted by AS; data not shown). Such shorter alternatively spliced regions are still much longer that a typical functionally important sequence motif of the type considered in . On the other hand, as described above, complete removal or substitution of the interaction domain is a well-established mechanism of regulating PPI by AS.
Given more data, it would be interesting to repeat the analysis of the type presented here on the domain level. For instance, one of the remaining open questions is whether AS tends to target (completely remove or partially destroy) interacting domains or other domains of proteins involved in PPI. Partially, this issue was addressed in a recent study  where it was shown that domains of types involved in protein-protein interactions frequently are removed or inserted by alternative splicing, although there was no observed increase in the rate of alternative splicing of such domains compared to other types of domains.
In this study we were not able to establish a statistically significant correlation between the positions of protein interaction sites and alternatively spliced regions. This finding implies that alternative splicing is not a preferred mechanism for controlling protein interaction networks, although many individual protein interactions are known to be isoform-dependent.
Protein interaction data
A comprehensive survey of structurally characterized transient protein interaction complexes has been recently published . We used tables 3 and 4 from this study which list the PDB  codes and chain identifiers of interacting proteins pairs in 24 weak homodimers and 25 heterodimers, respectively. Protein sequence spans involved in physical interactions were identified with the help of the Protein-Protein Interaction Server http://www.biochem.ucl.ac.uk/bsm/PP/server. Contacts between amino acid residues were defined based on the distance threshold of 6 Å between any pair of non-hydrogen atoms. PDB sequences were saved and the positions of residues participating in contacts recorded for each sequence separately.
Detection of AS events
Amino acid chains with known positions of interaction regions were further analysed to detect putative AS events. Initial mapping of the protein sequences to the human genome was conducted using TBLASTN  searches. Each hit that coincided both with a human gene annotated in LocusLink  and the UniGene  cluster linked to the latter was used to obtain a complete set of mRNA and EST sequences corresponding to the gene. mRNAs with annotated protein-coding regions were used to derive the corresponding protein sequences. The exon-intron structure of each gene with all known alternatives was established by spliced alignment of ESTs and mRNAs to the genomic DNA sequence using Pro-EST ; proteins were spliced-aligned to DNA using Pro-Frame . For further analysis only alternatives changing the coding region covered by the known three-dimensional structure were considered.
An alternative region (that is, a region present in a fraction of isoforms) was assumed to be reliable if it was supported by a full-length mRNA or a protein, or by several ESTs from different clone libraries. An alternative region was discarded if it was observed in less than three ESTs or ESTs from only one clone library. Further, only conserved AS events, defined by comparison with the mouse genome, were retained. The procedure for establishing the conservation of the alternative splicing events is described in detail in . Briefly, for each human gene the orthologous mouse gene was obtained from HomoloGene , and human protein isoforms were spliced-aligned to the mouse gene DNA using Pro-Frame. An alternative region was considered as conserved if it could be aligned with the same similarity level as the rest of the protein, and the corresponding mouse exons were bounded by canonical splicing sites.
Counting coincidences of AS and PPI regions
Positions of contact regions in protein sequences were compared with positions of AS regions to determine whether there was a significant correlation between them. For each protein sequence considered, two bit strings were generated representing the involvement of each amino acid position in AS and PPI, as depicted in Figure 6. One technical difficulty is that many contact regions are very short (often comprised by just one amino acid residue) and/or may be interrupted by individual residues or groups of residues that do not take part in interactions. We thus used two smoothing parameters – N and M – to group interacting residues in a given amino acid chain into contiguous contact segments. N was defined as the minimal allowed length of a spacer between two residues involved in interactions, and M represents the minimal allowed length of a contact segment. Contact segments separated by spacers shorter than N were merged, and then the segments shorter than M were deleted. We tested different combinations of N and M, and found that the obtained results were robust as regards these parameters (data not shown). The results for N = 5 and M = 1 are reported below.
Several different approaches to calculate the correlation between AS and PPI bit strings with and without smoothing were tested. The first, residue-by-residue approach involved considering each amino acid position separately and counting the number of positions in which the values of AS and PPI bit strings are equal or differ (Figure 7a). Each position can thus be classified as belonging to one of the four classes: AS/PPI, AS/non-PPI, non-AS/PPI, or non-AS/non-PPI. In a second, area-based approach (Figure 7b), we first grouped the adjacent positions of bit strings possessing identical values and then calculated the correlation between occurrences of such groups as a whole. Entire PPI segments were classified into three classes: entirely overlapping with AS, partially overlapping with AS, and non-overlapping with AS.
To estimate the degree of correlation, two types of statistical analysis were employed. Initially we applied the standard χ2 test to the contingency tables formed by two parameters describing each position (PPI/non-PPI, AS/non-AS). However, as the results of this analysis were inconclusive (data not shown), we compared the observed correlation with that assuming fixed contacts and random placement of AS segments (cf. the procedure in ). Formally, for each AS isoform we considered a window of the same length as the length of the alternatively spliced segment. For each position of the window the correlation between the contact positions (or entire segments) and the current position of the window was computed as described in the previous paragraph. Then for the current position of the window the computed correlation and the correlation of the real AS segment were compared. The current window was subsequently classified into one of three classes: higher correlation, same correlation, and lower correlation. The same procedure was generalized for the case of two (or more) AS segments in an isoform. In this case, all non-overlapping pairs of windows were considered and the same procedure was applied. Again, the window sizes were set to the sizes of the real AS segments.
Ensembl Genome Browser[http://www.ensembl.org/Homo_sapiens/]
maizegenome.org – Why Maize?[http://www.maizegenome.org/why_maize.html]
Comprehensive Yeast Genome Database[http://mips.gsf.de/genre/proj/yeast/index.jsp]
Gene Sweep 2003–2004[http://www.ensembl.org/Genesweep]
Mironov AA, Fickett JW, Gelfand MS: Frequent alternative splicing of human genes. Genome Res 1999, 9: 1288–1293. 10.1101/gr.9.12.1288
Brett D, Hanke J, Lehmann G, Haase S, Delbruck S, Krueger S, Reich J, Bork P: EST comparison indicates 38% of human mRNAs contain possible alternative splice forms. FEBS Lett 2000, 474: 83–86. 10.1016/S0014-5793(00)01581-7
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, Szustakowki J, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, International Human Genome Sequencing Consortium: Initial sequencing and analysis of the human genome. Nature 2001, 409: 860–921. 10.1038/35057062
Kan Z, States D, Gish W: Selecting for functional alternative splices in ESTs. Genome Res 2002, 12: 1837–1845. 10.1101/gr.764102
Brett D, Pospisil H, Valcarcel J, Reich J, Bork P: Alternative splicing and genome complexity. Nat Genet 2002, 30: 29–30. 10.1038/ng803
Modrek B, Lee CJ: Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat Genet 2003, 34: 177–180. 10.1038/ng1159
Nurtdinov RN, Artamonova II, Mironov AA, Gelfand MS: Low conservation of alternative splicing patterns in the human and mouse genomes. Hum Mol Genet 2003, 12: 1313–1320. 10.1093/hmg/ddg137
Thanaraj TA, Clark F, Muilu J: Conservation of human alternative splice events in mouse. Nucleic Acids Res 2003, 31: 2544–2552. 10.1093/nar/gkg355
Boise LH, Gonzalez-Garcia M, Postema CE, Ding L, Lindsten T, Turka LA, Mao X, Nunez G, Thompson CB: bcl-x, a bcl-2-related gene that functions as a dominant regulator of apoptotic cell death. Cell 1993, 74: 597–608. 10.1016/0092-8674(93)90508-N
Peneff C, Ferrari P, Charrier V, Taburet Y, Monnier C, Zamboni V, Winter J, Harnois M, Fassy F, Bourne Y: Crystal structures of two human pyrophosphorylase isoforms in complexes with UDPGlc(Gal)NAc: role of the alternatively spliced insert in the enzyme oligomeric assembly and active site architecture. EMBO J 2001, 20: 6191–6202. 10.1093/emboj/20.22.6191
Kriventseva EV, Koch I, Apweiler R, Vingron M, Bork P, Gelfand MS, Sunyaev S: Increase of functional diversity by alternative splicing. Trends Genet 2003, 19: 124–128. 10.1016/S0168-9525(03)00023-4
Modrek B, Resch A, Grasso C, Lee C: Genome-wide detection of alternative splicing in expressed sequences of human genes. Nucleic Acids Res 2001, 29: 2850–2859. 10.1093/nar/29.13.2850
Lim S, Naisbitt S, Yoon J, Hwang JI, Suh PG, Sheng M, Kim E: Characterization of the Shank family of synaptic proteins. Multiple genes, alternative splicing, and differential expression in brain and development. J Biol Chem 1999, 274: 29510–29518. 10.1074/jbc.274.41.29510
Sudo T, Hidaka H: Regulation of calcyclin (S100A6) binding by alternative splicing in the N-terminal regulatory domain of annexin XI isoforms. J Biol Chem 1998, 273: 6351–6357. 10.1074/jbc.273.11.6351
Nurminsky DI, Nurminskaya MV, Benevolenskaya EV, Shevelyov YY, Hartl DL, Gvozdev VA: Cytoplasmic dynein intermediate-chain isoforms with different targeting properties created by tissue-specific alternative splicing. Mol Cell Biol 1998, 18: 6816–6825.
Liu X, Szebenyi DM, Anguera MC, Thiel DJ, Stover PJ: Lack of catalytic activity of a murine mRNA cytoplasmic serine hydroxymethyltransferase splice variant: evidence against alternative splicing as a regulatory mechanism. Biochemistry 2001, 40: 4932–4939. 10.1021/bi002598t
Nooren IM, Thornton JM: Structural characterisation and functional significance of transient protein-protein interactions. J Mol Biol 2003, 325: 991–1018. 10.1016/S0022-2836(02)01281-0
Jeffrey PD, Russo AA, Polyak K, Gibbs E, Hurwitz J, Massague J, Pavletich NP: Mechanism of CDK activation revealed by the structure of a cyclinA-CDK2 complex. Nature 1995, 376: 313–320. 10.1038/376313a0
Gaudet R, Bohm A, Sigler PB: Crystal structure at 2.4 angstroms resolution of the complex of transducin betagamma and its regulator, phosducin. Cell 1996, 87: 577–588. 10.1016/S0092-8674(00)81376-8
Craft CM, Xu J, Slepak VZ, Zhan-Poe X, Zhu X, Brown B, Lolley RN: PhLPs and PhLOPs in the phosducin family of G beta gamma binding proteins. Biochemistry 1998, 37: 15758–15772. 10.1021/bi980921a
Vetter IR, Arndt A, Kutay U, Gorlich D, Wittinghofer A: Structural view of the Ran-Importin beta interaction at 2.3 A resolution. Cell 1999, 97: 635–646. 10.1016/S0092-8674(00)80774-6
Maesaki R, Ihara K, Shimizu T, Kuroda S, Kaibuchi K, Hakoshima T: The structural basis of Rho effector recognition revealed by the crystal structure of human RhoA complexed with the effector domain of PKN/PRK1. Mol Cell 1999, 4: 793–803.
Resch A, Xing Y, Modrek B, Gorlick M, Riley R, Lee C: Assessing the impact of alternative splicing on domain interactions in the human proteome. J Proteome Res 2004, 3: 76–83. 10.1021/pr034064v
Berman HM, Battistuz T, Bhat TN, Bluhm WF, Bourne PE, Burkhardt K, Feng Z, Gilliland GL, Iype L, Jain S, Fagan P, Marvin J, Padilla D, Ravichandran V, Schneider B, Thanki N, Weissig H, Westbrook JD, Zardecki C: The Protein Data Bank. Acta Crystallogr D Biol Crystallogr 2002, 58: 899–907. 10.1107/S0907444902003451
Protein-Protein Interaction Server[http://www.biochem.ucl.ac.uk/bsm/PP/server]
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215: 403–410. 10.1006/jmbi.1990.9999
Wheeler DL, Church DM, Federhen S, Lash AE, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Tatusova TA, Wagner L: Database resources of the National Center for Biotechnology. Nucleic Acids Res 2003, 31: 28–33. 10.1093/nar/gkg033
Mironov AA, Novichkov PS, Gelfand MS: Pro-Frame: similarity-based gene recognition in eukaryotic DNA sequences with errors. Bioinformatics 2001, 17: 13–15. 10.1093/bioinformatics/17.1.13
This study was partially supported by grants from the Ludwig Institute of Cancer Research (CRDF RB0-1268), the Howard Hughes Medical Institute (55000309), Russian Fund for Basic Research (04-04-49440), the Fund for Support of the Russian Science, and the Program in Molecular and Cellular Biology of the Russian Academy of Sciences.
MO conducted protein structure analysis and implemented statistical techniques used in this work. He also did a major part of data analysis work, created statistical tables and figures presented in this paper. RNN identified the positions of alternatively spliced regions based on comparative genome analysis and spliced alignment. MSG proposed the statistical methods used in this work and supervised their development in collaboration with MO. DF and MSG, together, conceived the original hypothesis of this study. DF proposed the specific approach adopted in this study and supervised the work of MO in the part related to protein structure analysis. The manuscript was written by DF and MSG. All authors read and approved the manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Offman, M.N., Nurtdinov, R.N., Gelfand, M.S. et al. No statistical support for correlation between the positions of protein interaction sites and alternatively spliced regions. BMC Bioinformatics 5, 41 (2004). https://doi.org/10.1186/1471-2105-5-41
- protein-protein interactions
- alternative splicing
- protein structure analysis