- Research article
- Open Access
Amyloid precursor protein interaction network in human testis: sentinel proteins for male reproduction
BMC Bioinformaticsvolume 16, Article number: 12 (2015)
Amyloid precursor protein (APP) is widely recognized for playing a central role in Alzheimer's disease pathogenesis. Although APP is expressed in several tissues outside the human central nervous system, the functions of APP and its family members in other tissues are still poorly understood. APP is involved in several biological functions which might be potentially important for male fertility, such as cell adhesion, cell motility, signaling, and apoptosis. Furthermore, APP superfamily members are known to be associated with fertility. Knowledge on the protein networks of APP in human testis and spermatozoa will shed light on the function of APP in the male reproductive system.
We performed a Yeast Two-Hybrid screen and a database search to study the interaction network of APP in human testis and sperm. To gain insights into the role of APP superfamily members in fertility, the study was extended to APP-like protein 2 (APLP2). We analyzed several topological properties of the APP interaction network and the biological and physiological properties of the proteins in the APP interaction network were also specified by gene ontologyand pathways analyses. We classified significant features related to the human male reproduction for the APP interacting proteins and identified modules of proteins with similar functional roles which may show cooperative behavior for male fertility.
The present work provides the first report on the APP interactome in human testis. Our approach allowed the identification of novel interactions and recognition of key APP interacting proteins for male reproduction, particularly in sperm-oocyte interaction.
Amyloid precursor protein (APP) is known as a pathological hallmark of Alzheimer's disease (AD). Nevertheless, APP, a type I transmembrane glycoprotein consisting of a large extracellular domain, a single transmembrane domain, and a short cytoplasmic tail, is expressed ubiquitously and given its receptor-like and adhesive characteristics may play important roles outside the nervous system. In fact, we have previously showed that APP is present in spermatozoa . The APP superfamily includes APP and APP-like proteins (APLP) 1 and 2. Alternative splicing of the APP mRNA produces eight isoforms, ranging in size from 677–770 amino acids . Alternative splicing produces four APLP1 and two APLP2 protein isoforms. Although some isoforms may be cell type specific, APP and APLP2 are ubiquitously expressed. In contrast, APLP1 is expressed selectively in the nervous system . Only APP, but not APLP1 and 2, contains a sequence encoding the beta-amyloid domain. The transmembranar structure of APP is consistent with a role of APP as a receptor or a mediator of extracellular interactions. It has been suggested that APP may have CAM (Cell Adhesion Molecule) and SAM (Substrate Adhesion Molecule) like activities .
Various lines of evidence implicate APP and APLP2 in fertility. APP was shown to be expressed in rat testis and localized in the acrosome region and growing tail of spermatids in the seminiferous tubules . Knock-out mice, homozygotes to either APP(−/−) or APLP2(−/−) were fertile, but mice with the deletion of both APP(−/−) and APLP2(−/−) were infertile (9 of 10 females and all males) . We previously characterized the subcellular distribution of the APP superfamily members in spermatozoa using a variety of antibodies that either recognizes APP-specific epitopes or the epitopes shared with other APP family members . The presence of APP superfamily members along the entire length of the tail may be related to signaling events involved in sperm motility, whereas their presence in the head and particularly in the equatorial region suggests their involvement in sperm-oocyte interaction . These results not only were consistent with the previous localization of APLP2 in mammalian sperm, but also prove the presence of APP itself in human sperm. APP and APLPs distribution only partially overlap suggesting that, besides a common role, they might also have distinct functions in spermatozoa. A human sperm transmembrane protein initially termed YWK-II (later shown to be an APLP2 homologue) was shown to be involved in fertilization [7,8]. The YWK-II gene was shown to be expressed in germ cells at various stages of differentiation and in the plasma membrane enveloping the acrosome of mature spermatozoa .
The discovery of tissue-specific interacting proteins can lead to the identification of pathways for the APP family members associated with testis and sperm functions. Hence, we performed a Yeast Two-Hybrid (YTH) screen of a human testis cDNA library using APP as bait. A comprehensive bioinformatic analysis was also performed using the APP interacting proteins identified in the YTH in addition to the proteins selected from public protein-protein interactions (PPI) databases (DB) and published APP interactome data  associated with testis/sperm. APLP2 interacting partners were also included. Additionally, protein interaction maps were constructed allowing the visualization of PPI data as a connectivity graph and the data was subjected to a statistical analysis based on the complex network theory. The advantage of this approach is that it allows the study not only of the local properties of proteins in the network, but also their global structural characteristics in the entire network of PPI. We reveal that proteins with similar biological functions are tightly connected to each other and form dense groups (modules or k-cores) in the networks. Also, the function, cellular distribution and pathways were analyzed and significant features were classified.
In this study, we characterized the testis/spermatozoa interactome of APP using a network-based approach.
Identification of APP interacting proteins in human testis by Yeast Two-Hybrid screening
Nowadays the YTH methodology is a very robust technique to identify PPI [10-13]. The method that we use has been highly improved and overcomes the initial problems of the YTH, e.g. the appearance of false positive or false negative interactions , since, for instance, we use four reporter genes with different strength promoters [10-13].
In order to identify APP interacting proteins expressed in human testis, an YTH screen of a human testis cDNA library was carried out using full-length human APP. The screen yielded 147 positive clones from a total of 3×108 clones screened. After partial or complete sequence analysis (depending on the length of the positive clone’s cDNAs), in silico searches of the GenBank DB allowed their identification and classification into three separate groups. Table 1 corresponds to library inserts encoding known proteins identified as putative APP interactors. The second and third groups correspond to clones putatively encoding novel APP interacting proteins with homology to genomic sequences and lists positives where the GenBank sequence similarity did not correspond to an annotated gene and false positive hits, respectively. Table 1 lists only 1 positive encoding a previously identified APP interacting protein (RANBP9) (Figure 1). 77 clones encoded 36 known proteins that were not previously associated with APP (Figure 1). Only the clones in Table 1 were included in the network and further functional analyses (Figure 2).
Analysis of the YTH screen revealed that the most abundant interaction was detected with SEC22C (9 out of the 147 positive clones) (Figure 1). This protein is involved in vesicle transport between the ER and the Golgi complex.
The 37 proteins identified as APP interactors were classified into broad functional categories according to Gene Ontology annotation using the DAVID bioinformatics resource (Additional file 1: Table S1). Regarding the biological process, the categories with the largest number of proteins were related to intracellular transport (20.6%) and protein localization (20.6%). From the proteins involved in transport, 5 were linked with vesicle-mediated transport (SYNRG, BCAP29, SEC22C, FTL and STX5). Also, 5 proteins (CD99, LYPD3, GPNMB, ITGB5 and SSPN) were associated with cell adhesion. CD81, CREB3 and FANCM were annotated as being involved in reproduction. The majority of APP interactors identified in the YTH (67.6%) are intrinsic to membrane and 7 are specifically at the plasma membrane (Additional file 1: Table S1).
Analysis of human proteomes (testis, epididymis, and spermatozoa) allowed the classification of DPEP1 and TMPRSS12 as testicular proteins; ITGB5 and COPS5 as sperm-located testicular proteins also detected in epididymal fluid; and FTL as a non-sperm located epididymal fluid protein (Additional file 2: Table S2). CD81, CD99, COPS5 and FAM134 were identified as testis/sperm-enriched in tissue-expression DBs [15-18]. Also, TMPRSS12 was reported in the Unigene as a testicular/spermatozoa restricted protein.
To determine which proteins are known to be important for normal male reproductive function, the dataset was screened against the Jackson Laboratory mutant mouse DB  and Phenopedia . From the APP interactors identified in the YTH screen, 3 were connected with reproductive phenotypes in gene knockout models (RANBP9, CREB3 and FANCM). From the comparison with the disease genes listed in Phenopedia no results were obtained.
Identification of literature curated interactions
In order to identify the potentially relevant interactors of APP and APLP2 to male fertility, human PPI were collected from currently available public DBs, including APID , BioGRID , DIP , HPRD , InnateDB , Intact , MINT , Reactome , TopFind , and STRING . Only the interactions between both proteins associated with the terms “testis” and “sperm” in Unigene, HPRD  and Uniprot  were selected. Then, the interactors characterized as highly specific to or strongly expressed in testis/sperm were identified from tissue-expression DBs (C-It , TiGER , UniGene, BioGPS , VeryGene  and HPA ). (See Methods and Additional file 3: Table S3). Besides the DBs, the tissue expression data was also retrieved from the published proteomes of reproductive tissues [34-38]. This analysis allowed the classification of the APP direct interactors into distinct but overlapped localizations (Additional file 2: Table S2).
First, we focused on local interactions of APP/APLP2, that is, the first direct interactors of APP/APLP2 and interactions between them. We identified 455 proteins connected to APP (Methods) including the partners identified by YTH (Figure 3a). All the proteins in the YTH data were newly found as interactors of APP except RANBP9, which was previously published as an APP interactor . The absence of protein overlapping may be due to the fact that the YTH was performed using a library from human testis and the previous APP interactors were mainly identified in neuronal tissues. Indeed, published data indicate that 4% of the mammalian genome (more than 2,300 genes) encodes genes specifically expressed in the male germ line during or after the completion of spermatogenesis . Regarding APLP2, we identified 6 proteins (including APP) as its interactors from the DBs which were highly specific to or strongly expressed in testis. In total, 1,803 interactions were identified between 457 proteins including APP and APLP2. Only one protein (BRCA1) among the nearest neighbors of APLP2 was not directly connected to APP which may reflect an isoform-specific role for APLP2.
Second, we extended the local interaction network of APP/APLP2 into the second nearest neighborhoods since the local network of APP could limit an overview of the pathways in which this protein may be involved in testis and spermatozoa. In this network, we had 2,733 proteins and 17,188 interactions between them (Methods).
Topological analysis of APP/APLP2 PPI network
The overall structural properties for the local and extended APP/APLP2 network showed mostly linear relationship between degree [the number of nearest neighbors (connectivity) of a certain node] and betweenness centrality [fraction of shortest paths between all other nodes that pass through a certain node (Additional file 4: Table S4)]. In our APP/APLP2 local network, proteins with high connectivity also revealed high centrality which can be a significant indication of the relevant proteins in a biological network.
Local APP/APLP2 interaction network
In the testis/sperm related APP/APLP2 network (Figure 3a), most proteins were densely connected to each other. Average degree of this network was < q > = 3.95 and global clustering coefficient was C = 0.51. The clustering coefficient reflects how neighbors of a node are connected to each other (Additional file 4: Table S4).
It is known that proteins with high connectivity (hubs) in PPI networks potentially have functional importance in biological systems and are likely to be critical proteins . The key proteins for disease are known to have low clustering coefficients in addition to high connectivities . In order to characterize the APP network topology, the clustering coefficients of each protein were calculated. Betweenness centrality and closeness centrality of each protein in the APP/APLP2 network were also measured to find the relevant proteins involved in pathways (Additional file 4: Table S4). In biological networks, e.g. signaling pathways and genetic interactions, the dysfunction of the proteins with high centrality may be crucial for the other biological functions due to missing of signal transference. In yeast, the proteins with high betweenness centrality, but small number of degrees were found to be important links between well connected modules . Proteins with high centrality rank in our network were represented in Additional file 5: Table S4 (Supporting Text and Additional file 5: Table S4). The top rated interactors included a calcium/phospholipid-binding protein which promotes membrane fusion and is involved in exocytosis (ANXA1, annexin A1). PIK3CG (phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoform), PLCB3 (1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-3), LPAR2 (lysophosphatidic acid receptor 2), RLN3 (relaxin-3 receptor 2) and ADORA3 (protein ADORA, isoform 3) also composed the top rated proteins and were all related with the G-protein coupled receptor signaling pathway.
The proteins identified as APP/APLP2 interactors were classified into functional categories according to Gene Ontology annotation using the DAVID program . Regarding the biological process, the results revealed that the categories with the largest number of proteins were related to cell surface receptor linked signal transduction (43.0%; p value = 7.0E-60) and G-protein coupled receptor protein signaling pathway (33.3%; p value = 7.5E-58). 50.8% of the proteins were located at the plasma membrane (p value = 2.4E-26) and 23.2% in the extracellular region (p value = 1.7E-6). 13.0% were associated with vesicles (p value = 1.8E-11). These vesicle related proteins may participate in specialized vesicle activity in the testis, such as acrosome formation. Additionally, 13.9% are annotated as part of a cell projection (e.g. a flagellum) (p value = 1.0E-12).
Metabolic pathways were analyzed using the KEGG PATHWAY , which indicated that the top 4 significant categories were: Neuroactive ligand-receptor interaction (16.3%; p value = 2.5E-34); Chemokine signaling pathway (10.4%; p value = 1.3E-18); Calcium signaling pathway (8.6%; p value = 1.8E-13); and Progesterone-mediated oocyte maturation (5.5%; p value = 2.2E-11).
In order to find the core and peripheral part of the local APP/APLP2 network, k-core analysis was performed. k-core is a subgraph of a graph in which all vertices have at least k-degree (Additional file 4: Table S4). The core of this network has a connectivity, k = 11, between them. 75.0% of the proteins within this core shared the same biological process GO category (G-protein coupled receptor protein signaling pathway; p value = 6.6E-15) and 50.0% share the same subcellular localization (plasma membrane; p value = 6.9E-4).
Four modules were identified by community detection analysis (Additional file 4: Table S4). The nodes in a community are more tightly connected to each other than to nodes out of the community and may perform a common function. Our analysis shows that, in the local APP/APLP2 network, 206 proteins are included in module 1 (light orange in Figure 3a). Among them, 19.7% were involved in the regulation of apoptosis (p value = 1.5E-12) and the most significant represented cellular localization was the cell surface (16.3%, p value = 2.3E-17. The most prominent biological process detected in module 2 (which comprised 84 proteins in total, dark green in Figure 3a) was G-protein coupled receptor protein signaling pathway (76.2%; p value = 6.6E-52) and 71.4% of the proteins in module 2 shared the same localization (plasma membrane; p value = 5.6E-15). Module 3 included 85 proteins (light green in the Figure 3a), which were mainly involved in G-protein coupled receptor protein signaling pathway (94.0%; p value = 5.0E-80) and localized at the plasma membrane (76.2%, p value = 2.5E-18). Similarly to module 1, the most significant category in module 4 (which comprised 82 proteins in total, dark orange in Figure 3a) was regulation of apoptosis (39.0%, p value = 4.8E-19). Modules 2 and 3 also share a common biological function and cellular component. In the local APP/APLP2 network, the core part of the network, that is, the proteins with the highest k-core (k = 11) which includes APP, shared the same biological function (G-protein coupled receptor protein signaling pathway). The core part was included mostly in module 2, which was also associated with the same function. Therefore, this result showed that APP might be involved in G-protein coupled receptor protein signaling pathway in human testis/sperm. In addition, APP has high possibility that it is associated with regulation of apoptosis according to the results that the most interaction partners surrounding APP share the same function within modular structure.
Extended APP/APLP2 network
The local APP network only allows us to study relationships between APP and its nearest neighboring proteins. In order to study relationships with other proteins, we extended the network to the second nearest neighbors of APP. Topological properties of the extended networks are analyzed in Additional file 6: Table S5. From k-core analysis, a densely connected group with high k (=17)-core was found (186 proteins). The proteins in the core were involved in cell cycle (41.1%; p value = 6.4E-46). Also, the majority of proteins were found in the cytosol (49.2%; p value = 1.0E-41) and the nucleoplasm (40.0%, p value = 2.0E-38).
Based on the community structure analysis, APP is located in module 2. The most significant biological processes associated with this module were proteolysis (19.3%; p value = 6.6E-9) and cell adhesion (15.9%; p value = 1.3E-9). Additionally, the majority of proteins were located at the plasma membrane (42.0%; p value = 1.3E-6) and at the extracellular region (32.4%; p value = 5.4E-12).
Table 2 represents the gene ontology analysis for the modules of the extended network in which at least 40% of proteins shared a biological function.
Specific topological features of proteins from YTH
Based on the extended network structure analyses, COPS5 has a large number of connections (q = 152) and also a relatively high betweenness centrality (b = 0.046) among our 37 YTH proteins, contrasting with a low clustering coefficient (C = 0.014). COPS5 is sperm-located testicular protein . CD81 (q = 27, b = 0.005, C = 0.029), CD99 (q = 13, b = 0.001, C = 0.064) and IGTB5 (q = 10, b = 0.0003, C = 0.089) also revealed prominent topological properties (Additional file 6: Table S5).
Topological role of APP and APLP2 in the network
Previous data has shown that the absence of both APP and APLP2 lead to the abnormal developments of sexual organs, the reduction of synaptic vesicles, and even postnatal lethality in mice . On the other hand, the absence of either APP or APLP2 does not affect viability and fertility. Based on these, one can imagine that these two proteins should co-exist for the mammalian life maintenance. Here, we focus on the role of APP and APLP2 for the human male fertility. Based on experimental results of gene knock-outs in mice , one can assume that APP and APLP2 are simultaneously involved in important pathways. Some proteins or protein complexes cannot accomplish biological functions in the absence of APP/APLP2, because this blocks the functional routes between the proteins. In order to find a conformity of the functional property within a structural property, we checked the local triangle structure between APP, APLP2, and the common interactors (CDK1, DAB2, JUN, and PIK3CA). Among the interactors of APLP2, only BRCA1 is not connected to APP. These common interactors of APP and APLP2 form a small modular structure (Additional file 7: Figure S1). Therefore, one can guess the proteins in this module possibly share a biological function in testis.
Biomolecular networks are now frameworks that facilitate many discoveries in molecular biology. The theoretical advances in network science in parallel with high throughput efforts to map biological networks, offer an excellent opportunity to apply the principles of theoretical physics to the molecular biomedicine field.
The APP network in testis/sperm was built first using an YTH screen and then expanded by incorporating literature curated interactions. Since protein profiles of the different tissues are critical to understand the unique characteristics of the various human cell types, in this study, we took into account the tissue expression of the interactors in the network. From the YTH screen, we reported the identification of 36 novel APP interacting proteins in human testis/sperm. Only 1 positive encoded a previously identified APP interacting protein (RANBP9). This may be explained by the fact of testis being a very peculiar organ which possesses specific patterns of transcription and expresses novel protein isoforms [39,40]. APLP2 interacting partners were also included in this study.
To determine which PPI in our APP/APLP2 network were biologically more relevant for male reproduction, we performed network structure analyses and bioinformatic analyses. Based on the community/modularity analysis of the PPI network along with gene ontology analysis, we confirmed that proteins involved in similar functions are group together and form modules. The biological process GO category more significantly represented in the APP/APLP2 local network in human testis/sperm was cell surface receptor linked signal transduction with 43.0% of the proteins annotated in this class. These proteins may indicate how the male germ cells interact with the outside world. Among those proteins, 33.3% carry the GO functional tag for G-protein coupled receptor protein signaling pathway. Some studies indicate that full length APP can function as a cell surface GPCR and show that APP binds heterotrimeric G proteins (Gαo) [45,46]. Recently, Deyts and colleagues discovered an interaction between APP intracellular domain and the heterotrimeric G-protein subunit Gαs . G protein-coupled receptors signalling pathways have been proposed to control several processes essential for sperm function and fertilization, namely in sperm capacitation and acrosome reaction [48-51]. APLP2/YWK-II also exhibits properties of a receptor and its extracellular domain was shown to interact with Müllerian-inhibiting substance . Müllerian-inhibiting substance increases the viability and longevity of human spermatozoa through binding the APLP2/YWK-II component on the sperm membrane . Huang and colleagues showed that APLP2/YWK-II component binds to a GTP-binding protein (Gαo).
The most abundant interaction detected in the YTH was with SEC22C. This protein is involved in vesicle transport between the ER and the Golgi complex . Vesicular membrane trafficking is an essential process during acrosome biogenesis . Also, SEC22C may control the APP traffic through the secretory pathway. Besides SEC22C, other four YTH clones (BCAP29, FTL, STX5, and SYNRG) are involved in vesicle-mediated transport (Figure 3c). This GO term includes the regulation of the acrosomal vesicle exocytosis, an essential process for fertilization, which begins with the fusion of the outer acrosomal membrane with the sperm plasma membrane and ends with the exocytosis of the acrosomal contents into the oocyte.
The cellular component category most enriched in the GO term analysis of the APP/APLP local network was the plasma membrane. Fertilization is achieved through gamete interactions, specifically cell adhesion and then membrane fusion of the gamete plasma membranes. The occurrence of 50.8% of proteins in the plasma membrane may suggest their involvement in sperm-egg interaction. Additionally, 10.2% of APP interactors are involved in cell adhesion (Figure 3b). Of these, CD99, GPNMB, ITGB5, LYPD3, and SSPN were identified in the YTH screen performed using a testis library. The APP yeast mating efficiency in the YTH was much higher than usual (50%, when compared to a normal 5%), which may be related to APP cell adhesion properties. This strengthens previous results suggesting APP to be involved in cell-to-cell contact, a very important process in gamete fusion. Recent approaches to identify candidate proteins involved in sperm-egg interaction have been characterizing the sperm proteome and analyzing specific subpopulations of interest, for instance, glycoslylated proteins and integrins. Additionally, proteins with motifs or belonging to families of interest like transmembrane domains and the tetrasparin family should also be considered. Interestingly, some of the YTH positive clones are included in those categories. APP interacts with ITGB5, identified in the YTH screen, and ITGB1 , both belonging to the integrin beta chain family. Integrins on eggs became of interest with the discovery of an integrin ligand-like domain in ADAM2, a sperm antigen essential for sperm-egg interaction.TSPAN6 and CD81 belong to the tetraspanin family. The discovery that the knockout of CD9, a member of the tetraspanin family, in mouse leads to healthy, but subfertile females due to defective sperm-egg interaction revolutionized the fertility field. CD81 is 45% identical to CD9 and Cd81 knockout mouse also presents defects in female fertility. Cd9−/−/Cd81−/− female mice are completely infertile. We found that, in local network, APP, TSPAN6, ITGB1, ITGB5, GPNMB, LYPD3, SSPN, CD81 and CD99 were in the same module (module 1). However, in extended network, APP, TSPAN6, GPNMB, LYPD3, SSPN, and ITGB5 were in module 2, whereas ITGB1, CD81, CD9 and CD99 were well connected in module 9 in which 20.3% of the proteins share the biological function – cell adhesion (Figure 3b).
We also identified tissue-specific APP interacting proteins which can lead to the identification of pathways for the APP family members associated with testis and sperm functions. TMPRSS12, a transmembrane serine protease, was identified in the YTH screen and was reported in the Unigene as testicular/spermatozoa restricted. TMPRSS12 belongs to the same module from network community as APP. Since this protein is connected to APP only, it cannot have any route to the main network without APP. Sperm-surface proteases were already shown to be required for fertilization . There is also evidence for the participation of serine proteolytic activities during spermatogenesis and sperm maturation . However, most of the specific proteases that are involved in these processes are unknown. The exact localization of TMPRSS12 at sperm membrane has to be determined.
The present work provided the first report on APP interactome in human testis. We identified several novel APP interactions in human testis and incorporated YTH data and PPI databases to construct the PPI network of APP in human testis and spermatozoa. The protein interaction network allowed the recognition of proteins complexes and modules crucial for several biological functions, such as cell adhesion.
Human testis library screening by Yeast Two-Hybrid
The APP cDNA was directionally subcloned into EcoRI/BamHI digested pAS2-1 (GAL4 binding domain expression vector) to produce pAS-APP. This expression vector was first used to confirm the expression of the resulting fusion proteins (GAL4-APP) in yeast strain AH109. For library screening, the yeast strain AH109 transformed with pAS-APP, was mated with yeast strain Y187 expressing the human testis cDNA library in the pACT-2 vector (Gal4 activation domain expression vector). Half the mating mixture was plated onto high stringency medium (quadruple dropout medium (QDO): SD/-Ade/-His/-Leu/-Trp) and the other half onto low stringency medium (triple dropout medium (TDO): SD/-His/- Leu/-Trp), and the plates were incubated at 30°C. Colonies obtained in the low stringency plates were replica plated onto high stringency medium. Finally, all high stringency surviving colonies were plated onto selective medium containing X-α-Gal and incubated at 30°C to check for MEL-1 expression (indicated by the appearance of a blue colour) . All the YTH reagents were purchased from Enzifarma, Clontech, Portugal. All other nonspecified reagents were purchased from Sigma-Aldrich, Portugal. This study did not required ethics approval since the material used was purchased for Enzifarma, Clontech, Portugal (human testis cDNA library which contained cDNAs already inserted in pACT-2 vector.
Recovery of plasmids from yeast and sequence analysis
Yeast plasmid DNA was recovered and used to transform E. coli XL1-Blue. Plasmid DNA was obtained from each resulting bacterial colony and digested with the restriction enzyme HindIII (NEB, Ipswich, USA) to identify the corresponding library plasmids. DNA sequence analysis was performed using an Automated DNA Sequencer (Applied Biosystems, Carlsbad, USA) using the GAL4-AD primer - TACCACTACAATGGATG (Enzifarma, Clontech, Portugal). The DNA sequences obtained were compared to the GenBank DB, using the BLAST algorithm, to identify the corresponding encoded proteins.
Data mining of APP and APLP2 interacting proteins from public DB and published interactome
Several data sources were used to human protein-protein interaction data retrieval. First, we collected all interaction data of human proteins from: APID , BioGRID , DIP , HPRD , InnateDB , Intact , MINT , Reactome , TopFind , and STRING . The interaction search was restricted to Homo sapiens (Human, 9606) protein pairs. Then, only the interactions defined as “association (MI:0941)” under the interaction type categories (http://www.ebi.ac.uk/ontology-lookup/) and “experimental interaction detection (MI:0045)” from STRING were extracted (See Additional file 3: Table S3). Next, we unified protein names based on the Uniprot ID and gene symbols in order to prevent abundant interactions caused by the different notations for the same gene between DBs. We removed proteins which have unreviewed, no gene symbols or Uniprot IDs, and removed (obsolete) genes from the Uniprot database up to the date of our data-mining and their interactors from our PPI list. We also included the interacting proteins from the published APP interactome  and the YTH experiment (37 proteins). Finally, 248,714 interactions between 15,189 proteins were obtained in total.
Identification of testis/sperm specific proteins in public DBs and publish human proteomes
From the large PPI data obtained in the previous data-mining, we narrowed down the candidate proteins into the testis/sperm enriched proteins. We first used three distinct data sources to select the proteins associated with the testis: UniProt , UniGene and the Human Protein Reference Database (HPRD) . From UniGene expressed sequence tags (ESTs) from Homo sapiens were used as source of gene expression data. Among our previous interaction data set, we chose all proteins associated with the keywords “testis” and/or “sperm” in the description of tissue-specificity. Then, we kept the interactions, if both proteins in a pair of interaction were associated to testis/sperm. We got totally 155,457 interactions among 12,884 proteins in this procedure.
Second, tissue-expression DBs (C-It, TiGER, UniGene, BioGPS, VeryGene and HPA) were used in order to identify the interactors characterized as highly specific or strongly expressed in testis/sperm. The C-it DB was queried with the keywords, 'testis-enriched' for 'Human'. The limitation factors for the literature information were 5 for PubMed and 3 for MeSH terms. Proteins with a SymAtals z-score higher than |1.96| were chosen. The TiGER and the Very Gene database were also searched for 'Testis' in 'Tissue View' category. In HPA (Human Protein Atlas), proteins listed within the fields of high or medium HPA evidence and annotated protein expression based on IHC staining patterns in normal male reproductive tissues were selected. Also, the BioGPS was used to find testis/sperm restricted proteins with the keywords, 'testis, sperm, epididymis, spermatid, spermatogonia, spermatozoa, spermatocyte' in 'Human'. Using the plugin 'Gene expression/activity chart', the proteins with highly/strongly expressed in testis were selected. If the expression level was less than mean value or the data were not shown, those proteins were removed from the list. Also the proteins with high correlation level of expression (≥0.9) with testis-specific proteins, such as ACRV1, AKAP4, BRDT, PGK2, TSGA10, and TSPY8 were selected. From this search, 1,949 testis/sperm enriched proteins were obtained.
Development of protein-protein network and network properties analysis
From testis/sperm enriched proteins, APP/APLP2 interactors were selected. Functional relationships can be neglected when considering only tissue-enriched/specific proteins. This challenge was addressed by integrating tissue-enriched/specific APP/APLP2 interactors with its interacting proteins regardless whether they were enriched or not. Using the breath-first searching (BFS) algorithm, direct connectors with APP and the interactions between those proteins were kept. The same procedure was applied to APLP2. Since APLP2 is the nearest neighbor of APP, we combined two sub-networks and analyzed several network properties. We extended this local network to the second order neighbors of APP in order to see the wide relations around APP and APLP2. All YTH data were included in this process. Finally, 1,803 interactions between 457 proteins were obtained for the local APP/APLP2 network and 17,188 interactions between 2,733 proteins for the extended APP/APLP2 network.
Bioinformatic analyses: gene ontology, pathways and involvement in diseases
The interactome was analyzed using the Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 . The UniProt  identifiers (UniProt_ID) for the proteins were entered into the DAVID functional annotation program. Overall, the proteins were analyzed for gene ontologies and pathways using the Homo sapiens genome-wide genes with at least one annotation in the analyzing categories as background. Proteins associated with defects in male fertility, or a functional or morphological defect in the epididymis, testis, or sperm were obtained from the Jackson Laboratories mouse knockout database (http://www.informatics.jax.org/). To obtain a list of the genes that have a male infertility phenotype, we also queried the OMIM , Phenopedia  and the Uniprot database  with the term “male infertility”, and then downloaded the associated genes.
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files.
Fardilha M, Vieira SI, Barros A, Sousa M, Da Cruz e Silva OAB, Da Cruz e Silva EF: Differential distribution of Alzheimer's amyloid precursor protein family variants in human sperm. Ann N Y Acad Sci 2007, 1096:196–206.
Tanzi R, Gaston S, Bush A, Romano D, Pettingell W, Peppercorn J, Paradis M, Gurubhagavatula S, Jenkins B, Wasco W: Genetic heterogeneity of gene defects responsible for familial Alzheimer disease. Genetica 1993, 91:255–263.
Thinakaran G, Koo EH: Amyloid precursor protein trafficking, processing, and function. J Biol Chem 2008, 283:29615–29619.
Multhaup G: Identification and regulation of the high affinity binding site of the Alzheimer's disease amyloid protein precursor (APP) to glycosaminoglycans. Biochimie 1994, 76:304–311.
Shoji M, Kawarabayashi T, Harigaya Y, Yamaguchi H, Hirai S, Kamimura T, Sugiyama T: Alzheimer amyloid beta-protein precursor in sperm development. Am J Pathol 1990, 137:1027–1032.
von Koch CS, Zheng H, Chen H, Trumbauer M, Thinakaran G, van der Ploeg LHT, Price DL, Sisodia SS: Generation of APLP2 KO mice and early postnatal lethality in APLP2/APP double KO mice. Neurobiol Aging 1997, 18:661–669.
Zhuang D, Qiao Y, Zhang X, Miao S, Koide SS, Wang L: YWK-II protein/APLP2 in mouse gametes: potential role in fertilization. Mol Reprod Dev 2006, 73:61–67.
Huang P, Miao S, Fan H, Sheng Q, Yan Y, Wang L, Koide SS: Expression and characterization of the human YWK-II gene, encoding a sperm membrane protein related to the Alzheimer βA4-amyloid precursor protein. Mol Hum Reprod 2000, 6:1069–1078.
Perreau VM, Orchard S, Adlard PA, Bellingham SA, Cappai R, Ciccotosto GD, Cowie TF, Crouch PJ, Duce JA, Evin G, Faux NG, Hill AF, Hung YH, James SA, Li QX, Mok SS, Tew DJ, White AR, Bush AI, Hermjakob H, Masters CL: A domain level interaction network of amyloid precursor protein and Aβ of Alzheimer's disease. PROTEOMICS – Clin Appl 2010, 4:851–851.
Fardilha M, Esteves SL, Korrodi-Gregorio L, Vintem AP, Domingues SC, Rebelo S, Morrice N, Cohen PT, da Cruz e Silva OA, da Cruz e Silva EF: Identification of the human testis protein phosphatase 1 interactome. Biochem Pharmacol 2011, 82:1403–1415.
Esteves SL, Korrodi-Gregorio L, Cotrim CZ, van Kleeff PJ, Domingues SC, da Cruz ESOA, Fardilha M, da Cruz ESEF: Protein Phosphatase 1gamma Isoforms Linked Interactions in the Brain. J Mol Neurosci 2012, 19:19.
Esteves SL, Domingues SC, da Cruz e Silva OA, Fardilha M, da Cruz e Silva EF: Protein phosphatase 1alpha interacting proteins in the human brain. Omics 2012, 16:3–17.
Fields S: Interactive learning: lessons from two hybrids over two decades. Proteomics 2009, 9:5209–5213.
Hamdi A, Colas P: Yeast two-hybrid methods and their applications in drug discovery. Trends Pharmacol Sci 2012, 33:109–118.
Gellert P, Jenniches K, Braun T, Uchida S: C-It: a knowledge database for tissue-enriched genes. Bioinformatics 2010, 26:2328–2333.
Liu X, Yu X, Zack DJ, Zhu H, Qian J: TiGER: a database for tissue-specific gene expression and regulation. BMC Bioinformatics 2008, 9:271.
Yang X, Ye Y, Wang G, Huang H, Yu D, Liang S: VeryGene: linking tissue-specific genes to diseases, drugs, and beyond for knowledge discovery. Physiol Genomics 2011, 43:457–460.
Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S, Wernerus H, Björling L, Ponten F: Towards a knowledge-based human protein atlas. Nat Biotechnol 2010, 28:1248–1250.
Eppig JT, Blake JA, Bult CJ, Kadin JA, Richardson JE: The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse. Nucleic Acids Res 2012, 40:D881–D886.
Yu W, Clyne M, Khoury MJ, Gwinn M: Phenopedia and Genopedia: disease-centered and gene-centered views of the evolving knowledge of human genetic associations. Bioinformatics 2010, 26:145–146.
Prieto C, De Las RJ: APID: Agile Protein Interaction DataAnalyzer. Nucleic Acids Res 2006, 34:W298–W302.
Chatr-Aryamontri A, Breitkreutz BJ, Heinicke S, Boucher L, Winter A, Stark C, Nixon J, Ramage L, Kolas N, O'Donnell L, Reguly T, Breitkreutz A, Sellam A, Chen D, Chang C, Rust J, Livstone M, Oughtred R, Dolinski K, Tyers M: The BioGRID interaction database: 2013 update. Nucleic Acids Res 2012, 30:30.
Xenarios I, Salwinski L, Duan XJ, Higney P, Kim SM, Eisenberg D: DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res 2002, 30:303–305.
Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A: Human protein reference database–2009 update. Nucleic Acids Res 2009, 37:6.
Lynn DJ, Winsor GL, Chan C, Richard N, Laird MR, Barsky A, Gardy JL, Roche FM, Chan TH, Shah N, Lo R, Naseer M, Que J, Yau M, Acab M, Tulpan D, Whiteside MD, Chikatamarla A, Mah B, Munzner T, Hokamp K, Hancock RE, Brinkman FS: InnateDB: facilitating systems-level analyses of the mammalian innate immune response. Mol Syst Biol 2008, 4:218.
Kerrien S, Aranda B, Breuza L, Bridge A, Broackes-Carter F, Chen C, Duesbury M, Dumousseau M, Feuermann M, Hinz U, Jandrasits C, Jimenez RC, Khadake J, Mahadevan U, Masson P, Pedruzzi I, Pfeiffenberger E, Porras P, Raghunath A, Roechert B, Orchard S, Hermjakob H: The IntAct molecular interaction database in 2012. Nucleic Acids Res 2012, 40:D841–D846.
Licata L, Briganti L, Peluso D, Perfetto L, Iannuccelli M, Galeota E, Sacco F, Palma A, Nardozza AP, Santonico E, Castagnoli L, Cesareni G: MINT, the molecular interaction database: 2012 update. Nucleic Acids Research 2011.
Joshi-Tope G, Gillespie M, Vastrik I, D'Eustachio P, Schmidt E, de Bono B, Jassal B, Gopinath GR, Wu GR, Matthews L, Lewis S, Birney E, Stein L: Reactome: a knowledgebase of biological pathways. Nucleic Acids Res 2005, 33:D428–D432.
Lange PF, Huesgen PF, Overall CM: TopFIND 2.0–linking protein termini with proteolytic processing and modifications altering protein function. Nucleic Acids Res 2012, 40:D351–D361.
Snel B, Lehmann G, Bork P, Huynen MA: STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. Nucleic Acids Res 2000, 28:3442–3444.
Consortium TU: Reorganizing the protein space at the Universal Protein Resource (UniProt). Nucleic Acids Res 2012, 40:D71–D75.
Wu C, Orozco C, Boyer J, Leglise M, Goodale J, Batalov S, Hodge CL, Haase J, Janes J, Huss JW 3rd, Su AI: BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome Biol 2009, 10:R130.
Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U S A 2004, 101:6062–6067.
Baker MA, Reeves G, Hetherington L, Muller J, Baur I, Aitken RJ: Identification of gene products present in Triton X-100 soluble and insoluble fractions of human spermatozoa lysates using LC-MS/MS analysis. Proteomics Clin Appl 2007, 1:524–532.
Baker MA, Aitken RJ: Proteomic insights into spermatozoa: critiques, comments and concerns. Expert Rev Proteomics 2009, 6:691–705.
Li J, Liu F, Wang H, Liu X, Liu J, Li N, Wan F, Wang W, Zhang C, Jin S, Liu J, Zhu P, Liu Y: Systematic mapping and functional analysis of a family of human epididymal secretory sperm-located proteins. Mol Cell Proteomics 2010, 9:2517–2528.
Li J, Liu F, Liu X, Liu J, Zhu P, Wan F, Jin S, Wang W, Li N, Wang H: Mapping of the human testicular proteome and its relationship with that of the epididymis and spermatozoa. Mol Cell Proteomics 2011, 10:22.
Thimon V, Frenette G, Saez F, Thabet M, Sullivan R: Protein composition of human epididymosomes collected during surgical vasectomy reversal: a proteomic and genomic approach. Hum Reprod 2008, 23:1698–1707.
Schultz N, Hamra FK, Garbers DL: A multitude of genes expressed solely in meiotic or postmeiotic spermatogenic cells offers a myriad of contraceptive targets. Proc Natl Acad Sci U S A 2003, 100:12201–12206.
Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabasi AL: The human disease network. Proc Natl Acad Sci U S A 2007, 104:8685–8690.
Guan Y, Myers CL, Lu R, Lemischka IR, Bult CJ, Troyanskaya OG: A genomewide functional network for the laboratory mouse. PLoS Comput Biol 2008, 4:e1000165.
Joy MP, Brock A, Ingber DE, Huang S: High-betweenness proteins in the yeast protein interaction network. J Biomed Biotechnol 2005, 2005:96–103.
da Huang W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009, 4:44–57.
Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M: KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res 2012, 40:10.
Okamoto T, Takeda S, Murayama Y, Ogata E, Nishimoto I: Ligand-dependent G protein coupling function of amyloid transmembrane precursor. J Biol Chem 1995, 270:4205–4208.
Brouillet E, Trembleau A, Galanaud D, Volovitch M, Bouillot C, Valenza C, Prochiantz A, Allinquant B: The amyloid precursor protein interacts with Go heterotrimeric protein within a cell compartment specialized in signal transduction. J Neurosci 1999, 19:1717–1727.
Deyts C, Vetrivel KS, Das S, Shepherd YM, Dupre DJ, Thinakaran G, Parent AT: Novel GalphaS-protein signaling associated with membrane-tethered amyloid precursor protein intracellular domain. J Neurosci 2012, 32:1714–1729.
Etkovitz N, Tirosh Y, Chazan R, Jaldety Y, Daniel L, Rubinstein S, Breitbart H: Bovine sperm acrosome reaction induced by G-protein-coupled receptor agonists is mediated by epidermal growth factor receptor transactivation. Dev Biol 2009, 334:447–457.
Ward CR, Storey BT, Kopf GS: Selective activation of Gi1 and Gi2 in mouse sperm by the zona pellucida, the egg's extracellular matrix. J Biol Chem 1994, 269:13254–13258.
Tian X, Sha Y, Zhang S, Chen Y, Miao S, Wang L, Koide S: Extracellular domain of YWK-II, a human sperm transmembrane protein, interacts with rat Mullerian-inhibiting substance. Reproduction 2001, 121:873–880.
Yin X, Ouyang S, Xu W, Zhang X, Fok KL, Wong HY, Zhang J, Qiu X, Miao S, Chan HC, Wang L: YWK-II protein as a novel G(o)-coupled receptor for Mullerian inhibiting substance in cell survival. J Cell Sci 2007, 120:1521–1528.
Tang BL, Low DY, Hong W: Hsec22c: a homolog of yeast Sec22p and mammalian rsec22a and msec22b/ERS-24. Biochem Biophys Res Commun 1998, 243:885–891.
Berruti G, Paiardi C: Acrosome biogenesis: Revisiting old questions to yield new insights. Spermatogenesis 2011, 1:95–98.
Young-Pearse TL, Chen AC, Chang R, Marquez C, Selkoe DJ: Secreted APP regulates the function of full-length APP in neurite outgrowth through interaction with integrin beta1. Neural Dev 2008, 3:1749–8104.
Miyamoto H, Chang MC: Effects of protease inhibitors on the fertilizing capacity of hamster spermatozoa. Biol Reprod 1973, 9:533–537.
Phelps BM, Koppel DE, Primakoff P, Myles DG: Evidence that proteolysis of the surface is an initial step in the mechanism of formation of sperm cell surface domains. J Cell Biol 1990, 111:1839–1847.
This work was supported by “FCT – Fundação para a Ciência e Tecnologia (PTDC/DTP-PIC/0460/2012) and cofinanced by FEDER through “Eixo I do Programa Operacional Fatores de Competitividade (POFC) do QREN” (COMPETE: FCOMP-01-0124-FEDER-028692).
This work was also supported by grants from FCT of the Portuguese Ministry of Science and Higher Education to JVS (SFRH/BD/81458/2011), SD (SFRH/BD/21559/2005). SY and AVG were funded with individual research grants “Investigador mais centro” from the project “New Strategies Applied to Neuropathological Disorders” (CENTRO-07-ST24-FEDER-002034). This project also had the contribution of the project APOPIS (Abnormal Protein in the Pathogenesis of Neurodegenerative Disorders - EU Project PL503330).
The authors declare that they have no competing interests.
JVS participated in the design of the methodological approach to data analyses, contributed in YTH data analyses, data retrieval and bioinformatic analyses and wrote the manuscript. SY carried out the data retrieval and network analyses and wrote the manuscript. SD carried out the construction of the plasmid pASAPP and analyzed some positive clones of the YTH screen. SG analyzed some positive clones of the YTH screen. AVG participated in network analyses and final planning of the methodological approach to data analyses. EFCS and OABCS were involved in planning the YTH screen and initial methodological approach to analyze the YTH positive clones. JFFM was involved in manuscript revision and final planning of the methodological approach to data analyses. MF performed the YTH screen and analysis of some positives clones of the YTH screen, final planning of the methodological approach to data analyses and manuscript revision. All authors read and approved the final manuscript.
Edgar Figueiredo da Cruz e Silva is deceased.
Joana Vieira Silva and Sooyeon Yoon contributed equally to this work
Enriched GO categories of the APP interactors identified by YTH. Enriched categories are identified as those with p<0.05.
Distribution of APP interacting proteins in human testicular, epididymal and sperm proteomes, and their overlap. The human epididymis proteome includes both epididymal tissue and fluid proteomes. The secretory vesicular (epididymosome) part of the human epididymosome proteome was also considered. In bold are the APP interactors identified in the YTH screen.
Statistics of collected protein-protein interaction data.
Topological properties of network.
(a). Rank of topological properties of 457 proteins in local APP/APLP2 network.
(a). Top 1000 rank of topological properties of proteins in extended APP/APLP2 network. Note that proteins with clustering coefficient 0 are neglected.
Modular structures of poteins sharing APP-APLP2 interaction. Four common proteins interacting with APP and APLP2 form triangle or square modules (indicated by green links) which shows that they can be highly possible functional modules. Blue nodes are testis-specific and orange node (APLP2) is not.