Integration of shot-gun proteomics and bioinformatics analysis to explore plant hormone responses
© Zhang et al.; licensee BioMed Central Ltd. 2012
Published: 11 September 2012
Skip to main content
© Zhang et al.; licensee BioMed Central Ltd. 2012
Published: 11 September 2012
Multidimensional protein identification technology (MudPIT)-based shot-gun proteomics has been proven to be an effective platform for functional proteomics. In particular, the various sample preparation methods and bioinformatics tools can be integrated to improve the proteomics platform for applications like target organelle proteomics. We have recently integrated a rapid sample preparation method and bioinformatics classification system for comparative analysis of plant responses to two plant hormones, zeatin and brassinosteroid (BR). These hormones belong to two distinct classes of plant growth regulators, yet both can promote cell elongation and growth. An understanding of the differences and the cross-talk between the two types of hormone responses will allow us to better understand the molecular mechanisms and to identify new candidate genes for plant engineering.
As compared to traditional organelle proteomics, the organelle-enrichment method both simplifies the sample preparation and increases the number of proteins identified in the targeted organelle as well as the entire sample. Both zeatin and BR induce dramatic changes in signaling and metabolism. Their shared-regulated protein components indicate that both hormones may down-regulate some key components in auxin responses. However, they have shown distinct induction and suppression of metabolic pathways in mitochondria and chloroplast. For zeatin, the metabolic pathways in sucrose and starch biosynthesis and utilization were significantly changed, yet the lipid biosynthesis remained unchanged. For BR, lipid biosynthesis and β-oxidation were both down-regulated, yet the changes in sucrose and starch metabolism were minor.
We present a rapid sample preparation method and bioinformatics classification for effective proteomics analysis of plant hormone responses. The study highlighted the largely differing response to zeatin and brassinosteroid by the metabolic pathways in chloroplast and mitochondria.
Proteomics can directly address many biological questions by revealing the abundance of certain proteins within organisms. Traditionally, two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) was the golden standard for proteomics analysis, yet the platform is limited by both protein identification and quantification capacities. The recent advances in mass spectrometry instrumentation, separation methods, data acquisition and analysis tools have enabled use of the so-called 'shot-gun' proteomics. It uses tandem mass spectrometry and the multidimensional protein identification technology (MudPIT) . In the MudPIT platform, the whole proteome is directly digested with protease, and the resulting peptides are subjected to multidimensional chromatography separation. The separated peptides are then analyzed online by mass spectrometry. The so called MudPIT platform eliminates the tedious gel separation and has been broadly applied in plant biology studies [2, 3]. Even though the platform has superior performance as compared to 2-D gel platforms, limitations still exist for several reasons. First, profiling the whole proteome is complicated by the complexity of the protein sample, the number of proteins expressed, the differing molecular weights, and other variations in chemical and physical characteristics[4, 5]. Also, many functional proteins such as GTPases, kinases and phosphatases exist in low abundance. Their signals can be easily masked by highly abundant proteins such as ribulose 1, 5-bisphosphate carboxylase/oxygenase (Rubisco) . These challenges can be addressed by improving sample preparation methods, bioinformatics analysis, sample processing, and mass spectrometry instrumentation. We hereby present the integration of a rapid sample preparation method with bioinformatics analysis to achieve better peptide identification and focused study of chloroplast and mitochondrial proteins.
We are particularly interested in chloroplast and mitochondria because the two organelles are important for energy metabolism and plant growth, among many other functions. In particular, the proteome dynamics of these two organelles in response to growth relevant hormones like auxin, cytokine, and brassinosteroid will shed light onto the mechanisms for plant hormone responses. It will also identify candidate genes for improving crop seed and biomass yield for food, fiber and energy usages.
Traditionally, in order to identify proteins in a particular type of organelle, the organelle is separated by gradient density centrifugation and ultra-centrifuged from a large quantity of initial samples [6, 7]. Proteomics studies toward specific organelles have been done on nuclei, mitochondria, chloroplasts, Golgi apparatuses, and endoplasmic reticulum, etc. [5, 8]. For example, Dunkley and colleagues used localization of organelle proteins by isotope tagging (LOPIT) to simultaneously localize 527 proteins out of 689 proteins identified in several organelles of Arabidopsis . Most of the traditional organelle purification involves time-consuming and tedious separation steps, which could introduce extra errors . We hereby simplified the traditional method of organelle separation by implementing a rapid centrifugation step. The rapid sample preparation method integrated with bioinformatics classification was evaluated as an alternative to study mitochondria and chloroplast proteomics in plant's responses to growth hormones.
The proteomics analysis of hormone responses is part of our long-term efforts to identify important genes involved in plant biomass increases for bioenergy purposes. Several plant hormones such as auxin and gibberellic acid can promote plant growth and are known to be able to increase plant biomass accumulation through different mechanisms. Among these plant hormones, zeatin and BR are of particular interest to us. Zeatin is a plant hormone belonging to cytokinins and regulates plant development and growth. Zeatin has been widely applied in agriculture to increase fruit or seed size and is well known to promote cell elongation and root development . Interestingly, BR is also known to be able to promote plant growth through cell elongation . Even though both plant hormones can promote cell elongation and growth, the underlying mechanisms are believed to be widely different; the hormone signaling pathways for the two are unique to one another. However, very few studies have focused on studying the differences and cross-talk between the responses in the two hormones at the proteome level. We hereby utilized the aforementioned platform to explore the proteome responses of Arabidopsis in response to treatment by the two plant hormones. The metabolic pathways in chloroplasts and mitochondria are of the particular interest.
Overall, in this article, we have integrated a simple sample preparation method with bioinformatics classification to analyze plant responses to zeatin and BR. The new method has been shown to improve protein identification, in particular in mitochondria. Using this platform, we have revealed that both zeatin and BR induce significant changes in signaling and metabolism. The shared regulated protein components indicated that both hormones may down-regulate some components in auxin responses. However, the two plant hormones have shown distinctive induction and suppression of metabolic pathways. For zeatin, the metabolic pathways in sucrose and starch biosynthesis and utilization were significantly up-regulated, yet the lipid biosynthesis remained unchanged. For BR, the lipid biosynthesis and β-oxidation were both down-regulated, yet the changes in sucrose and starch metabolism are minor. These differences highlight the different molecular and metabolic mechanisms for response to zeatin and BR. The data can help us to design better strategies to promote plant biomass accumulations.
Arabidopsis thaliana ecotype Col-0 was used. Seeds were stratified at 4°C to synchronize germination for 2 days and then grown at 23°C/19°C under a 12 h/12 h light/dark cycle for 4 weeks.
Zeatin (Sigma-Aldrich, St. Louis, MO) and 24-epibrassinolide (PhytoTechnology Laboratories, Shawnee Mission, KS) were sprayed at 100 μm and 0.5 mg/L, respectively. 0.8% methanol solution was sprayed as mock. The aerial parts of plant were collected at 24 hours after the spray.
For the total protein isolation, a plant total protein extraction kit (Sigma-Aldrich, St.Louis, MO) was used, and the entire procedure followed the manufacturer's manual. 150 mg of aerial tissue from Arabidopsis was collected and ground in liquid nitrogen to a fine powder. Pre-cooled methanol solution with protease inhibitor was added to the powder and vortexed for 30 seconds. The mixture was incubated at -20°C then centrifuged at 16,000 × g for 5 minutes at 4°C. Supernatant was removed and the pellet was washed by methanol solution for two more times. The resulting pellet was washed by pre-cooled acetone and centrifuged at 16,000 × g for 5 minutes at 4°C. SpeedVac was used to remove residual acetone and Reagent Type 4 Working Solution provided by the kit was used to incubate the pellet for 15 minutes at room temperature. The pellet was then centrifuged at 16,000 × g for 30 minutes, and supernatant was collected and stored at -80°C for future proteomics use (See Additional File 1).
The organelle enrichment procedure was developed based on the method from Santoni  with some modification. 5 g of fresh aerial tissue of Arabidopsis was collected and washed by ice-cold water to remove the soil. A blender was used to disrupt the tissue after adding a 2:1 (mL medium/g fresh weight) homogenization buffer (50 mM TRIZMA base, 500 mM Sucrose, 10% Glycerol, 20 mM EDTA-Na2, 20 mM EGTA, 50 mM NaF, 5 mM beta-glycerophosphate, 1 mM phenantroline, 0.6% PVP40, 10 mM ascorbic acid, 1 mM leupeptin, 5 mM DTT, 1 mM Naorthovanadate, pH 8.0 adjusted by MES). The homogenate was then filtered through Miracloth to remove plant debris. Centrifugation of filtered homogenate was conducted at 1,000 × g for 5 minutes to remove the nuclei. The supernatant was then centrifuged at 26,000 × g for 25 minutes to pellet organelles.
For protein isolation of enriched organelles, pre-cooled methanol with protease inhibitor was added to the organelle-enriched pellet, which was collected after the centrifugation described in the Organelle enrichment section. The sample was then vortexed for 30 seconds. The mixture was incubated at -20°C then centrifuged at 16,000 × g for 5 minutes at 4°C. The supernatant was removed and the pellet was washed twice by methanol solution. The resulting pellet was again washed by pre-cooled acetone and centrifuged at 16,000 × g for 5 minutes at 4°C. Residual acetone was removed by SpeedVac, and Reagent Type 4 Working Solution was used to incubate the pellet for 15 minutes at room temperature. The pellet was then centrifuged at 16,000 × g for 30 minutes, and supernatant was collected and stored at -80°C for future proteomics use (See Additional File 1A).
MudPIT-based shot-gun proteomics was carried out to analyze each sample. Approximately 100 μg of protein was digested by Trypsin Gold, Mass Spectrometry Grade (Promega, WI, USA) with 1:40 w/w at 37°C for 24 h. The digested peptides were desalted using a Sep-Pak plus C18 column (Waters Limited, ON, Canada) and then loaded onto a biphasic (strong cation exchange/reversed phase) capillary column using a pressure tank. The 2D back column was composed of 5 cm of C18 reverse phase resin and 3 cm of strong cation exchange (SCX) resin. The back column was then connected to a 15-cm-long 100 um-ID C18 column (packed in house with the same C18 reverse phase in the back column) and sprayed through a SilicaTip (New objective, Inc, Woburn, MA). The two-dimensional liquid chromatography separation and tandem mass spectrometry conditions followed the protocols previously described by Washburn et al. . Before SCX separation, a 1 h RP gradient from 100% Solvent A (95% H2O, 5% ACN, and 0.1% formic acid) to 100% Solvent B (30% H2O, 70% ACN, and 0.1% formic acid) was configured to move peptides from C18 resin to SCX resin in the back column. The SCX LC separation was performed with eleven salt pulses containing increasing concentrations of ammonium acetate. Each salt pulse was followed by a 2 h reverse phase gradient from 100% Solvent A to 60% Solvent B. The LC eluent was directly nanosprayed into a linear ion trap mass spectrometer, Finnigan LTQ (Thermo Fisher Scientific, San Jose, CA). The mass spectrometer was set to the data-dependent data acquisition mode, and full mass spectra were recorded on the peptides over a 300-1700 m/z range, followed by five tandem mass (MS/MS) events for the most abundant ions from the first MS analysis. The Xcalibur data system (Thermo Fisher Scientific, San Jose, CA) was used to control the LC-LTQ system and collect the data.
Tandem mass spectra were extracted from the raw files and converted into the MS2 file. The MS2 file was searched against the Arabidopsis protein database downloaded from The Arabidopsis Information Resource (TAIR); it contains reverse sequence and common contaminant proteins. A ProLuCID algorithm was used to search for data using the Texas A&M Supercomputing Facility. The validity of peptide/spectrum matches was assessed in DTASelect2.0 using a 0.05 false discovery cutoff, with a cross-correlation score (XCorr) that's larger than 1, and normalized difference in cross-correlation scores (DeltaCN) larger than 0.08. Proteins with more than two peptides were identified as detected and were recorded.
PatternLab  software is used for data analysis to discover differentially expressed proteins. The cutoff of p-value and Fold-change is 0.05 and 2.0 respectively. Gene ontology annotations for proteins were performed by VirtualPlant . The pathway analysis of proteins differentially expressed was analyzed by Aracyc http://www.arabidopsis.org/biocyc/. Cluster analysis was carried out by MeV.
A python package was developed to parse proteins based on their GO keywords (See Additional File 1B and Additional File 2). The report containing differentially expressed proteins searched each protein ID against GO Slim, which can be downloaded from ftp://ftp.arabidopsis.org/home/tair/Ontologies/Gene_Ontology/. If the annotation of the protein matches the keyword set, the ID will be output to a text file and the number of matched protein ID will be displayed.
Improved protein identification using the organelle enrichment method (OEM) as compared traditional method (TM)
Percentage of mitochondrial proteins (%)
TM sample 1
TM sample 2
TM sample 3
The pair-wise student's t-test
We further processed the proteomics data with protein classification software. The analysis indicated the organelle enrichment method has identified over 30% more mitochondrial proteins, even though the chloroplast protein identification didn't change significantly. As compared to the traditional method, the organelle enrichment method led to a greater percentage of mitochondrial protein identified (Table 1). This suggests our protocol has effectively enriched the proteins in mitochondria for the research purpose. Overall, with this simplified sample preparation method, we successfully enriched mitochondrial protein and identified more proteins that are involved in energy metabolism. The integration of bioinformatics classification allowed us to focus more on mitochondrial pathways. We therefore used this method to explore the proteome dynamics during plant hormone responses.
Shared differentially expressed proteins between zeatin and BR treated samples
Fold change (zeatin)
Fold change (BR)
NAD-dependent epimerase/dehydratase family protein
co-chaperone grpE family protein
plastid developmental protein DAG, putative
DEAD/DEAH box helicase, putative
cytochrome b5, putative
eukaryotic translation initiation factor 3 subunit 7
serine carboxypeptidase S10 family protein
5'-adenylylsulfate reductase 2
protein kinase family protein
cytidine/deoxycytidylate deaminase family protein
protoporphyrinogen oxidase, putative
transducin family protein/WD-40 repeat family protein
DC1 domain-containing protein
tetrahydrofolate dehydrogenase/cyclohydrolase, putative
5-methylthioribose kinase family SEC14 cytosolic factor family
protein/phosphoglyceride transfer family protein
auxin-binding protein 1 (ABP1)
translation initiation factor-related
triose phosphate/phosphate translocator, putative
2-oxoglutarate-dependent dioxygenase, putative
WD-40 repeat protein
UDP-glucose transferase (UGT75B2)
hydroxyproline-rich glycoprotein family protein
epsin N-terminal homology (ENTH) domain- containing protein
beta-galactosidase, putative/lactase, putative
chaperone protein dnaJ-related
disease resistance family protein
merozoite surface protein-related
SCPL34, similar to serine carboxypeptidase S10 family
biotin/lipoyl attachment domain-containing protein
C2 domain-containing protein
jacalin lectin family protein
NAD-dependent epimerase/dehydratase family protein
immunophilin/FKBP-type peptidyl-prolyl cis- trans isomerase family protein
plastid-lipid associated protein PAP/fibrillin family protein
oligosaccharyl transferase STT3 subunit family protein
dormancy/auxin associated family protein
S2P-like putative metalloprotease
sulfate adenylyltransferase 4/ATP-sulfurylase 4 (APS4)
ATP-dependent Clp protease ATP-binding subunit ClpX, putative
NAD-dependent epimerase/dehydratase family protein
hydrolase, alpha/beta fold family protein
histone H2B, putative
Both zeatin and BR are relevant to plant growth regulation, cell elongation, and energy metabolism. We therefore carried out pathway analysis using AraCyc to investigate if both hormone treatments promote the plant and cellular growth with the same metabolic pathway or not. The pathway analysis revealed distinctive patterns.
Besides the down-regulation of fatty acid biosynthesis, the pathways for utilization and oxidation of fatty acid were also down-regulated in BR treated samples. Four down-regulated gene products (AT1G76150, AT3G06860, AT5G65110 and AT5G27600) play distinct roles in fatty acid β-oxidation pathway. AT1G76150 degrades even cis-unsaturated fatty acids. AT5G65110 encodes an acyl-CoA oxidase for fatty acid oxidation. AT5G27600 involves in oxidation of very long chain fatty acid in peroxisomes.
Even though lipid metabolisms were significantly changed in response to BR treatment, sucrose and starch metabolisms seem to be changed more by zeatin treatments. Two gene products in the starch biosynthesis pathway were up-regulated in the zeatin-treated sample. These two proteins are AT5G48300, a small subunit of ADP-glucose pyrophosphorylase and AT5G03650, starch branching enzyme. Meanwhile, proteins were found up-regulated in both the starch degradation pathway I and pathway II. Among these proteins are AT2G40840, a crucial enzyme for starch to sucrose conversion; AT1G10760, α-glucan dikinase; and AT5G26570, chloroplastidic phosphoglucan water dikinase. However, in the BR treated sample, only ApL4, a large subunit of ADP-glucose pyrophosphorylase catalyzing the first rate limiting step in starch biosynthesis was found down-regulated in the starch biosynthesis pathway.
In addition to starch metabolism, zeatin treatment has induced three enzymes in sucrose biosynthesis and catabolism. These enzymes are AT5G20830 (sucrose synthase, Sus1), AT2G22480 (phosphofructokinase) and AT5G52920 (pyruvate kinase beta subunit). No proteins in the sucrose degradation pathway showed significant changes in the BR treated sample.
Different strategies can be used for proteomics analysis in organelles like mitochondria and chloroplast . The traditional approach is to isolate these organelles with multiple steps of gradient centrifugation and ultra-centrifugation. Proteins were further isolated from the organelles for proteomics analysis. The limitation of the strategy lies in the requirement of a large quantity of initial sample and the potential errors that could be introduced during the multiple step purification [22, 23]. We hereby adopted another strategy to combine a simple and rapid sample preparation method with bioinformatics classification. One simple centrifugation step was used to separate the mitochondria and chloroplast from other plant organelles. The separated mitochondria and chloroplast protein was then used for shot-gun proteomics and bioinformatics classification. The method has led to the enrichment of mitochondrial protein identification by 30%, and reduction of initial sample amount by more than 10-fold. We went ahead and utilized the method to study an important biological question in plant hormone responses.
Previous research, mainly utilizing 2D DIGE (Two-dimensional differential gel electrophoresis), was carried out to study the BR-treated Arabidopsis . The study has led to the discovery of 103 of differentially expressed proteins. As compared to the previous studies, more differentially proteins were identified in the presented study, demonstrating the effectiveness of the shot-gun proteomics platform and our sampling strategy. A total of 355 proteins have been identified in the zeatin treated samples and a total of 288 proteins were found to be differentially expressed in the BR treated samples. The deep coverage of differentially regulated proteins and focused study of energy-related pathways in mitochondria and chloroplasts allow us to have a global comparison of metabolic pathway regulations at the proteome level between the two types of hormone responses.
The study has revealed significantly differential regulation of metabolic pathways in zeatin and BR, in particular for pathways located in mitochondria and chloroplasts. Even though both zeatin and BR can promote cell elongation, the mechanisms are expected to be different. Our results highlighted that BR down-regulates key proteins in both fatty acid biosynthesis and oxidation. Fatty acid β-oxidation eventually breaks down the long-chain fatty acids and produces acetyl-CoA to enter TCA cycle . The fact that both fatty acid biosynthesis and catabolism are down-regulated indicates that BR may promote the cell elongation and growth through shutting down the energy storage through lipid biosynthesis. Interestingly, zeatin treated plants showed essentially no changes in these two pathways, indicating a completely different metabolic regulatory mechanism.
For zeatin treatment, some of the sucrose and starch biosynthesis proteins were up-regulated. Additionally, proteins involved in sucrose and starch degradation were also up-regulated. The use of sucrose is one way that plants transport energy; synthesized sucrose from photosynthetic tissues can be transported to other tissues and cells for utilization . The fact that both biosynthesis and degradation were up-regulated indicates the rapid metabolism of these energy source compounds. Interestingly, BR treatment only induces the down-regulation of one gene involved in starch biosynthesis.
The comparison of the two hormone responses indicated that the two types of plant hormones regulate cell elongation and growth through distinctive pathways. BR down-regulates key proteins in lipid metabolisms and energy storage, while zeatin up-regulates key proteins in sucrose and starch metabolisms for energy utilization. The future work can be developed to coordinate the expression of genes involved in the responses to two plant hormones to develop new ways for manipulating plant growth and development.
Multidimensional protein identification technology)
ribulose 1, 5-bisphosphate carboxylase/oxygenase
two-dimensional polyacrylamide gel electrophoresis
We appreciate Mr. Jay Jackson for extensive revision of the text. We appreciate Dr. Zhuo Xing for helpful suggestions toward the manuscript preparation. The research is supported by SouthCentral Sungrant (SC-SGP 2012), Texas Agrilife Bioenergy Research Initiative, and Texas Agrilife Start-up Fund for Joshua Yuan. The data search was carried out using Texas A&M Supercomputer Facility.
This article has been published as part of BMC Bioinformatics Volume 13 Supplement 15, 2012: Proceedings of the Ninth Annual MCBIOS Conference. Dealing with the Omics Data Deluge. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/13/S15
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.