PMTED: a plant microRNA target expression database
© Sun et al.; licensee BioMed Central Ltd. 2013
Received: 8 November 2012
Accepted: 30 May 2013
Published: 3 June 2013
MicroRNAs (miRNAs) are identified in nearly all plants where they play important roles in development and stress responses by target mRNA cleavage or translation repression. MiRNAs exert their functions by sequence complementation with target genes and hence their targets can be predicted using bioinformatics algorithms. In the past two decades, microarray technology has been employed to study genes involved in important biological processes such as biotic response, abiotic response, and specific tissues and developmental stages, many of which are miRNA targets. Despite their value in assisting research work for plant biologists, miRNA target genes are difficult to access without pre-processing and assistance of necessary analytical and visualization tools because they are embedded in a large body of microarray data that are scattered around in public databases.
Plant MiRNA Target Expression Database (PMTED) is designed to retrieve and analyze expression profiles of miRNA targets represented in the plethora of existing microarray data that are manually curated. It provides a Basic Information query function for miRNAs and their target sequences, gene ontology, and differential expression profiles. It also provides searching and browsing functions for a global Meta-network among species, bioprocesses, conditions, and miRNAs, meta-terms curated from well annotated microarray experiments. Networks are displayed through a Cytoscape Web-based graphical interface. In addition to conserved miRNAs, PMTED provides a target prediction portal for user-defined novel miRNAs and corresponding target expression profile retrieval. Hypotheses that are suggested by miRNA-target networks should provide starting points for further experimental validation.
PMTED exploits value-added microarray data to study the contextual significance of miRNA target genes and should assist functional investigation for both miRNAs and their targets. PMTED will be updated over time and is freely available for non-commercial use at http://pmted.agrinome.org.
KeywordsMiRNA Microarray Meta-analysis Stress response
MiRNAs are emerging key regulators in plant development and stress responses  and have been identified in a number of plants as demonstrated in miRBase . Plant genomes contain ~200 miRNAs, each with an average of 2-3 target genes, many of which are important transcription factors. Study of miRNA target gene expression patterns may provide novel clues about the functions of miRNA-target interactions, which may be already available in the large amount of existing microarray experiments. Microarray technology has been a major tool for genome wide transcriptome analysis and is extensively used in the past two decades. Thousands of experiments were performed and a humongous amount of data has been generated and deposited in databases such as ArrayExpress [3, 4] and Gene Expression Omnibus (GEO; ). These experiments study important biological processes such as biotic stress, abiotic stress, various tissues and developmental stages and involve many plant species, including important crops such as rice, maize, wheat and soybean.
The large spectrum of biological conditions and species covered by microarray experiments render them valuable information to be explored by bioinformatics efforts. For example, Pathogen, as a relational database for annotation of all identified signal transduction mechanisms during plant-pathogen interactions, has incorporated gene expression data from Arabidopsis thaliana microarray experiments enabling easy access to specific genes regulated upon pathogen infection or elicitor treatment . Among the genes on the microarrays are targets of miRNAs. However, without further processing, these data are difficult to reach by biologists. Toward this end, we present PMTED, a manually curated plant miRNA target expression database that comprises query, analytical, and visualization functionalities for more than 5,000 miRNA targets under several hundred experiments for 12 plant species. In addition, meta-data from 92 experiments were curated and can be searched and browsed so that novel hypotheses of miRNA and target interactions can be generated for further validation. PMTED is designed (1) to facilitate the plant miRNA research by making the large amount of target expression information available; and (2) to provide a portal to facilitate the transfer of model plant resource such as those of Arabidopsis and rice to less studied crop species such as wheat and maize.
Construction and content
Datasets and processing
MiRNA target prediction
Statistics of miRNAs, targets, series, and microarrays in PMTED
Data curation and classification
We further screened the annotation information of high quality experiments. A total of 92 experiments from nine species of 202 sample comparisons were selected and classified into three major bioprocesses: abiotic stress, biotic stress, and development (Additional file 1: Table S1). These experiments were further divided into subgroups according to their commonality in experimental conditions that were represented by unique terms. A total of 55 terms were extracted from the annotation, including 25 abiotic stress terms (such as drought, salt, and acid), 20 biotic stress terms (such as rice stripe virus, powdery mildew and fungus), and 10 development terms (such as leaf, root and stem) (Additional file 3: Table S2). Since the data were derived from the same platform (i.e. Affymetrix) and processed through the same pipeline, these terms should be suitable for meta-network construction.
Taking the number of validated miRNAs and the volume of Affymetrix microarray experiments into consideration, we eventually selected 12 species to be included in the first version of our database. These species are Arabidopsis thaliana, Glycine max, Zea mays, Oryza sativa, Gossypium hirsutum, Citrus sinensis, Medicago truncatula, Populus trichocarpa, Solanum lycopersicum, Saccharum officinarum, Vitis vinifera and Triticum aestivum. Table 1 lists the miRNAs, targets and experiments hosted in PMTED with a detailed experiment list in Additional file 1: Table S1.
Differentially expressed genes and meta-data connection
Where n is the number of samples, and xi is the value of ith sample.
This section is for querying information about predicted target genes, such as sequence, miRNA alignment and location, gene ontology, and expression profiles in the related experiments. For easy access, four entries were provided to the users: (1) By miRNA: Browse or search a known miRNA and then its targets and associated experiments; (2) By experiment: Select an experiment of interest and retrieve expression data of target genes of a miRNA of interest; (3) By target gene ID: Browse or search of a target gene of a user’s interest and hence the experiments containing its expression patterns. (4) By target prediction: This entry is for a novel miRNA defined by a user. Targets were firstly predicted and then the expression patterns of these genes in the associated experiments can be retrieved. The default quality score of prediction is set as 4 which we consider as a stringent threshold; but users have a wide range to choose (0-8) and can select more relaxed conditions such that more candidate targets can be returned.
This module enables users to search and browse the result of a meta-analysis. The search function allows combinatorial queries of condition terms, species, and miRNA family names. Results were displayed in table format with cross references for additional species and experimental conditions containing the same miRNA(s). The browse function provides direct access to meta-data via interactive graphs generated by the Cytoscape web program . There are three entries presenting different levels and angles of the curated meta-networks: (1) By miRNA presents users with all the experimental conditions of a selected miRNA family and associated experiments and their details; (2) By condition shows all the miRNAs with experiments containing their target genes that are associated with the selected condition; (3) By bio-process presents all conditions associated with user selected bioprocess and then the miRNAs with experiments containing their target genes. Meta-data structures provide global views of differentially expressed genes centering a miRNA family, a condition, a bioprocess or a species, where hypothesis can be created for experimental validation.
Basic information query by miRNA
This module enables users to perform a flexible combinatorial query of annotated bioprocesses, species, and miRNA family names for meta-analysis (Figure 2c). Results are displayed in table format which contains the list of selected objects, such as bioprocess, biology term, miRNA family name, species, experiments and the strength of relation (by CV values; Figure 2d). The presence of multiple terms demonstrates meta-relationship among them and can be considered as putative biological properties for selected miRNAs. This information can assist in raising novel hypothesis for experimental validation. For example, when we search miR156 for Biotic stress and Development in rice (Oryza sativa) and Arabidopsis, we found that miR156 targets are associated with leaf development in both rice and Arabidopsis. They are also related to stem, seedling and embryogenesis development in Arabidopsis. In addition, our results suggest that miR156 targets are associated with biotic stress responses, such as rice stripe virus and powdery mildew in Arabidopsis and rice. Since miRNA156 has been reported to be involved in virus-induced biological stress response in tobacco and Arabidopsis , results from this database search may provide a valuable clue for probable roles of miR156 as well as its targets in rice disease resistance.
Click on the edges to check CV value or fold changes of targets.
Draw a node to place it in a more prominent position. We also provide a “Recalculate layout” button to make the results more clearly structured, especially after users append additional child nodes.
“Reload” button enables users to reload a picture.
Click on leaf nodes (those without out edge) can add more child nodes.
Click on an experiment node will give detailed expression profiles of the related target genes in the category plot.
For example, when we select to browse by miRNA and choose Oryza sativa and miR1432, a network of osa-miR1432 and five nodes-- aerobic and anoxic, Magnaporthe oryzae, drought, arsenate and salt was obtained. This result is consistent with the function of predicted rice miR1432 target, a calmodulin-binding protein, and suggests a role of this miRNA and probably associated targets in calcium signaling . This rice gene has been found to be differentially regulated in response to drought stress . The information obtained by clicking the five bio-condition nodes will provide further related experiments and then target expression profiles which may help in further experimental validation.
Since the emergence of plant miRNAs as important regulators for various biological processes, a number of databases have been designed to host miRNAs [17-19], together with various target prediction programs [11, 20-22]. For example, miRBase hosts both plant and animal miRNAs and their precursor sequences [2, 19, 23], while PMRD, a specialized plant miRNA database, integrates plant miRNA data curated from recent literatures with functions for sequence information, secondary structure, and target gene retrieval . PMTED takes advantage of the extant gene expression patterns in the high volume of microarray data generated in the past two decades, providing a platform for plant researchers to infer miRNA functions through their target performance. Furthermore, the expression patterns among experiments with the same target genes were linked together by curated common biological terms allowing for a global meta-analysis of the experiments. The Affymetrix platform provides a standardized system with a high degree of reproducibility [24, 25]. Despite this, the quality of the experiments is variable and needs manual screening in terms of the design and the completeness. The experiments we used were carefully screened and processed with stringent criteria set in the Robin program . After rigorous filtering, the retained microarray experiments were of high quality with clear biological object and experimental design. To ensure high quality target gene prediction, our pipeline followed the stringent rules as Allen et al. (2005) which were derived from a set of experimentally validated miRNA-target pairing rules. With a score of ≤ 4, the program has a false negative rate of 0.03 .
PMTED boasts a number of querying and analysis functionalities developed to facilitate functional discovery, suitable for both validated miRNAs and potential novel miRNAs designated by users. In the mean time, the connection of miRNAs with differentially expressed targets in a number of microarray experiments allows a cross species/experiments/bioprocess meta-analysis. The contextual information of target gene expression data should be indicative of potential biological functions and the regulatory network that a miRNA involves, facilitating the process of biological discovery. The current version of PMTED focuses on abiotic, biotic and development experiments and should expand its scope in the near future. Despite the appearance of new technologies for gene expression study, such as RNA-seq, microarray analysis remains as an effective option for genome-wide gene expression analysis. With further accumulation of the microarray data, more species will be included. Pipelines are being developed for data compatibility from other microarray platforms. Experiments from next generation sequencing technology such as RNA-seq data, expression data of miRNAs themselves, as well as related literatures will also fall in the scope of collection for the future version of PMTED.
PMTED is a database that comprises a number of querying, analytical, and visualization functionalities for miRNAs targets. It is designed to link miRNA target gene expression patterns from various microarray experiments and reveal contextual significance of miRNA-target gene regulatory networks. PMTED should be a useful resource for researchers working in both model plants and agriculturally important crops.
Availability and requirements
Database name: PMTED
Database homepage: http://pmted.agrinome.org
Browser requirement: the application is optimized for Internet Explorer 9, Mozilla FireFox 16.0.2 and Safari 5.1.7.
Datasets in PMTED is freely available. Please use the link ‘Contact Us’ on the PMTED homepage or email Dr. Long Mao at firstname.lastname@example.org to request specific data subsets.
China National High Tech ‘863’ Program No. 2012AA10A308; NSFC Nos. 61073075, 61272207; Sci-Tech Development Project of Jilin Province No. 20120730.
- Khraiwesh B, Zhu JK, Zhu J: Role of miRNAs and siRNAs in biotic and abiotic stress responses of plants. Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms. 2012, 1819: 137-148. 10.1016/j.bbagrm.2011.05.001.View ArticleGoogle Scholar
- Griffiths-Jones S, Saini HK, Van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36 (suppl 1): D154-PubMed CentralPubMedGoogle Scholar
- Parkinson H, Kapushesky M, Shojatalab M, Abeygunawardena N, Coulson R, Farne A, Holloway E, Kolesnykov N, Lilja P, Lukk M: ArrayExpress—a public database of microarray experiments and gene expression profiles. Nucleic Acids Res. 2006, 35 (suppl 1): D747-PubMed CentralPubMedGoogle Scholar
- Parkinson H, Sarkans U, Shojatalab M, Abeygunawardena N, Contrino S, Coulson R, Farne A, Garcia Lara G, Holloway E, Kapushesky M: ArrayExpress—a public repository for microarray gene expression data at the EBI. Nucleic Acids Res. 2005, 33 (suppl 1): D553-PubMed CentralPubMedGoogle Scholar
- Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30 (1): 207-10.1093/nar/30.1.207.PubMed CentralView ArticlePubMedGoogle Scholar
- Bülow L, Schindler M, Choi C, Hehl R: PathoPlant: a database on plant-pathogen interactions. Silico Biol. 2004, 4: 529-536.Google Scholar
- Lohse M, Nunes Nesi A, Kruger P, Nagel A, Hannemann J, Giorgi FM, Childs L, Osorio S, Walther D, Selbig J: Robin: an intuitive wizard application for R-based expression microarray quality assessment and analysis. Plant Physiol. 2010, 153 (2): 642-10.1104/pp.109.152553.PubMed CentralView ArticlePubMedGoogle Scholar
- Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4 (2): 249-10.1093/biostatistics/4.2.249.View ArticlePubMedGoogle Scholar
- Zhang Z, Yu J, Li D, Liu F, Zhou X, Wang T, Ling Y, Su Z: PMRD: plant microRNA database. Nucleic Acids Res. 2010, 38 (suppl 1): D806-PubMed CentralView ArticlePubMedGoogle Scholar
- Allen E, Xie Z, Gustafson AM, Carrington JC: microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell. 2005, 121 (2): 207-221. 10.1016/j.cell.2005.04.004.View ArticlePubMedGoogle Scholar
- Dai X, Zhao PX: psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res. 2011, 39 (suppl 2): W155-W159.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhou M, Gu L, Li P, Song X, Wei L, Chen Z, Cao X: Degradome sequencing reveals endogenous small RNA targets in rice (Oryza sativa L. ssp. indica). Front Biol. 2010, 5 (1): 67-90. 10.1007/s11515-010-0007-8.View ArticleGoogle Scholar
- Smyth GK: Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3 (1): 3-Google Scholar
- Lopes CT, Franz M, Kazi F, Donaldson SL, Morris Q, Bader GD: Cytoscape Web: an interactive web-based network browser. Bioinformatics. 2010, 26 (18): 2347-2348. 10.1093/bioinformatics/btq430.PubMed CentralView ArticlePubMedGoogle Scholar
- Sunkar R, Girke T, Jain PK, Zhu J-K: Cloning and characterization of microRNAs from rice. Plant Cell Online. 2005, 17 (5): 1397-1411. 10.1105/tpc.105.031682.View ArticleGoogle Scholar
- Kantar M, Lucas SJ, Budak H: miRNA expression patterns of Triticum dicoccoides in response to shock drought stress. Planta. 2011, 233 (3): 471-484. 10.1007/s00425-010-1309-4.View ArticlePubMedGoogle Scholar
- Szcześniak MW, Deorowicz S, Gapski J, Kaczyński L, Makałowska I: miRNEST database: an integrative approach in microRNA search and annotation. Nucleic Acids Res. 2012, 40 (D1): D198-D204. 10.1093/nar/gkr1159.PubMed CentralView ArticlePubMedGoogle Scholar
- Bielewicz D, Dolata J, Zielezinski A, Alaba S, Szarzynska B, Szczesniak MW, Jarmolowski A, Szweykowska-Kulinska Z, Karlowski WM: mirEX: a platform for comparative exploration of plant pri-miRNA expression data. Nucleic Acids Res. 2012, 40 (D1): D191-D197. 10.1093/nar/gkr878.PubMed CentralView ArticlePubMedGoogle Scholar
- Griffiths Jones S, Grocock RJ, Van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006, 34 (suppl 1): D140-PubMed CentralView ArticlePubMedGoogle Scholar
- Lewis BP, Burge CB, Bartel DP: Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005, 120 (1): 15-20. 10.1016/j.cell.2004.12.035.View ArticlePubMedGoogle Scholar
- Sethupathy P, Corda B, Hatzigeorgiou AG: TarBase: A comprehensive database of experimentally supported animal microRNA targets. RNA. 2006, 12 (2): 192-197.PubMed CentralView ArticlePubMedGoogle Scholar
- Yang JH, Li JH, Shao P, Zhou H, Chen YQ, Qu LH: starBase: a database for exploring microRNA-mRNA interaction maps from Argonaute CLIP-Seq and Degradome-Seq data. Nucleic Acids Res. 2011, 39 (suppl 1): D202-D209.PubMed CentralView ArticlePubMedGoogle Scholar
- Kozomara A, Griffiths Jones S: miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011, 39 (suppl 1): D152-PubMed CentralView ArticlePubMedGoogle Scholar
- Hennig L, Menges M, Murray JAH, Gruissem W: Arabidopsis transcript profiling on Affymetrix GeneChip arrays. Plant Mol Biol. 2003, 53 (4): 457-465.View ArticlePubMedGoogle Scholar
- Redman JC, Haas BJ, Tanimoto G, Town CD: Development and evaluation of an Arabidopsis whole genome Affymetrix probe array. Plant J. 2004, 38 (3): 545-561. 10.1111/j.1365-313X.2004.02061.x.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.