'PACLIMS': A component LIM system for high-throughput functional genomic analysis
© Donofrio et al; licensee BioMed Central Ltd. 2005
Received: 17 December 2004
Accepted: 12 April 2005
Published: 12 April 2005
Recent advances in sequencing techniques leading to cost reduction have resulted in the generation of a growing number of sequenced eukaryotic genomes. Computational tools greatly assist in defining open reading frames and assigning tentative annotations. However, gene functions cannot be asserted without biological support through, among other things, mutational analysis. In taking a genome-wide approach to functionally annotate an entire organism, in this application the ~11,000 predicted genes in the rice blast fungus (Magnaporthe grisea), an effective platform for tracking and storing both the biological materials created and the data produced across several participating institutions was required.
The platform designed, named PACLIMS, was built to support our high throughput pipeline for generating 50,000 random insertion mutants of Magnaporthe grisea. To be a useful tool for materials and data tracking and storage, PACLIMS was designed to be simple to use, modifiable to accommodate refinement of research protocols, and cost-efficient. Data entry into PACLIMS was simplified through the use of barcodes and scanners, thus reducing the potential human error, time constraints, and labor. This platform was designed in concert with our experimental protocol so that it leads the researchers through each step of the process from mutant generation through phenotypic assays, thus ensuring that every mutant produced is handled in an identical manner and all necessary data is captured.
Many sequenced eukaryotes have reached the point where computational analyses are no longer sufficient and require biological support for their predicted genes. Consequently, there is an increasing need for platforms that support high throughput genome-wide mutational analyses. While PACLIMS was designed specifically for this project, the source and ideas present in its implementation can be used as a model for other high throughput mutational endeavors.
Genome sequencing is the first step towards understanding the complex interplay between pathways and networks that determine the biology of living organisms. The next important step in these analyses is to perform genome-wide investigations to identify the functions of individual genes. While hybridization techniques such as DNA-based microarrays can provide insight into groups of genes that potentially operate in common pathways, validation is required before final functional assignment . Furthermore, many genes are regulated in a post-transcriptional manner, thus their function would not be definable by microarrays . Genome-wide screens of mutants created by targeted and random mutagenesis, as well as the method of gene silencing, are particularly powerful for ascribing phenotypes to individual genes and gene families and can potentially validate predictions from sequence and microarray data [3–7].
In many cases, taking a genome-wide approach to functional gene analysis requires the combined skills and resources of several research groups working with a semi-automated, rapid-throughput pipeline. To facilitate our goal of a comprehensive functional gene analysis in the fungus Magnaporthe grisea, we have developed a platform for high-throughput mutagenesis and phenotypic characterization. Using this platform, we are seeking to elucidate the functions of the approximately 11,000 genes in the thirty-eight megabase genome of this fungus . M. grisea is the causal agent of rice blast disease, the most devastating disease of rice worldwide . The economic importance of this pathogen and its genetic tractability make it a model system for understanding fungal biology, as well as plant-pathogen interactions .
One of the strategies that we have adopted to determine the functions of individual genes is to create 50,000 M. grisea strains, each carrying a single random mutation within the genome. The mutant strains are generated by introducing a disruption cassette into the fungus, which consists of a DNA fragment that confers resistance to the antibiotic, hygromycin B . Transformed M. grisea cells that incorporate the cassette into their chromosomal DNA are then able to grow on media containing the antibiotic. During the process, the cassette will often insert into an open reading frame or regulatory region, resulting in a loss of gene function and thus a biochemical or structural deficiency. Identification and characterization of phenotypic changes in each mutant provides information about the normal biological role(s) of the disrupted gene, whose identity is established by taking advantage of the fact that it has been "tagged" by the inserted antibiotic resistance marker [12, 13].
Research groups from two universities, University of Arizona (UA) and University of Kentucky (UKY), are cooperating to create the tagged M. grisea lines and to characterize any phenotypic changes. The mutant strains are then shipped to North Carolina State University (NCSU), where they are screened for changes in pathogenicity using susceptible rice varieties. Finally, all mutant strains are sent to the Fungal Genetics Stock Center (Kansas City, MO), a fungal strain repository, where they will be archived and made available to the public. The distribution of research efforts and pooling of the resources and data generated dramatically increases the necessity of having a system for each research laboratory to enter and access the information being produced.
From creation to final analysis, each mutant is processed through a total of eight barcoded steps and four phenotypic assays resulting in the capture of a dozen individual pieces of data over a period of 3–6 months. The ability to log, process and archive information in an efficient and secure manner is vital to the success of this project. To record data and track these mutants, we have developed a minimal Laboratory Information Management System (LIMS), called PACLIMS (P henotype A ssay C omponent LIMS) that is described in this report. This system was designed to be flexible in order to accommodate the experimental protocol as it evolved. The software fulfills the role of process control by enforcing the steps of our protocol, and reduces laboratory and data entry errors while allowing the data generated at the three universities to be entered from separate locations.
The PACLIMS system was implemented with Open Source, freely available software. The server machine runs Red Hat linux (RH), which runs on a large variety of commodity PC hardware. The RH distribution includes most of the software components that are required to construct PACLIMS. The Postgresql relational database system was used for data storage . This allows the utilization of transactions for data integrity, network based access, and supports numerous interface technologies. The Apache web server was employed for the user interface and for interconnecting the database and control programs, via a simple CGI oriented mechanism that follows normal web practices . Implementation was performed using Perl, a common bioinformatics language, allowing the system to be readily modified [20, 21].
Distributed operations and client/server web interface
A centralized, web-based client/server paradigm was chosen to reduce the management burden presented by the system. All server-based processing occurs on a single computer. Web server dependence was minimized by using a simple CGI interface between the server and the PACLIMS control programs. Secure access is ensured by employing the SSL-based HTTPS protocol. Secure user-access and presentation of security credentials occurs through a web browser such as Netscape and Internet Explorer, so that when a user logs into the system, their identity is associated with all subsequent actions.
Mutant production pipeline
The initial stages of mutant production and morphological characterization are performed at UKY and UA. After the creation of mutants and genetic purification each mutant is transferred to a well of a 24-well plate containing complete medium agar plus hygromycin with three cellulose paper disks on the agar surface. This "parent plate" marks the entry point for PACLIMS. All subsequent daughter plates can be tracked back to their parent. A barcode is attached to the plate and scanned into PACLIMS, which then directs the user through web forms, in order to record details about the plate's contents (Figure 3A and 3B). The parent plate is incubated for a defined period of time at which point the user collects phenotype data such as growth rate and enters it into the system (Figure 2, Module 9). PACLIMS also directs the user to create other copies of the mutants for sporulation and auxotrophy analyses. Permanent stocks are created in triplicate (Figure 2, Module 5), with one replicate being retained at the site of origin, one being shipped to NCSU for pathogenicity screening, and the final replicate going to the Fungal Genetics Stock Center (Kansas City, MO) for public request (Figure 2, Module 6). PACLIMS is used to direct the creation of these stocks, and to record the receipt of their shipment. All phenotype data generated are recorded in PACLIMS (Figure 2, Module 4, Module 9).
Sufficient reporting functionality is built into the system to support the data entry process. Contextual information is supplied to the user to allow review of the entered information prior to permanently committing it to the database. Robust reporting is provided by third party software such as Microsoft Access database communications protocol or database systems like MGOS by using Postgresql's own network communications protocol [17, 18]. Separating and relegating reporting to an external component increases the reusability and component nature of the implementation. PACLIMS can be readily modified to account for different research protocols without disrupting the reporting mechanism. Moreover, specialized third party reporting tools provide a ready means of creating custom reports, as need dictates.
The current version of PACLIMS is freely available to academic and non-profit users at http://paclims.sourceforge.net. Furthermore, the system is modular and readily customized to suit a laboratory's specific needs for a high-throughput screen. There is no need for purchasing additional software to use the system. Laboratory personnel who have introductory level experience with Perl can readily adapt the software to different protocols. Please contact Ralph_Dean@ncsu.edu for further details.
This project is funded by a grant from the National Science Foundation Plant Genome Program award number DBI #0115642.
- Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. PNAS 1998, (25):14863–14868. DEC 8 1998 10.1073/pnas.95.25.14863Google Scholar
- Klaff P, Riesner D, Steger G: RNA structure and the regulation of gene expression. Plant Molecular Biology 1996, 32(1–2):89–106. OCT 10.1007/BF00039379View ArticlePubMedGoogle Scholar
- Dufresne M, Bailey JA, Dron M, Langin T: clk1, a serine/threonine protein kinase-encoding gene, is involved in pathogenicity of Colletotrichum lindemuthianum on common bean. MPMI 1998, 11: 99–108.View ArticlePubMedGoogle Scholar
- Sweigard JA, Carroll AM, Farrall L, Chumley FG, Valent B: Magnaporthe grisea pathogenicity genes obtained through insertional mutagenesis. MPMI 1998, 11: 404–412.View ArticlePubMedGoogle Scholar
- Balhadère PV, Foster AJ, Talbot NJ: Identification of pathogenicity mutants of the rice blast fungus Magnaporthe grisea by insertional mutagenesis. Mol Plant Microbe Interact 1999, 12: 129–142.View ArticleGoogle Scholar
- Kadotani N, Nakayashiki H, Tosa Y, Mayama S: RNA silencing in the phytopathogenic fungus Magnaporthe grisea . MPMI 2003, 16: 769–776.View ArticlePubMedGoogle Scholar
- Leonhardt N, Kwak JM, Robert N, Waner D, Leonhardt G, Schroeder JI: Microarray expression analyses of Arabidopsis guard cells and isolation of a recessive abscisic acid hypersensitive protein phosphatase 2C mutant. The Plant Cell 2004, 16: 596–615. 10.1105/tpc.019000PubMed CentralView ArticlePubMedGoogle Scholar
- Broad Institute:[http://www.broad.mit.edu/annotation/fungi/magnaporthe/]
- Talbot NJ: Having a blast: Exploring the pathogenicity of Magnaporthe grisea . Trends Microbiol 1995, 3: 9–16. 10.1016/S0966-842X(00)88862-9View ArticlePubMedGoogle Scholar
- Valent B, Farrall L, Chumley FG: Magnaporthe grisea genes for pathogenicity and virulence identified through a series of backcrosses. Genetics 1991, 127(1):87–101.PubMed CentralPubMedGoogle Scholar
- Leung H, Lehtinen U, Karjalainen U, et al.: Transformation of the rice blast fungus Magnaporthe grisea to hygromycin B resistance. Curr Genet 1990, 17: 409–411. 10.1007/BF00334519View ArticlePubMedGoogle Scholar
- Shi Z, Christian D, Leung H: Enhanced transformation in Magnaporthe grisea by restriction enzyme mediated integration of plasmid DNA. Phytopathology 1995, 85: 329–333.View ArticleGoogle Scholar
- Gold SE, Garcia-Pedrajas MD, Martinez-Espinoza AD: New (and used) approaches to the study of fungal pathogenicity. Annu Rev Phytopathol 2001, 39: 337–65. 10.1146/annurev.phyto.39.1.337View ArticlePubMedGoogle Scholar
- Goodman N, Rozen S, Stein LD, Smith AG: The LabBase system for data management in large scale biology research laboratories. Bioinformatics 1998, 14: 562–574. 10.1093/bioinformatics/14.7.562View ArticlePubMedGoogle Scholar
- Imbert MC, Nguyen VK, Granjeaud S, Nguyen C, Jordan BR: 'LABNOTE', a laboratory notebook system designed for academic genomics groups. Nucleic Acids Res 1999, 27: 601–607. 10.1093/nar/27.2.601PubMed CentralView ArticlePubMedGoogle Scholar
- Kokocinski F, Wrobel G, Hahn M, Lichter P: QuickLIMS: facilitating the data management for DNA-microarray fabrication. Bioinformatics 2003, 19: 283–284. 10.1093/bioinformatics/19.2.283View ArticlePubMedGoogle Scholar
- Stein L: How perl saved the Human Genome Project. Dr. Dobbs Journal 1997.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.