eQTL Viewer: visualizing how sequence variation affects genome-wide transcription
© Zou et al; licensee BioMed Central Ltd. 2007
Received: 16 August 2006
Accepted: 09 January 2007
Published: 09 January 2007
Expression Quantitative Trait Locus (eQTL) mapping methods have been used to identify the genetic basis of gene expression variations. To map eQTL, thousands of expression profiles are related with sequence polymorphisms across the genome through their correlated variations. These eQTL distribute in many chromosomal regions, each of which can include many genes. The large number of mapping results produced makes it difficult to consider simultaneously the relationships between multiple genomic regions and multiple expressional profiles. There is a need for informative bioinformatics tools to assist the visualization and interpretation of these mapping results.
We have developed a web-based tool, called eQTL Viewer, to visualize the relationships between the expression trait genes and the candidate genes in the eQTL regions using Scalable Vector Graphics. The plot generated by eQTL Viewer has the capacity to display mapping results with high resolutions at a variety of scales, and superimpose biological annotations onto the mapping results dynamically.
Our tool provides an efficient and intuitive way for biologists to explore transcriptional regulation patterns, and to generate hypotheses on the genetic basis of transcriptional regulations.
Transcriptional control is a crucial step in organ development and cellular responses to environmental changes. Recent studies have demonstrated that mRNA expression levels vary in both natural and experimental populations . Expression Quantitative Trait Locus (eQTL) mapping seeks to explain such variations by identifying the relationships between the transcript abundance and specific genomic markers . When the transcript abundance is treated as a continuous trait for the purpose of mapping, it is termed an expression trait (eTrait).
eQTL mapping differs from the classical QTL mapping in two important ways, which allow us to ask questions that we cannot address by the traditional approach. First, there is a one-to-one relationship between an eTrait and a gene with its expression profile assayed in the mapping population. The biological information associated with an eTrait gene can be used to suggest potential causal genes for an eQTL. This is important as an eQTL usually spans a large genomic region that contains many genes. For example, eQTL analysis has been used to infer cis- and trans-acting regulatory regions in yeast [3–5], mice and rat [6–8]. If an eQTL is mapped to a genomic region where the eTrait gene is located, it may suggest the cis-regulatory mechanism for the eQTL, i.e., certain sequence variations around the gene region of the eTrait may directly influence the transcript abundance of the gene. In such a case, the genomic region around the eTrait gene can be prioritized for further experimental scrutiny to identify the causal element that affects the eTrait. Otherwise, mapping results may indicate trans-acting regulations, i.e., the variation of an eTrait is affected by sequence polymorphisms in other genes. Secondly, the classical QTL mapping focuses on one or a few traits, but an eQTL mapping study may have thousands of eTraits. Biochemical or co-expression relationships among eTrait genes contain additional layers of information beyond just the trait-marker linkages.
To display the myriad of relationships between eTraits, markers, and genes, we need a convenient bioinformatics tool to visualize eQTL mapping results at a variety of scales ranging from a single locus to the entire genome. Additionally, researchers need quick and straightforward ways to integrate these results with the extra information from previous studies on the organism. To address these needs, we have developed eQTL Viewer, a web-based tool that plots eQTL mapping results. The resulting plot displays eQTL for thousands of eTraits in a single view, which makes patterns such as cis- and trans-regulations readily identifiable. We have also empowered such a plot with the ability to present annotations, highlight features, and organize eTraits in biological groups, such as biochemical pathways. All these characteristics make eQTL Viewer an intuitive and information rich environment to discover and understand genome-wide transcriptional regulation patterns.
In eQTL Viewer, each eQTL is represented by a consecutive list of genes that fall within the genomic region where the eQTL is located. An eQTL region can be estimated by a LOD support interval from the mapping procedure . Given a genomic interval, the genes in it can be found using text parsing tools like PERL. We have written example programs to translate eQTL regions into gene lists, and organize them in a XML format file. The eQTL Viewer then converts the XML input into a scalable graph with eTrait genes arranged along the vertical axis and eQTL along the horizontal axis. This basic plot is prevalent in the eQTL literature (see examples in ).
Rather than just summarizing the results in a static plot, eQTL Viewer creates an interactive plot that has many useful features. The graph allows a user to zoom in to study genes in each eQTL or zoom out to look at the overall regulation patterns for the genome-wide eTraits. The graph has a search function that can be used to query and highlight an eTrait and its eQTL according to the eTrait gene name. A user can also use customizable scripts to have a list of eTraits and their eQTL highlighted. For example, one can highlight eQTL of all eTraits associated with a certain metabolic pathway. eQTL can also be grouped along the vertical axis according to the biological properties of eTrait genes. This would reveal the eQTL distribution pattern for a number of eTrait genes that share similar biological functions. Each genomic element in the graph can be linked to its annotation information in an external database, such as NCBI. All these features make eQTL Viewer a unique tool for organizing eQTL mapping results and integrating related biological knowledge.
The eQTL on the diagonal overlap the genomic regions where the eTrait genes reside, indicating potential cis-acting regulations for the eQTL. In Figure 1, there are 736 eTraits with cis-acting eQTL along the diagonal (about 12% of the 6195 eTraits assayed). There are also 2969 eTraits with trans-acting eQTL off the diagonal (about 48% of all eTraits). There are a number of eQTL "hotspots" formed by trans-acting eQTL. Three major ones are located on chromosome 2 (about 450 eTraits), chromosome 14 (about 540 eTraits), and chromosome 15 (about 380 eTraits) (Figure 1). When pointing at an eQTL, the eTrait gene and the candidate genes in the eQTL will be displayed in the right sidebar.
We can superimpose on the graph annotations related with transcriptional regulations. For example, based on the information of yeast transcriptional factors and their binding sites , we found 49 eTrait-candidate gene pairs that are parts of the known transcriptional factor regulation network. They are indicated as green dots in Figure 1. Also, we found 734 cases in which a gene within an eQTL region can form a protein complex with the eTrait gene . These are shown in red dots in Figure 1. As there are so many of them, these dots crowd together in the regions where eQTL are dense. In this case, one can zoom in on a cluster to scatter the dots and inspect each eTrait-candidate gene pair as a potential biological mechanism underlying an eQTL. After rearranging eTraits according to KEGG pathways, the graph (available in our website) shows that eTraits of genes in the oxidative phosphorylation pathway  have eQTL clustered in the middle of chromosome 15. This suggests a regulatory region for that pathway. Without the grouping feature of eQTL Viewer, such a pattern would not be easily visualized.
We emphasize gene-gene relationships in eQTL Viewer. These relationships provide important information for biologists to understand and search for the genetic basis of eQTL. An eQTL can span physically a large genomic region, depending on the mapping experimental design. Due to the limitations of linkage studies it is difficult to pin down which gene within an eQTL is the source of eTrait variation . By relating eTraits and genetic markers to their corresponding genes, our eQTL Viewer organizes each eQTL as a list of pairwise relationships between an eTrait gene and the multiple candidate genes in the eQTL region. This goes a step further than just showing the relationships between mRNA probes and polymorphic markers.
Mueller et al.  recently introduced their eQTL Explorer package in a similar spirit. While both software packages provide features for exploring eQTL results, they use differing approaches and fulfil complementary functions. eQTL Explorer integrates physiological QTL mapping into eQTL mapping to generate biological hypotheses. Fully recognizing the characteristics of eQTL mapping as compared with classical physiological QTL mapping, we put more emphasis on gene-gene relationships in developing eQTL Viewer as discussed above. Also graphs produced by eQTL Viewer can capture all the eQTL in one plot to display the transcriptional regulatory pattern for the entire transcriptome. Such graphs can be scaled up to hold thousands of eQTL mapped onto a single chromosomal region.
The GenomeGraph module from WebQTL [22, 23] is being developed using the same SVG technology to visualize eQTL. It takes the great advantage of WebQTL, an online repository of analysis tools and multiscale mapping data from various model organisms. However, like eQTL Explorer, GenomeGraph visualizes the relationships among probe intensities and genetic markers inferred from eQTL studies. As a standalone visualization tool, our eQTL Viewer allows more flexible incorporation of recent development in statistical methods and biological discoveries. When we interpret an eQTL interval as a list of pairwise relationships between an eTrait gene and multiple candidate genes, additional relationships among genes can be readily plotted on top of the mapping results. Thus, this opens a gate between a single mapping study and the rest of biological investigations in the area.
eQTL Viewer is a robust web-based bioinformatics tool that generates a scalable graph to visualize estimated relationships between sequence polymorphisms and gene expression profiles. It is our intent to help form a bridge between quantitative genetic analysis and systems biology, and provide a bioinformatics platform to interpret statistical patterns using biological information.
Availability and requirements
Project name: eQTL Viewer
Project home page: http://statgen.ncsu.edu/eQTLViewer
Operating system(s): Platform independent
Other requirements: To view Scalable Vector Graphics, users may need appropriate plug-in for browsers from Adobe http://www.adobe.com/svg/viewer/install/.
License: GNU GPL
Any restrictions to use by non-academics: Licence needed
This work was partially supported by NIH GM45344 and by the National Research Initiative of the USDA Cooperative State Research, Education and Extension Service, grant number 2005-00754. WZ is supported by a university genomics science fellowship and DLA is supported by a NIEHS training grant in Bioinformatics.
- Oleksiak MF, Churchill GA, Crawford DL: Variation in gene expression within and among natural populations. Nat Genet 2002, 32: 261–266. 10.1038/ng983View ArticlePubMed
- Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, Linsley PS, Mao M, Stoughton RB, Friend SH: Genetics of gene expression surveyed in maize, mouse and man. Nature 2003, 422: 297–302. 10.1038/nature01434View ArticlePubMed
- Brem RB, Yvert G, Clinton R, Kruglyak L: Genetic dissection of transcriptional regulation in budding yeast. Science 2002, 296: 752–755. 10.1126/science.1069516View ArticlePubMed
- Brem RB, Storey JD, Whittle J, Kruglyak L: Genetic interactions between polymorphisms that affect gene expression in yeast. Nature 2005, 436: 701–703. 10.1038/nature03865PubMed CentralView ArticlePubMed
- Yvert G, Brem RB, Whittle J, Akey JM, Foss E, Smith EN, Mackelprang R, Kruglyak L: Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet 2003, 35: 57–64. 10.1038/ng1222View ArticlePubMed
- Bystrykh L, Weersing E, Dontje B, Sutton S, Pletcher MT, Wiltshire T, Su AI, Vellenga E, Wang J, Manly KF, Lu L, Chesler EJ, Alberts R, Jansen RC, Williams RW, Cooke MP, de Haan G: Uncovering regulatory pathways that affect hematopoietic stem cell function using 'genetical genomics'. Nat Genet 2005, 37: 225–232. 10.1038/ng1497View ArticlePubMed
- Chesler EJ, Lu L, Shou S, Qu Y, Gu J, Wang J, Hsu HC, Mountz JD, Baldwin NE, Langston MA, Threadgill DW, Manly KF, Williams RW: Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat Genet 2005, 37: 233–242. 10.1038/ng1518View ArticlePubMed
- Hubner N, Wallace CA, Zimdahl H, Petretto E, Schulz H, Maciver F, Mueller M, Hummel O, Monti J, Zidek V, Musilova A, Kren V, Causton H, Game L, Born G, Schmidt S, Muller A, Cook SA, Kurtz TW, Whittaker J, Pravenec M, Aitman TJ: Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease. Nat Genet 2005, 37: 243–253. 10.1038/ng1522View ArticlePubMed
- Scalable Vector Graphics (SVG) 1.1 Specification [www.w3.org/TR/SVG/]
- Tanoue J, Yoshikawa M, Uemura S: The GeneAround GO viewer. Bioinformatics 2002, 18: 1705–1706. 10.1093/bioinformatics/18.12.1705View ArticlePubMed
- Beynon RJ: A simple tool for drawing proteolytic peptide maps. Bioinformatics 2005, 21: 674–675. 10.1093/bioinformatics/bti077View ArticlePubMed
- Blake J, Schwager C, Kapushesky M, Brazma A: ChroCoLoc: an application for calculating the probability of co-localization of microarray gene expression. Bioinformatics 2006, 22: 765–767. 10.1093/bioinformatics/btk022View ArticlePubMed
- Kerkhoven R, van Enckevort FHJ, Boekhorst J, Molenaar D, Siezen RJ: Visualization for genomics: the Microbial Genome Viewer. Bioinformatics 2004, 20: 1812–1814. 10.1093/bioinformatics/bth159View ArticlePubMed
- Stothard P, Wishart DS: Circular genome visualization and exploration using CGView. Bioinformatics 2005, 21: 537–539. 10.1093/bioinformatics/bti054View ArticlePubMed
- Conneally PM, Edwards JH, Kidd KK, Lalouel JM, Morton NE, Ott J, White R: Report of the Committee on Methods of Linkage Analysis and Reporting. Cytogenet Cell Genet 1985, 40: 356–359.View ArticlePubMed
- Kao CH, Zeng ZB, Teasdale RD: Multiple interval mapping for quantitative trait loci. Genetics 1999, 152: 1203–1216.PubMed CentralPubMed
- Luscombe NM, Babu MM, Yu H, Snyder M, Teichmann SA, Gerstein M: Genomic analysis of regulatory network dynamics reveals large topological changes. Nature 2004, 431: 308–312. 10.1038/nature02782View ArticlePubMed
- Hong EL, Balakrishnan R, Christie KR, Costanzo MC, Dwight SS, Engel SR, Fisk DG, Hirschman JE, Livstone MS, Nash R, Park J, Oughtred R, Skrzypek M, Starr B, Theesfeld CL, Andrada R, Binkley G, Dong Q, Lane C, Hitz B, Miyasato S, Schroeder M, Sethuraman A, Weng S, Dolinski K, Botstein D, Cherry JM: Saccharamyces Genome Database.2005. [http://www.yeastgenome.org/]
- Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res 2004, 32: D277–80. 10.1093/nar/gkh063PubMed CentralView ArticlePubMed
- Mackay TF: Quantitative trait loci in Drosophila. Nat Rev Genet 2001, 2: 11–20. 10.1038/35047544View ArticlePubMed
- Mueller M, Goel A, Thimma M, Dickens NJ, Aitman TJ, Mangion J: eQTL Explorer: integrated mining of combined genetic linkage and expression experiments. Bioinformatics 2006, 22: 509–511. 10.1093/bioinformatics/btk007View ArticlePubMed
- WebQTL [www.genenetwork.org]
- Wang J, Williams RW, Manly KF: WebQTL: web-based complex trait analysis. Neuroinformatics 2003, 1: 299–308. 10.1385/NI:1:4:299View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.