A fisheye viewer for microarray-based gene expression data
© Wu et al; licensee BioMed Central Ltd. 2006
Received: 05 June 2006
Accepted: 13 October 2006
Published: 13 October 2006
Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface – an electronic table (E-table) that uses fisheye distortion technology.
The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen.
This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.
The massively parallel research technique called microarray was developed ([1, 2]) to take advantage of the unprecedented amount of information available about an organism's genetic makeup. Microarray enables researchers to measure the relative amounts of every mRNA transcript from the genome in a single scan, thus increasing the number of data points from an experiment by several thousand folds. Biological researchers, who have been used to studying a small number of genes thoroughly over a period of years, must now use new concepts and methods to store and analyze their experimental results. Because of variations in microarray technology and the immaturity of the field, the computation of gene expression level and usage of semantics varies between platforms. Minimum Information About a Microarray Experiment (MIAME)  was proposed in 2001 as a uniform standard for recording and reporting microarray gene expression data. MIAME has been widely adopted because it eases the interpretation of expression data and the independent verification of experimental results. An XML-based data format, Microarray-based Gene Expression – Markup Language (MAGE-ML), was developed to facilitate the exchange of MIAME data . The MAGE-ML is the XML representation of MAGE-OM, which is an object model. The MAGE-OM contains 132 classes grouped into 17 packages. For example, Experiment is a package of MAGE-OM to describe the experiment goals and design; BioAssayData package stores gene-expression data; BioMaterial package describes biological materials used and description of their creation; and DesignElement package contains a mapping of features.
Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. Even though a large amount of gene expression data can be integrated into tables, it is still difficult to browse. Incoming integrated bioinformatics systems will support simultaneous query on both knowledge databases and microarray data simultaneously. For example, FlyMine for Drosophila melanogaster is a data warehouse that integrates several genomic and proteomic data sets in one place and its website allows users to build arbitrary complex queries across all data . But even when they have sophisticated tools like FlyMine, biologists will still want to preview the raw gene expression data in order to construct appropriate queries in the integrated database system. For example, a biologist is looking for the same gene expression profile from a microarray experiment. If the biologist can browse the raw data in the microarray, such as "ratio of means (ROM)" values, he or she can have a preliminary idea of the experimental data, such as the range of values. The biologist can efficiently use other computer systems to construct queries or confirm the accuracy of the query results.
The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface – an electronic table (E-table) that uses fisheye distortion technology. Fisheye views have been widely used to deliver large amounts of data in limited screen space. Their use is motivated by the observation that, at any one time, users are only focusing on a small part of the data. Fisheye views use variable magnification: the data on which the user is focusing is large, neighboring data is smaller, and distant data is very small. The Fisheye technique was initially called "bifocal displays" . As this "degree-of-interest" approach developed, it came to be called "fisheye views"  by analogy with the optical effect seen in photographs taken using fisheye lenses that have very short focal lengths. Table lens  is a "focus+context" based fisheye technology that works on tabular information to display of crucial label information and multiple distal focal areas. Graphical information is also integrated in the display of large tables using visualization technology . Visualization help improve the presentation of tabular data because humans are good at spot patterns and features in well-designed graphical rendering of collection of values. In fact, the combination of fisheye and visual graphical technologies can reduce navigation time when viewing a large tabular data collection. In this project, an E-table for MAGE data was successfully implemented as a Java application.
The Fisheye Viewer was implemented in Java so that it could run on multiple platforms. It uses the MAGE-ML Software Toolkit (MAGE-ML stk)  to read MAGE-ML [11, 12] files. The MAGE-ML stk is itself based on the Xerces XML parser . The majority of the viewer's interfaces were built using Netbeans Mantissa . Rather than reinventing the wheel, we implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. This extended table class is called FishEyeTable and contains methods to provide focus. The variation of row height and font sizes is handled by the FishEyeTableCellRenderer class, which extends the DefaultTableCellRenderer class of the Java Swing library.
Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. In our Fisheye Viewer, the user can click on any row to bring the focus to that row. The focus row is shown larger than all other rows and its text is larger and in bold face. The height and font size of other rows is determined by their distance from the focus row, with row height and font size becoming progressively smaller as the distance from the focus row grows. The height of the focus row is determined when the focus method of the FishEyeTable class is called by a selection listener. Once the row height is determined, the cells of the row are rendered using a corresponding font size by the FishEyeTableCellRenderer class. The heights and text fonts of the neighboring rows are controlled by a ListListener that observes changes in the user's selection. A separate listener is required for the neighboring rows because of the protocol used by the classes of the JTable package in Swing. Finally, because the tables are too large to show in their entirety on a single screen, a scroll pane allows the user to scroll the table up or down to see the hidden rows.
For gene expression data tables, it is useful to have both column and row headings, but JTable and the default table model only support column headings. In a non-fisheye table, a list could be used as the row header. However, list item heights are fixed, so there would be no way to vary the heights of the rows based on the focus. So instead, we used a second E-table just for the row headers. Then, we extended the ListListener class to be able can notify multiple tables and attached it to the FishEyeTable object. So, when a row is focused in the data table, the row header is also notified which to set its focus on the corresponding row header.
The result of a pilot user study
User feedback was quite positive, with a mean overall reaction score of 8.2 (on a scale from zero to nine). The QUIS scale with the highest score was the system information scale (8.8), which includes 1) use of terms throughout system, 2) terminology related to task, 3) prompts for input, 4) computer informs about its progress, and 5) error messages. User 2 and User 3 gave all nines, the highest scores to each item, while Users 1 and 5 gave many scores of 9. User 4, however, gave only 7.1 on average to the software and he also questioned the usefulness of the software. In particular, he indicated that he did not think this software organized information well on the screen (1 from 0–9 level) and did not designed for all levels of users (2 of 0–9 scale). User 4's overall reaction score was only 6.0. The inconsistency of the user responses in the pilot study indicated that further improvements to the software will be needed for it to gain wide acceptance. However, the enthusiastic response of some users indicates that the system shows real promise.
The design of the E-table is generic for large tabular data, so it could also be integrated into other biological data warehouses to preview the data before constructing complex queries or to confirm the results after the queries.
The sizes of MAGE-ML files vary greatly: some are only several megabytes while other can require hundreds of megabytes. If the user attempts to view a large MAGE-ML file will cause an out of memory error. This is the nature of MAGE-ML documents: they are simply huge. We have tested the application on a desktop PC with 512 MB of RAM running Windows XP and Linux, and on a Tablet PC with 512 MB of RAM running Windows XP Tablet PC edition. The sample MAGE-ML files used were around 1 – 3 MB in size.
A new MAGE data viewer using fisheye distortion technique was successfully developed. The viewer can be used to view most types of data elements in the MAGE-ML format. This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table. The software package is made freely available for the scientific community via the project web site .
Availability and requirements
Project name: Fisheye Viewer for Microarray-based Gene Expression
Project home page: http://polaris.imt.uwm.edu:7777/fisheye
Operating system(s): Platform independent
Programming language: Java
Other requirements: Java JRE 1.4, 512 MB of RAM or more
Any restrictions to use by non-academics: N/A
On-going research in MW's laboratory is partially supported by the Helen Bader Foundation in Milwaukee, WI. EVM is partially supported by NSF Grant CNS-0420312.
- Schena M, Shalon D, Davis RW, Brown PO: Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 1995, 270(5235):467–470.View ArticlePubMedGoogle Scholar
- Lockhart DJ, Dong H, Byrne MC, Follettie MT, Gallo MV, Chee MS, Mittmann M, Wang C, Kobayashi M, Horton H, et al.: Expression monitoring by hybridization to high-density oligonucleotide arrays. Nat Biotechnol 1996, 14(13):1675–1680.View ArticlePubMedGoogle Scholar
- Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, Aach J, Ansorge W, Ball CA, Causton HC, et al.: Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat Genet 2001, 29(4):365–371.View ArticlePubMedGoogle Scholar
- Spellman PT, Miller M, Stewart J, Troup C, Sarkans U, Chervitz S, Bernhart D, Sherlock G, Ball C, Lepage M, Swiatek M, Marks WL, Goncalves J, Markel S, Iordan D, Shojatalab M, Pizarro A, White J, Hubley R, Deutsch E, Senger M, Aronow BJ, Robinson A, Bassett D, Stoeckert CJ Jr, Brazma A: Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biol 2002, 3(9):RESEARCH0046. Epub 2002 Aug 23. 2002 Aug 23 Epub 2002 Aug 23. 2002 Aug 23PubMed CentralView ArticlePubMedGoogle Scholar
- FlyMine, [homepage on the Internet] [cited 2005 Jan 26][http://www.flymine.org/]
- Spence R, Apperley M: Data Base Navigation: an Office Environment for the Professional. Behavior & Information Technology 1982, 1(1):43–54.View ArticleGoogle Scholar
- Furnas GW: The Fisheye Calendar System. (Report N. TM-ARH-020558). Bellcore, Morristown, NJ 1991.Google Scholar
- Rao R, Card SK: The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus+Context Visualization for Tabular Information. Proceedings of CHI'94 1994, 318–322.View ArticleGoogle Scholar
- Sarkar M, Snibbe S, Reiss S: Stretching the rubber sheet: a metaphor for visualizing large structure on small screen. Proceedings of the ACM Symposium on User Interface Software and Technology 1993.Google Scholar
- MAGE-stk: the MAGE Software Toolkit, [homepage on the Internet] [cited 2006 May 1][http://mged.sourceforge.net/software/MAGEstk.php]
- MAGE-ML formal specification, [homepage on the Internet] [cited 2006 May 1][http://www.omg.org/docs/formal/03–02–03.pdf]
- MAGE-ML DTD, [homepage on the Internet] [cited 2006 May 1][http://www.omg.org/docs/dtc/02–09–03.dtd]
- Xerces XML Parser, [homepage on the Internet] [cited 2006 May 1][http://xerces.apache.org/]
- Netbeans IDE, [homepage on the Internet] [cited 2006 May 1][http://www.netbeans.org]
- MAGE-ML Viewer, [homepage on the Internet] [cited 2006 May 1][http://polaris.imt.uwm.edu:7777/fisheye]
- Slaughter L, Norman KL, Shneiderman B: Assessing users' subjective satisfaction with the Information System for Youth Services (ISYS). VA Tech Proc of Third Annual Mid-Atlantic Human Factors Conference (Blacksburg, VA, March 26–28, 1995) 164–170.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.