Using partially ordered sets to represent and predict true patterns of gene response to treatments
© Vo and Phan; licensee BioMed Central Ltd. 2013
Published: 22 October 2013
Advances in biotechnology have empowered high-throughput measurement of gene expression levels for tens of thousands of genes simultaneously. This means that one sample size must be used for all genes in most experimental designs [1, 2], which implies that patterns of response of highly variantly expressed genes might not be measured accurately. Response patterns of gene expression data with multiple treatments have been characterized using post hoc pairwise comparisons by several researchers [3, 4]. Nevertheless, these researchers did not address how to cope with highly variantly expressed genes with inaccurate patterns due to having too few experimental samples.
We show that dependencies of pairwise comparison outcomes in post hoc calculations can be exploited to infer true response patterns of genes with inaccurate patterns due to having too few experimental samples.
Characterizing such response patterns as partially ordered sets, we show that linearly orderable patterns are more likely true patterns and those that are not linearly orderable cannot be true patterns. We propose a strategy to predict most likely linearly orderable extensions of such patterns. Using microarray data of rats' liver cells, we showed that this approach yielded more and better functionally enriched gene lists than a conventional approach.
This approach opens up opportunities to design cost-effective experiments, in which only a conservatively large sample size is needed to collect expression levels of almost all genes. For most genes, such a sample size is sufficient. For highly variantly expressed genes, our method can help infer true response patterns.
- Lee ML, Kuo FC, Whitmore GA, Sklar J: Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridization. Proc Natl Acad Sci. 2000, 97 (18): 9834-9839. 10.1073/pnas.97.18.9834.PubMed CentralView ArticlePubMedGoogle Scholar
- Glaus P, Honkela A, Rattray M: Identifying differentially expressed transcripts from RNA-seq data with biological variation. Bioinformatics. 2012, 28 (13): 1721-1728. 10.1093/bioinformatics/bts260.PubMed CentralView ArticlePubMedGoogle Scholar
- Sutter T, He X, Dimitrov P, Xu L, Narasimhan G, George EO, Sutter CH, Grubbs C, Savory R, Stephan-Gueldner M, Kreder D, Taylor MJ, Lubet R, Patterson TA, Kensler TW: Multiple comparisons model-based clustering and ternary pattern tree numerical display of gene response to treatment: procedure and application to the preclinical evaluation of chemopreventive agents. Mol Cancer Ther. 2002, 1: 1283-1292.PubMedGoogle Scholar
- Hulshizer R, Blalock EM: Post hoc pattern matching: assigning significance to statistically defined expression patterns in single channel microarray data. BMC Bioinformatics. 2007, 8: 240-10.1186/1471-2105-8-240.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.