 Research article
 Open Access
A methodology for the analysis of differential coexpression across the human lifespan
Jesse Gillis^{1, 2} and Paul Pavlidis^{1, 2}
https://doi.org/10.1186/1471-2105-10-306
© Gillis and Pavlidis; licensee BioMed Central Ltd. 2009
Received: 30 April 2009
Accepted: 22 September 2009
Published: 22 September 2009
Abstract
Background
Differential coexpression is a change in coexpression between genes that may reflect 'rewiring' of transcriptional networks. It has previously been hypothesized that such changes might be occurring over time in the lifespan of an organism. While both coexpression and differential expression of genes have been previously studied in life stage change or aging, differential coexpression has not. Generalizing differential coexpression analysis to many time points presents a methodological challenge. Here we introduce a method for analyzing changes in coexpression across multiple ordered groups (e.g., over time) and extensively test its validity and usefulness.
Results
Our method is based on the use of the Haar basis set to efficiently represent changes in coexpression at multiple time scales, and thus represents a principled and generalizable extension of the idea of differential coexpression to life stage data. We used published microarray studies categorized by age to test the methodology. We validated the methodology by testing our ability to reconstruct Gene Ontology (GO) categories using our measure of differential coexpression and compared this result to using coexpression alone. Our method allows significant improvement in characterizing these groups of genes. Further, we examine the statistical properties of our measure of differential coexpression and establish that the results are significant both statistically and by an improvement in semantic similarity. In addition, we found that our method finds more significant changes in gene relationships compared to several other methods of expressing temporal relationships between genes, such as coexpression over time.
Conclusion
Differential coexpression over age generates significant and biologically relevant information about the genes producing it. Our Haar basis methodology for determining age-related differential coexpression performs better than other tested methods. The Haar basis set also lends itself to ready interpretation in terms of both evolutionary and physiological mechanisms of aging and can be seen as a natural generalization of two-category differential coexpression.
Contact: paul@bioinformatics.ubc.ca
Background
Differential coexpression has previously been studied primarily in the context of changes in coexpression between two contrasting sample groups such as tumors and normal tissue [3–5]. However, no method to handle multiple ordered groups, such as over age or time, has been proposed.
The current study was motivated by our interest in studying human life stage and aging. For our purposes, we take 'life stage' to include both developmental changes and normal senescent changes. In searching for biomarkers for life stage and aging it has been usual to look for differential expression over time [6–8], sometimes in conjunction with coexpression [9], but not differential coexpression over age. The equivalent task in differential coexpression analysis would look for differences in coexpression across time. Previous expression profiling studies have demonstrated that the expression patterns of age-regulated genes are indicators for a functional measure of aging in humans [10, 11]. Because many functional changes occur over the lifespan and biological function often involves the interactions of many genes, we hypothesize that life stage is associated with differential coexpression, that is, with many changes in functional relationships or rewiring of transcriptional networks.
This motivates three questions:
1) What is an appropriate, generalizable and principled basis set corresponding to differential coexpression over time?
2) How feasible is the repurposing of pre-existing data for a life stage meta-analysis of differential coexpression and how statistically confident can we be in the results?
3) Does differential coexpression provide novel information beyond coexpression or differential expression alone?
In reference to the first question, it is important to consider what a basis set for differential coexpression would constitute. Coexpression over time is taken to be the correlation between genes at each point in time. Differential coexpression, then, is the difference in coexpression over time. If this difference is taken by linear combination of the original data and conforms to other desirable properties (e.g., spans the original data), it will constitute a basis set for coexpression. Because differential coexpression is defined as the change in coexpression, ideally one of the basis set vectors will represent coexpression, while the rest represent differential coexpression. Another important issue is how to partition lifespan into stages that can be compared, and how to compare those stages. One approach to analyzing changes would be to take a derivative of gene coexpression across time, describing the differential coexpression between each age group and the next. However, the derivative comparison will fail to detect gradual changes which can only be characterized over many groups in the long term. Another possible approach to characterize multiple time points would be to compare every age group to every other, but this is highly redundant and ignores the temporal relationship between data points. We hypothesize that changes might occur over both short scales and long scales.
Processes related to aging have been hypothesized to occur over a variety of timescales up to and including linking development and old age [13]. Thus, it may be a desirable property for changes in coexpression over age to be characterized both in their rapid (short scale) change and their gradual (long scale) change.
Thus a good method for differential coexpression should have the following properties:
1) It would characterize the change in coexpression at each time.
2) It would characterize the change in coexpression over functionally relevant timescales.
3) It would form a basis set for the temporal data.
For two groups of data, this reduces to conventional differential coexpression (i.e. a difference between gene correlations).
In the more general case, these three properties suggest we require a transformation incorporating both changes in scale and timing. In order to reduce to conventional two-group differential coexpression, our basis transform of age-specific coexpression data should be seen as taking the difference between groups. This corresponds to the Haar basis set [14], which consists of the difference between adjacent values, adjacent pairs of values, adjacent quartets of values, and so on, in addition to the overall mean. The use of the Haar transform (or D2 wavelet) follows from our belief that different scales of activity will be present. This can be compared to an alternative hypothesis that the relevant timescales are not lifelong, so that only instantaneous change, i.e., the discrete derivative over time, need be considered. This too would meet the criteria laid out above (although not the criterion of an orthogonal basis set), under the assumption that functional timescales are not lifelong (each interval can be explained best by looking at the previous interval). We use coarse age bins to validate our approach, but any of the possible basis sets mentioned (Haar, derivative, direct temporal coefficients) can be generalized to any resolution or length of time, and all are particularly appropriate to temporal (or otherwise ordered) data.
In this paper, in order to validate our choice of the Haar basis as an answer to the first question we posed, we use semantic similarity and statistical independence to show the relative performance of reasonable differential coexpression basis sets. In order to answer the second question posed, we assess the data with respect to a Haar coefficient null distribution and show that repurposing by age produces significant results. Lastly, in answer to our third question, we demonstrate that differential coexpression captures functionally relevant information not identified using coexpression alone. Because life stage is a process so strongly characterized by changes in function, these techniques developed to characterize life stage may also shed light on how life stage changes function and thus how dysfunction occurs.
Methods
Data grouping and Standardization
The Gemma ID number, experiment name, organism part, array design and age category for the experiments are listed in each column.
Experiments used for analysis.  

Gemma ID  Name  Organism part  Array Design  Age category 
622  GSE8586  Umbilical cord  GPL570  Prenatal 
726  GSE9164  Foreskin cells  GPL5876  Prenatal 
233  GSE1397  Brain, heart  GPL96  Prenatal 
215  khatua-astrocytoma  Brain  GPL91  Child/young adult 
218  pomeroy-embryonal  Brain, kidney  GPL80  Child/young adult 
555  GSE5808  Blood cell  GPL96  Child/young adult 
585  GSE7586  Placenta  GPL570  Adult 
178  GSE80  Muscle  GPL91  Adult 
633  GSE8607  Testis  GPL91  Adult 
275  GSE4757  Brain  GPL570  Older adult 
721  GSE8919  Brain  GPL2700  Older adult 
263  GSE5281  Brain  GPL570  Older adult 
To allow the investigation of differential expression over age, we computed a relative rank-based measure of expression level for each gene. Each gene's expression level was averaged across the samples in each study, converted into a rank within the study, and then averaged within each age group.
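The rank-based measure described above can be sketched as follows. This is a minimal illustration only: the function names, the dictionary layout, and the choice of a relative rank in (0, 1] are assumptions made for the example, and ties are not given special treatment.

```python
from statistics import mean

def rank_within_study(mean_expr):
    """Map each gene's mean expression in one study to a relative rank in (0, 1]."""
    ordered = sorted(mean_expr, key=mean_expr.get)  # lowest expression first
    n = len(ordered)
    return {gene: (i + 1) / n for i, gene in enumerate(ordered)}

def age_group_ranks(studies):
    """studies: list of {gene: [expression per sample]} for one age group.

    Hypothetical layout; returns the rank of each gene averaged over studies.
    """
    per_study = [
        rank_within_study({g: mean(v) for g, v in study.items()})
        for study in studies
    ]
    genes = set().union(*per_study)
    # Average only over the studies in which the gene was measured.
    return {g: mean(r[g] for r in per_study if g in r) for g in genes}
```

Averaging only over the studies that measured a gene is one way to accommodate missing data; other choices are possible.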
Haar transform
Over our four age groups (prenatal, child/young adult, adult, older adult), the Haar basis consists of four values:
1) The averaged correlation across genes across all four time points: (1/2, 1/2, 1/2, 1/2)
2) The averaged correlation difference between pre-adult and post-adult: (1/2, 1/2, -1/2, -1/2)
3) The averaged correlation difference between prenatal and child/young adult: (1/sqrt(2), -1/sqrt(2), 0, 0)
4) The averaged correlation difference between adult and older adult: (0, 0, 1/sqrt(2), -1/sqrt(2))
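As an illustration, the four-point Haar transform of a gene pair's correlation across the four age groups can be written out directly. The sketch below assumes the sign convention "earlier group minus later group" for the three difference vectors; the names `HAAR` and `haar_coefficients` are hypothetical.

```python
import math

S = 1 / math.sqrt(2)

# Rows of the orthonormal four-point Haar basis (sign convention assumed).
HAAR = [
    [0.5, 0.5, 0.5, 0.5],    # coexpression across all four age groups
    [0.5, 0.5, -0.5, -0.5],  # lifelong change: first two groups vs. last two
    [S, -S, 0.0, 0.0],       # early change: prenatal vs. child/young adult
    [0.0, 0.0, S, -S],       # late change: adult vs. older adult
]

def haar_coefficients(correlations):
    """Project four age-ordered correlation values onto the Haar basis."""
    return [sum(w * r for w, r in zip(row, correlations)) for row in HAAR]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# A pair that is coexpressed early in life and much less so later:
coeffs = haar_coefficients([0.8, 0.8, 0.2, 0.2])
```

For this example only the first two coefficients are non-zero: the pair shows overall coexpression and a lifelong change, but no change within the early or the late half of the lifespan. With two groups instead of four, the same construction reduces to a scaled difference of two correlations.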
The first coefficient in the discrete derivative remained the first value in time, while the remaining coefficients were calculated by taking a discrete derivative, or difference between successive pairs of values across time. This set will be referred to as the derivative set. Conceptually this overlaps with the direct temporal basis first coefficient and the Haar third and fourth coefficients, but lacks any scale variation, such as that found in the Haar second coefficient.
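For comparison, the derivative set can be sketched in a few lines. The direction of each difference (later value minus earlier value) is an assumption, as are the names used.

```python
def derivative_coefficients(values):
    """First value in time, followed by successive differences across time."""
    return [values[0]] + [b - a for a, b in zip(values, values[1:])]

# The equivalent basis vectors for four time points; unlike the Haar vectors
# they are not mutually orthogonal (adjacent rows share a time point).
DERIVATIVE = [
    [1, 0, 0, 0],
    [-1, 1, 0, 0],
    [0, -1, 1, 0],
    [0, 0, -1, 1],
]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))
```

Applied to the correlations (0.8, 0.8, 0.2, 0.2), the only non-zero difference is the drop between the second and third groups; no single coefficient summarizes the change at the lifelong scale.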
We performed the Haar transform both on the averaged correlation between pairs of genes (as a time series across our age groups) as well as on the averaged expression level ranks of individual genes. To ensure that the differential coexpression findings may not be explained by simpler underlying changes in expression level, genes exhibiting significant differential expression values at a given time and scale (top 5% rank change) were removed. If a gene exhibited differential expression for one coefficient, other coefficient values for the same gene which were not exhibiting differential expression were retained. This procedure resulted in the removal of 448 genes for each coefficient, on average. (There was slight variation in the number of genes removed due to variation in missing data across experiments). This allows our analysis to focus on genes which are relatively stable in expression level, but which might exhibit changes in coexpression relationships with other genes. In pure biological discovery, as opposed to this validation of the utility of differential coexpression and this methodology specifically, such a step may not be necessary.
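The per-coefficient filtering step might look like the following sketch. It assumes that "top 5% rank change" is judged by the absolute magnitude of the expression-level coefficient, which the text does not state explicitly; the function names are hypothetical.

```python
def flag_top_fraction(changes, fraction=0.05):
    """changes: {gene: coefficient}. Return the genes in the top `fraction` by |change|."""
    k = round(len(changes) * fraction)
    ranked = sorted(changes, key=lambda g: abs(changes[g]), reverse=True)
    return set(ranked[:k])

def retained_genes(coeffs_by_gene, fraction=0.05):
    """coeffs_by_gene: {gene: [c1, c2, ...]}. Return one retained gene set per coefficient.

    A gene flagged for one coefficient is still retained for the others.
    """
    n_coeffs = len(next(iter(coeffs_by_gene.values())))
    kept = []
    for j in range(n_coeffs):
        column = {g: c[j] for g, c in coeffs_by_gene.items()}
        kept.append(set(coeffs_by_gene) - flag_top_fraction(column, fraction))
    return kept
```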
Statistical Assessment
A final complication is that there are missing data, as not all data sets have data for all genes (and thus gene pairs). Failure to address this would cause us to underestimate false discovery rates. We therefore computed null distributions specific to the combination of data sets in which any given gene pair was measured. Fortunately, the number of combinations which must be considered is small, as many combinations are not present (e.g., missing in data sets 1, 5 and 7 would be one combination, missing only in data set 4 would be another combination, etc.). Fewer than 10 combinations of missing data accounted for 95% of all data combinations, although the true total number of combinations of missing data was as high as 88 across all datasets. The methods code is available in the supplementary MATLAB methods data (see additional file 1: Supplementary.rar) and at http://www.chibi.ubc.ca/diffExAge.
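To make the bookkeeping concrete, gene pairs can be grouped by their pattern of missing data sets, so that one null distribution is computed per pattern. The sketch below is not the supplementary MATLAB code; the data layout and function names are assumptions.

```python
from collections import Counter

def missing_pattern(present_in, n_datasets):
    """Return the tuple of data set indices in which a gene pair is missing."""
    return tuple(i for i in range(n_datasets) if i not in present_in)

def patterns_covering(pair_presence, n_datasets, coverage=0.95):
    """pair_presence: {gene_pair: set of data set indices where it was measured}.

    Returns the most frequent missing-data patterns that together account
    for at least `coverage` of all gene pairs.
    """
    counts = Counter(missing_pattern(p, n_datasets) for p in pair_presence.values())
    total = sum(counts.values())
    covered, kept = 0, []
    for pattern, n in counts.most_common():
        kept.append(pattern)
        covered += n
        if covered / total >= coverage:
            break
    return kept
```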
Semantic similarity validation
Semantic similarity was used to assess the quality of links generated using different basis sets [16]. The number of overlapping Gene Ontology (GO) terms for each pair of genes was calculated. The change in this semantic similarity was calculated as a function of an increasing threshold on the coefficient score for each of our basis sets; that is, the semantic similarity, y, of the top x% of high value coefficients (representing a large positive change in correlation) was calculated as a function of x. Links were generated across all coefficients for each basis set.
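A minimal sketch of this curve is given below, taking semantic similarity to be the number of shared GO terms as described. The threshold `x` is expressed as a fraction rather than a percentage, and the data layout and function names are assumptions made for the example.

```python
def go_overlap(terms_a, terms_b):
    """Semantic similarity as the number of GO terms shared by two genes."""
    return len(terms_a & terms_b)

def similarity_at_top(pair_scores, go_terms, x):
    """Mean GO overlap among the top fraction x of gene pairs by coefficient.

    pair_scores: {(gene_a, gene_b): coefficient}; go_terms: {gene: set of GO IDs}.
    """
    ranked = sorted(pair_scores, key=pair_scores.get, reverse=True)
    k = max(1, round(len(ranked) * x))
    top = ranked[:k]
    return sum(go_overlap(go_terms[a], go_terms[b]) for a, b in top) / k
```

Evaluating `similarity_at_top` over a grid of `x` values traces out y as a function of x; a basis set whose high-scoring links are functionally meaningful should give larger y at small x.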
Gene function prediction validation
All GO groups of genes for which 10-30 genes had differential coexpression values as calculated in section 2.3 were assembled using the Gemma web services (see supplementary data at http://www.chibi.ubc.ca/diffExAge). This size range was chosen for computational tractability of cross-validation, and to avoid using GO categories which overlap extensively. This generated 648 separate GO groups with 91% coverage of the genes for which a GO category is assigned.
We used these GO groups for validation of the differential coexpression results. By an analogy with the use of coexpression to predict gene function, we propose that each gene within a given GO set might have a characteristic differential coexpression relationship with genes inside the set and a characteristic relationship with genes outside the set. For each gene inside the set, gene A, the respective distributions of differential coexpression coefficients can be calculated. An arbitrary gene, gene B, outside the set has differential coexpression values with gene A that may also be calculated. We may then ask if gene B's relationship with gene A resembles gene A's relationship with other genes inside the set or with genes outside the set. As a control, the same calculation was performed using the coefficient representing coexpression.
A leave-one-out methodology was employed in which one gene was removed from the GO group to form a new set. Then for each gene now in the set, the rank score it possesses with each other gene in the set was calculated, and the genes outside the set were ranked by how close their own score is to the in-set score. This procedure was then repeated by rotating through each gene in the set and each coefficient for differential coexpression. Ranks across coefficients were averaged and values re-ranked. Then, this entire method was repeated by rotating through each gene originally in the GO group (leave-one-out methodology). Receiver operating characteristic (ROC) curves were then calculated from these data.
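The final step, summarizing how highly the held-out genes are ranked, can be illustrated by computing the area under the ROC curve from a list of scores and labels. This sketch covers only that step, not the full leave-one-out procedure; it assumes higher scores mean a closer match to the in-set profile, and the function name is hypothetical.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve.

    Computed as the probability that a positive (held-out GO member) outscores
    a negative (gene outside the GO group); ties count as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

An area of 0.5 corresponds to a ranking no better than chance, and 1.0 to held-out members always being ranked above non-members.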
ErmineJ [17] was used to perform overrepresentation analysis (under the ROC setting) for each gene for each coefficient. The full gene set was used for the overrepresentation analysis (i.e., not limited to 10-30 genes as in the ROC analysis). Multiple test corrected p values less than 0.001 were retained for each gene. ErmineJ uses Benjamini-Hochberg correction [18].
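For readers unfamiliar with the correction, the standard Benjamini-Hochberg step-up adjustment is sketched below. This is a generic implementation for illustration and is not taken from ErmineJ.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices, smallest p first
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

Retaining adjusted values below 0.001 then corresponds to the threshold described above.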
Results
To test our approach, we analyzed differential coexpression across human lifespan in a corpus of 12 expression studies (579 individual microarrays in total). This produced 4 symmetric matrices of 18534 by 18534 genes with potential Haar coefficient values, consisting of coexpression (the first coefficient) or differential coexpression over different time (the other three coefficients). Discounting missing data, this yields 320,201,152 data points.
For each time period listed, an overrepresentation analysis was performed for each gene's coexpression or change in coexpression.
The top 5 Gene Ontology (GO) groups for each age range.  

Coexpression  Lifelong change  Early change  Late change 
translational elongation (GO:0006414)  Glycolysis (GO:0006096)  hormone activity (GO:0005179)  ATP metabolic process (GO:0046034) 
Mitochondrial membrane part (GO:0044455)  aerobic respiration (GO:0009060)  muscle system process (GO:0003012)  ribonucleoside triphosphate metabolic process (GO:0009199) 
ribosomal subunit (GO:0033279)  cellular respiration (GO:0045333)  Hemostasis (GO:0007599)  nucleotide biosynthetic process (GO:0009165) 
regulation of ubiquitin-protein ligase activity (GO:0051438)  glucose metabolic process (GO:0006006)  secretory granule (GO:0030141)  monovalent inorganic cation transmembrane transporter activity (GO:0015077) 
proteasome complex (GO:0000502)  pigment granule (GO:0048770)  calcium ion homeostasis (GO:0055074)  purine ribonucleotide biosynthetic process (GO:0009152) 
In the following sections, we analyze the differential coexpression data set in several other ways. First, we discuss how statistical significance of differential coexpression can be determined. Second, we compare our Haar transformation approach to one based on derivatives of coexpression changes or other coexpression measures. Finally, we explore whether differential coexpression is relevant to gene function using an analysis of Gene Ontology categories.
Statistical analysis
Because this is a large and complex data set, we stress that important trends may exist outside of examining only the most statistically significant cases. Our validation experiments below use all the data; however, we have also constructed an adjacency matrix consisting of the most significant gene pairs across all coefficients (false discovery rate <0.01). This produced 367161 significant unique gene-pair relationships, available as supplementary data.
Comparison to other basis sets
As mentioned, an alternative basis is the discrete derivative of coexpression, starting with the first time point's expression level and then each subsequent coefficient reflecting change from the previous value. It might also be reasonable to question whether changes in coexpression are helpful to consider at all, rather than simply independently observing each time point for coexpression. In that case, the natural basis set would be the direct temporal basis set (age groups 1-4) of coexpression data directly. Each pair of genes has 4 coefficients associated with it. There is little point to this if significance in one coefficient implies significance in another.
Supplementary Figure 1 (see additional file 2: supfig1_sup.eps) shows the correlation of basis coefficients with one another within each basis set. Strongly correlated coefficients will diminish the chances of observing any change with age. The lack of orthogonality in the derivative basis set makes it the worst performing set with respect to independence (mean correlation coefficient magnitude of 0.27). By this standard, the Haar basis set performs better (correlation of 0.08) than the derivative basis set. The Haar coefficients are also less correlated than the direct temporal basis set (correlation of 0.16). The corrected Haar basis set performs better still (correlation of 0.03). Just adding noise to our data would produce a similar effect so it is important to verify that this increase in independence does not cause a decrease in functional links.
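The independence measure quoted above, the mean magnitude of the correlation between coefficients within a basis set, can be sketched as follows. Each input column is assumed to hold one coefficient evaluated over all gene pairs; the function names are hypothetical.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def mean_abs_correlation(columns):
    """Mean |correlation| over all pairs of coefficient columns in a basis set."""
    pairs = list(combinations(columns, 2))
    return sum(abs(pearson(x, y)) for x, y in pairs) / len(pairs)
```

A value near zero indicates that the coefficients carry largely independent information about each gene pair.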
Gene function is reflected in differential coexpression patterns
Discussion
Our results suggest that the Haar basis set view of differential coexpression is a useful tool for capturing functional relationships between genes as their expression changes over ordered sets of conditions. Using this method, we provided evidence that there is substantial differential coexpression associated with life stage. In addition to its good performance compared to a derivative approach, the Haar basis has an additional feature making it attractive for meta-analysis, in that individual studies with samples containing multiple 'ages' can be combined and analyzed at multiple scales. For example, if a study covered the age groups prenatal and child/young adult, it could be included in both, ensuring it contributed to only the first and second Haar coefficients, as appropriate for the scale at which the study was performed. It should be noted that since its introduction, the Haar transform has become more widely known as the first wavelet transform. Our use of the wavelet transform is unconventional because we have only four time points, in contrast to more typical applications where temporal resolution is much finer. However, in this case, the specific wavelet transformation is a useful basis set with a biological underpinning and a convenient generalization. Thus the wavelet method could equally well be applied to studies specifically geared toward the analysis of data with different or finer temporal resolution, or to other ordered conditions.
Differential coexpression over time using the Haar basis
The relatively poor performance of the temporal coefficient method was not surprising, since lifelong coexpression is not an independent coefficient in this basis set and will dominate at each time point. That is, if a gene pair is highly coexpressed, even if it is also highly variable over age (yielding large coefficients in the other two bases), it is still quite likely to be considered highly coexpressed at all ages. The inferior performance of the derivative basis suggests that longer scale dynamics are relevant to expression changes with age. This would seem consistent with the proposal of Barker et al. [22] that fetal programming can play a significant role in determining factors affecting longevity [23].
One possible concern with our study is that we have mixed data sets from various tissues. The precise tissue for prenatal vs. adult, for example, may differ. Likewise there are study parameters (such as population type or array design) that might vary with age. It could then be argued that the differential coexpression across age that we see is simply a proxy measure for differential coexpression between tissue types. However, this is not consistent with the statistical assessment we have performed. It is quite likely that controlling for tissue (and other experimental parameters) would yield even better results, but even absent that, grouping by age produces a clear statistical effect compared to the null distribution, relative to randomized groupings compared to the null distribution. Previous studies (e.g., [6]) have shown tissue-dependent aging signatures which may well be excluded from the present analysis. However, we find differential coexpression across tissue to have a less pronounced signature than across aging. To bolster this finding, we examined the tissue-dependent coexpression signature in a wholly independent dataset, the full mouse experiment sets available in Gemma (492 datasets). The coexpression signatures in all brain datasets are very similar to the coexpression in all datasets excluding brain tissue using Gemma's significant coexpression list [12]; in particular, the no-brain set (sparsity of 2.5%) obtains 98% of the significantly coexpressed gene pairs in the brain-only set of significantly coexpressed gene pairs. This suggests there is a common coexpression signature which may be differentially present over age, but does not imply there is not differential coexpression between tissues. A particular concern might be that tissue heterogeneities could have an effect on the last of our Haar coefficients (all brain data), but this does not appear to be the case.
Comparing brain to no-brain data across all datasets produced false discovery rates much higher than across aging, suggesting it is not a significant contributory factor in this case, where brain (widespread expression) is mixed with a variety of tissues without strong homogeneities. Another consideration is that because differentially expressed genes are easily differentially coexpressed, our filtering by differential expression removed prominent cases of differential coexpression, and may well have contributed to minimizing tissue dependencies in our analyses. However, those trends are better considered as cases of differential expression (a much more tractable phenomenon) than differential coexpression.
Another advantage of the Haar method is that studies originally performed over a variety of ages can be more effectively repurposed, since including a study across multiple ages tends to remove any of its effects that do not operate at the scale of the original data (that is, a study included in both older adult and adult will tend to contribute no late-in-life-only differential coexpression effects).
Our results make the point that in general, the choice of an appropriate basis set is important to extracting information from the data. A contrasting approach would be to choose a basis post hoc, as is effectively done in any dimension reduction. This presents a number of problems. First, it will vary from dataset to dataset and introduce new normalization difficulties. The variation from dataset to dataset would also reduce the general applicability of any results since any past findings would have to be reinterpreted in the context of whatever basis set is calculated for the new work. In addition, there would be no reason to see the new basis set as a form of differential coexpression precisely since it is unlikely that a single component would end up wholly representing coexpression (thereby removing it from the others, as was the case with the Haar basis). Finally, a principled transformation of the data can be geared to offer value as an interpretive tool, as opposed to a purely methodological tool.
A reasonable concern for our statistical analysis might be that in considering only consistency of rank across experiments, we have thrown away useful information in the form of the correlation distributions themselves. In that case, a natural alternative to the nonparametric statistical analysis we have performed would be to consider the distribution of correlation values themselves, and then combine those p values and determine the false discovery rate. This could be performed by using Fisher's transformation [24] to test the significance of correlation values and then combining datasets using a meta-analysis technique such as Fisher's method [25]. Such an analysis would be erroneous for a number of reasons. Most importantly, it would be dominated by heterogeneity unrelated to the variable of interest, because significance in any one experiment will tend to dominate (the distribution of significance values is usually very heavy-tailed), and individual experiments have many heterogeneities unrelated to age. More reasonably, one could construct null distributions by computing a reasonable sample of distributions with randomly labeled datasets. As we show in Figure 5, this is quite close to the approach we took (absent weighting), but would become very cumbersome for even intermediate numbers of datasets (remembering that each combination of missing data requires a different null distribution).
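To illustrate why a single experiment can dominate such a combination, the parametric alternative discussed above can be sketched with Fisher's z transformation followed by Fisher's method. This is the approach argued against here, shown only for illustration; the function names are hypothetical, and the closed-form chi-square tail used below holds because the degrees of freedom (twice the number of p values) are even.

```python
from math import atanh, erfc, exp, factorial, log, sqrt

def fisher_z_pvalue(r, n):
    """Two-sided p value for a correlation r from n samples via Fisher's z."""
    z = atanh(r) * sqrt(n - 3)  # approximately standard normal under the null
    return erfc(abs(z) / sqrt(2))

def fisher_combined_pvalue(p_values):
    """Fisher's method: X = -2 * sum(ln p) follows chi-square with 2k d.o.f."""
    k = len(p_values)
    half_x = -sum(log(p) for p in p_values)  # X / 2
    # Upper tail of a chi-square distribution with 2k degrees of freedom.
    return exp(-half_x) * sum(half_x ** i / factorial(i) for i in range(k))

# One extreme experiment among otherwise unremarkable ones:
combined = fisher_combined_pvalue([1e-10, 0.5, 0.5, 0.5])
```

In this example the combined p value remains very small although three of the four experiments show no evidence at all, which is the sensitivity to individual experiments described above.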
Biological interpretation of the Haar approach
The three gene pairs exhibiting the largest change in correlation with one another over lifespan for each differential coexpression coefficient are shown.
Genes differentially coexpressed over age.  

Lifestage  Gene 1  Gene 2  KEGG relationships 
Lifelong change (+)  platelet-derived growth factor receptor, beta polypeptide (PDGFRB)*  SHC (Src homology 2 domain containing) transforming protein 1 (SHC1)*  Focal adhesion (hsa04510) Glioma (hsa05214) 
Lifelong change (-)  prostaglandin E receptor 3 (subtype EP3) (PTGER3)  abl-interactor 1 (ABI1)  
Lifelong change (+)  hematopoietic cell-specific Lyn substrate 1 (HCLS1)  CD14 molecule (CD14)*  Pathogenic Escherichia coli infection (hsa05130) 
Early life change (-)  eukaryotic translation elongation factor 1 alpha 1 (EEF1A1)*  exportin 5 (XPO5)  
Early life change (-)  glutamate receptor, metabotropic 4 (GRM4)  calcium channel, voltage-dependent, P/Q type, alpha 1A subunit (CACNA1A)*  Taste transduction (hsa04742) 
Early life change (-)  6-pyruvoyltetrahydropterin synthase (PTS)*  thioredoxin domain containing 9 (TXNDC9)  
Late life change (+)  Janus kinase 1 (a protein tyrosine kinase) (JAK1)  mitogen-activated protein kinase 9 (MAPK9)*  Pancreatic cancer (hsa05212) 
Late life change (-)  progesterone receptor membrane component 1 (PGRMC1)  ubiquitin-conjugating enzyme E2N (UBC13 homolog, yeast) (UBE2N)  
Late life change (-)  neutral sphingomyelinase (N-SMase) activation associated factor (NSMAF)  zinc finger protein 609 (ZNF609)  
Our data cover a range of age categories including both development and senescence. Two fundamental theories of senescence are Williams' [29] theory of antagonistic pleiotropy and Medawar's [30] theory of mutation accumulation. These theories present an interesting interpretation in the context of our differential coexpression coefficients. Antagonistic pleiotropy posits a long-scale connection between early states and late states, in which a characteristic useful in youth is harmful later (e.g., [31]). A changing functional role over age in this way would be a good candidate for finding differential coexpression, and, in particular, we would expect such differential coexpression to be present in the coefficient at the appropriate timescale. We would expect to see antagonistic pleiotropy candidate genes showing significant values in their lifelong change coefficient (youthful tradeoff), while mutation accumulation should particularly exhibit enrichment significance in the fourth coefficient (senescent change). The third Haar coefficient maps most readily onto developmental change, representing rapid and early changes in coexpression which we are tempted to interpret as developmentally related changes in function.
More specific mechanistic interpretations of function and dysfunction over age also map more readily onto a Haar basis than other bases because they typically involve both a factor in time and scale. As previously mentioned, Barker's theory of fetal programming suggests a long term effect between early and late. One well studied mechanism reviewed by Maric [32] involves fetal programming for high blood pressure. While some research has cast doubt on the importance of fetal programming in longevity itself [33], there may well be similar processes that do not affect longevity simply, but do affect function in more complex ways [34] or relate developmental and late stage changes [35]. Because the Haar coefficients can capture both the scale and timing of this event, they might serve to elucidate the unknown genetic causes for the well characterized physiological changes. With only four groupings by age, interpretation of this sort in our data must remain somewhat restrained, but finer resolution age groupings could make this a valuable characteristic of our method.
Conclusion
Differential coexpression over age generates significant information about the genes producing it. Our Haar basis methodology for determining age-related differential coexpression performs better than either a derivative-based method, or using the age groups independently. The Haar basis set also lends itself to ready interpretation in terms of both evolutionary and physiological mechanisms of aging and can be seen as a natural generalization of two-category differential coexpression. The good performance across the multiple GO sets implies that age-related differential coexpression may be a common process due to the degree to which life stage produces changes in function and functional relationships. Because our Haar-based method for differential coexpression draws upon such a well-established signal processing tool for temporal data, it offers a well-characterized, efficient and convenient avenue for further study.
Declarations
Acknowledgements
We thank Gavin Ha and Raymond Lim for technical support.
Funding: The work was supported by NIH grant GM076990, a Michael Smith Foundation for Health Research career award and a CIHR New Investigator award.
References
Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 1998, 95(25):14863–14868. 10.1073/pnas.95.25.14863
Lee HK, Hsu AK, Sajdak J, Qin J, Pavlidis P: Coexpression analysis of human genes across many microarray data sets. Genome Res 2004, 14(6):1085–1094. 10.1101/gr.1910904
Watson M: CoXpress: differential coexpression in gene expression data. BMC Bioinformatics 2006, 7: 509. 10.1186/1471-2105-7-509
Choi JK, Yu U, Yoo OJ, Kim S: Differential coexpression analysis using microarray data and its application to human cancer. Bioinformatics 2005, 21(24):4348–4355. 10.1093/bioinformatics/bti722
Kostka D, Spang R: Finding Disease Specific Alterations in the Coexpression of Genes. Bioinformatics 2004, 20: 5. 10.1093/bioinformatics/bth909
Zahn JM, Poosala S, Owen AB, Ingram DK, Lustig A, Carter A, Weeraratna AT, Taub DD, Gorospe M, Mazan-Mamczarz K, et al.: AGEMAP: a gene expression database for aging in mice. PLoS Genet 2007, 3(11):e201. 10.1371/journal.pgen.0030201
Lee CK, Klopp RG, Weindruch R, Prolla TA: Gene expression profile of aging and its retardation by caloric restriction. Science 1999, 285(5432):1390–1393. 10.1126/science.285.5432.1390
de Magalhaes JP, Curado J, Church GM: Meta-analysis of age-related gene expression profiles identifies common signatures of aging. Bioinformatics 2009, 25(7):875–881. 10.1093/bioinformatics/btp073
Xue H, Xian B, Dong D, Xia K, Zhu S, Zhang Z, Hou L, Zhang Q, Zhang Y, Han JD: A modular network model of aging. Mol Syst Biol 2007, 3: 147. 10.1038/msb4100189
Zahn JM, Sonu R, Vogel H, Crane E, Mazan-Mamczarz K, Rabkin R, Davis RW, Becker KG, Owen AB, Kim SK: Transcriptional profiling of aging in human muscle reveals a common aging signature. PLoS Genet 2006, 2(7):e115. 10.1371/journal.pgen.0020115
Rodwell GE, Sonu R, Zahn JM, Lund J, Wilhelmy J, Wang L, Xiao W, Mindrinos M, Crane E, Segal E, et al.: A transcriptional profile of aging in the human kidney. PLoS Biol 2004, 2(12):e427. 10.1371/journal.pbio.0020427
Hamer K, Pavlidis P: GEMMA. 2009. [http://www.chibi.ubc.ca/Gemma]
Bowen RL, Atwood CS: Living and dying for sex. A theory of aging based on the modulation of cell cycle signaling by reproductive hormones. Gerontology 2004, 50(5):265–290. 10.1159/000079125
Haar A: Zur Theorie der orthogonalen Funktionensysteme. Mathematische Annalen 1909, 69(3):40.
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D: The human genome browser at UCSC. Genome Res 2002, 12(6):996–1006.
Mistry M, Pavlidis P: Gene Ontology term overlap as a measure of gene functional similarity. BMC Bioinformatics 2008, 9: 327. 10.1186/1471-2105-9-327
Lee HK, Braynen W, Keshav K, Pavlidis P: ErmineJ: tool for functional analysis of gene expression data sets. BMC Bioinformatics 2005, 6: 269. 10.1186/1471-2105-6-269
Hochberg Y, Benjamini Y: More powerful procedures for multiple significance testing. Stat Med 1990, 9(7):811–818. 10.1002/sim.4780090710
Mostafavi S, Ray D, Warde-Farley D, Grouios C, Morris Q: GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol 2008, 9(Suppl 1):S4. 10.1186/gb-2008-9-s1-s4
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al.: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25(1):25–29. 10.1038/75556
Li KC, Liu CT, Sun W, Yuan S, Yu T: A system for enhancing genome-wide coexpression dynamics study. Proc Natl Acad Sci USA 2004, 101(44):15561–15566. 10.1073/pnas.0402962101
Barker DJ, Osmond C, Golding J, Kuh D, Wadsworth ME: Growth in utero, blood pressure in childhood and adult life, and mortality from cardiovascular disease. BMJ 1989, 298(6673):564–567. 10.1136/bmj.298.6673.564
Gluckman PD, Hanson MA, Morton SM, Pinal CS: Lifelong echoes - a critical analysis of the developmental origins of adult disease model. Biol Neonate 2005, 87(2):127–139. 10.1159/000082311
Fisher RA: Frequency distribution of the values of the correlation coefficient in samples of an indefinitely large population. Biometrika 1915, 10: 507–521.
Fisher RA: Combining independent tests of significance. American Statistician 1948, 2(5):30. 10.2307/2681650
James PT, Rigby N, Leach R: The obesity epidemic, metabolic syndrome and future prevention strategies. Eur J Cardiovasc Prev Rehabil 2004, 11(1):3–8. 10.1097/01.hjr.0000114707.27531.48
Kanehisa M: The KEGG database. Novartis Found Symp 2002, 247: 91–101. discussion 101–103, 119–128, 244–252.
de Magalhaes JP, Toussaint O: GenAge: a genomic and proteomic network map of human ageing. FEBS Lett 2004, 571(1–3):243–247. 10.1016/j.febslet.2004.07.006
Williams GC: Pleiotropy, natural selection, and the evolution of senescence. Evolution 1957, 11: 13.
Medawar PB: Old age and natural death. Mod 1946, 1: 27.
Rodier F, Campisi J, Bhaumik D: Two faces of p53: aging and tumor suppression. Nucleic Acids Res 2007, 35(22):7475–7484. 10.1093/nar/gkm744
Maric C: Mechanisms of fetal programming of adult hypertension: role of sex hormones. Hypertension 2007, 50(4):605–606. 10.1161/HYPERTENSIONAHA.107.096768
Williams S, Poulton R: Twins and maternal smoking: ordeals for the fetal origins hypothesis? A cohort study. BMJ 1999, 318(7188):897–900.
Reynolds RM, Walker BR, Phillips DI, Dennison EM, Fraser R, Mackenzie SM, Davies E, Connell JM: Programming of hypertension: associations of plasma aldosterone in adult men and women with birthweight, cortisol, and blood pressure. Hypertension 2009, 53(6):932–936. 10.1161/HYPERTENSIONAHA.109.129320
Finch CE: The regulation of physiological changes during mammalian aging. Q Rev Biol 1976, 51(1):49–83. 10.1086/409053
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.