Computational genetic neuroanatomy of the developing mouse brain: dimensionality reduction, visualization, and clustering
© Ji; licensee BioMed Central Ltd. 2013
Received: 15 April 2013
Accepted: 1 July 2013
Published: 11 July 2013
The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development.
In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space.
Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship.
The brain consists of an enormous number of cells organized into structures [1, 2]. The structured organization of cells is the key to the functional efficiency of the brain [3-6]. Hence, a natural first step toward understanding the brain function would be to address basic research questions at the structure level. How cells are organized into structures [7, 8]? What are the functions of structures ? How the structures are connected to each other [10, 11]? However, a fundamental difficulty of understanding brain functions at the structure level lies in that there is no universally agreed division of cells into structures .
From a developmental perspective, the delicate organization of brain into structures is the consequence of stringent spatiotemporal patterning controlled by the molecular signals during development. In this process, cells at different spatial locations read different morphogenetic positional signals produced by the graded distribution of signaling molecules. These signals control the expression of a relatively small set of transcription factors, which in turn regulate the expression of a larger number of genes. This sequential cascade of expression control ultimately leads to cell differentiation and the emergence of connections and functional properties . The discovery that certain marker genes are expressed in regionally restricted patterns in the developing brain has either led to the introduction of new structural boundaries or made it possible to re-define existing boundaries at a higher resolution . Currently, studies on the molecular-structural associations are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development [15-18].
In this article, we study the relationship between brain anatomy and spatiotemporal gene expression patterns in the developing mouse brain. This global study of developing neuroanatomy is made possible by the high-resolution, three-dimensional (3-D) gene expression patterns provided by the Allen Brain Atlas (ABA) [19-22]. As part of the ABA, the Allen Developing Mouse Brain Atlas provides spatiotemporal in situ hybridization (ISH) gene expression pattern images across four embryonic and three postnatal developmental ages [21, 22], yielding effectively a four-dimensional brain atlas. To establish a common coordinate framework for analyzing the ISH data, the ISH image series are aligned to the Allen Developing Mouse Brain Reference Atlas. This enables the global, computational study of the spatiotemporal gene expression patterns of many genes and comparison of the results with developmental anatomy.
To enable visual explorations of the gene expression patterns and correlate the results with classically defined neuroanatomy, we first map the high-dimensional, voxel-level gene expression data to low-dimensional space in which data visualization can be readily achieved. Numerous multivariate analysis methods can be used for this purpose. However, traditional methods either retain the global structures or the local structures in computing the mapping, producing results that are not satisfactory. To preserve both the local and the global structures in the spatial gene expression space, we employ a recent method known as the t-distributed stochastic neighbor embedding (t-SNE)  for mapping the high-dimensional data. This method is able to capture the local similarities in the high-dimensional space, while retaining the global structures as much as possible.
We map the high-dimensional gene expression data to 2-D space using t-SNE and visualize the reduced data at multiple levels of the Allen Developing Mouse Brain Reference Atlas ontology, which was created based on the “prosomeric model” [24-26]. This models proposes that the neural tube is divided into grid-like pattern of longitudinal and transverse regions. Our results show that the brain anatomy is largely preserved in the low-dimensional gene expression space at multiple levels. To provide a quantitative comparison of the relationship between gene expression patterns and neuroanatomy, we cluster the brain voxels into groups based on gene expression data in the original high-dimensional space and in the dimensionality-reduced space. Our results show that the clustering results in the low-dimensional space are more consistent with developmental anatomy than those in the original high-dimensional space.
Allen developing mouse brain atlas
The sizes of the 3-D grid data arrays at seven developmental ages
Dimensionality reduction and visualization
Dimensionality reduction is the procedure of mapping high-dimensional data points to low-dimensional space by optimizing certain criterion. Such techniques facilitate visual exploration of the high-dimensional data when they are mapped to 2-D or 3-D space. Traditional techniques for dimensionality reduction include linear method such as principal component analysis (PCA), multidimensional scaling (MDS), and nonlinear approaches such as local linear embedding (LLE) [29, 30]. These techniques either capture the global structure of the original data or try to retain the local structure within the neighborhood of each data point.
In order to capture both the local structure and the global structure such as the presence of clusters, a class of methods, known as the stochastic neighbor embedding (SNE), have been developed . To simplify the optimization and overcome the so-called “crowding problem”, SNE is extended to t-distributed SNE (t-SNE) in . Given n high-dimensional data points where , t-SNE computes n low-dimensional data points , known as map points, by trying to preserve the pairwise similarities in the high-dimensional space. To this end, t-SNE computes an n × n similarity matrix in both the original data space and in the low-dimensional space. The similarity matrix in the high-dimensional space is obtained based on symmetrized Gaussian conditional distributions, while that in the low-dimensional space is computed from Student t-distributions. The map points are learned by minimizing the Kullback-Leibler (KL) divergence between the probability distributions in the original data space and the embedding space. To map our ISH gene expression data, x i represents the high-dimensional gene expression vector of the ith voxel, and y i represents its representation in the low-dimensional space.
Because KL divergence is not symmetric, different types of mismatches contribute differently to the overall cost. Specifically, a large cost will be induced if distant map points are used to represent nearby original data points, while a small cost is incurred if distant original data points are mapped to nearby map points. This indicates that t-SNE is able to preserve the local structure of the high-dimensional data points. It has been shown that the objective function of t-SNE is particularly straightforward to optimize in comparison to the original SNE objective.
The original algorithm in  for computing the low-dimensional map points has a time and space complexity of O(n2), where n is the number of data points. In , a more efficient algorithm, known as the Barnes-Hut-SNE, is developed, and it has time and O(n) space complexity. This enables the application of t-SNE to the large-scale Allen Developing Mouse Brain Atlas data. The implementations of t-SNE can be found at http://homepage.tudelft.nl/19j49/t-SNE.html.
To study the relationship between spatial gene expression patterns and classical neuroanatomy in the adult mouse brain, Bohland et al.  use the Allen Mouse Brain Atlas data [20, 34] and apply principal component analysis (PCA) to reduce the data dimensionality before the k-means algorithm is used to cluster the brain voxels into groups. To visualize the spatial gene expression patterns, they also map the high-dimensional gene expression data to 3-D space using PCA and visualize the data using scatter plots.
Following , we apply the k-means clustering algorithms to group brain voxels into clusters based on the gene expression data in both the original high-dimensional space and the dimensionality-reduced space. Since the results of the k-means algorithm depend on the initial cluster centers that are randomly selected, we repeat this algorithm 10 times and use the results with the smallest within-cluster sum of squares error. The number of clusters in k-means is set to be equal to the number of brain structures at each particular ontology level. We reduce the high-dimensional gene expression data to 2-D and 10-D spaces using t-SNE and PCA and then apply the k-means algorithm to cluster the voxels based on these low-dimensional representations. We then quantitatively compare the consistency between voxel clusters and the neuroanatomy at multiple levels in the Reference Atlas developmental ontology.
We employ four performance measures, including the normalized mutual information (NMI), adjusted rand index (ARI), purity, and S-index, to evaluate the consistency between clustering results and developmental neuroanatomy. The first three measures have been commonly used in the clustering community as external criteria for evaluating clustering results , and the ARI and S-index have been used for comparing different brain parcellation schemes . We treat the voxel annotations as their class labels and compare them with the clustering results. In computing purity, each cluster is assigned to the most frequent class in the cluster, and then the final measure is the proportion of correctly assigned samples. One disadvantage of purity is that it cannot trade off the quality of the clustering against the number of clusters . This limitation can be overcome by the NMI, which measures the amount of (normalized) information by which our knowledge about the classes increases when we are given the clustering results. The ARI computes the normalized fraction of all possible pairs of voxels that (1) have the same class label and are assigned to the same cluster or (2) have different class labels and are assigned to different clusters. The S-index was specifically designed to compare different brain parcellations, and it “penalizes” class-to-cluster relationships that are overlapping, but that are not pure subset relationships . Different measures capture different aspects of class-to-cluster consistency, and thus the trend of performance by different measures might not always be the same.
Results and discussion
Statistics of the Allen Developing Mouse Brain Atlas data sets that are used in this work
# of genes
# of voxels
Data visualization at multiple ontology levels
We observe that t-SNE is better at visualizing the high-dimensional gene expression data than PCA. Specifically, we can observe that, at all developmental ages, the three major brain structures at Level 1 (forebrain, midbrain, and hindbrain) are very well separated. The results by t-SNE preserve the brain anatomy more faithfully than those by PCA at this level. The second rows of Figures 4 and 5 show the results by t-SNE and PCA displayed using the Level 3 annotations, which identify the major transversal segments. We can observe that both the global and local brain structures at this level are largely preserved in the dimensionality-reduced gene expression data space. The third rows of Figures 4 and 5 show the scatter plots of reduced data displayed using the Level 5 annotations, which identify the four longitudinal zones in addition to the transversal segments. We can observe that within each of the transversal segments, voxels belong to the same longitudinal zones are usually placed close to each other. However, voxels in the same longitudinal zone but belong to different transversal segments are not necessarily placed at nearby locations.
We can observe from Figures 4 and 5 that t-SNE is able to map high-dimensional data to 2-D space in which the neuroanatomy can be largely recovered. For example, in Figures 4 and 5 the overall organization of the three brain structures at Level 1 are largely preserved, where the midbrain voxels are placed between the forebrain and hindbrain voxels. These results indicate that t-SNE is able to preserve both the local and the global structures of the data simultaneously. In addition, the shapes of the structures are also preserved to some extent. For example, it is known that the midbrain is a wedge-shaped structure due to the sharp flexion of the neuraxis in this region . We can see from Figures 4 and 5 that this is largely preserved in most plots. This is especially clear from plot for the developmental age E11.5. This is presumably due to the much larger number of voxels in late ages (Table 2), which prevent some global structures from being fully incorporated.
At Level 3 shown in Figures 4 and 5, the transversal segment structures are also largely preserved. In particular, p1 voxels are almost always close to the midbrain voxels, while p3 voxels are usually on the secondary prosencephalon side. m1 voxels are mostly placed closely with p1 voxels, while m2 voxels are nearby with hindbrain voxels. In the hindbrain, prepontine hindbrain voxels (including is, r1, and r2) are mostly close to midbrain voxels; medullary hindbrain voxels (including r7, r8, r9, r10, and r11) are placed on the far side; pontine hindbrain (r3 and r4) and pontomedullary hindbrain (r5 and r6) voxels are somewhere in between. We also observe that the global brain structures are less well preserved at late developmental ages. This might be due to the increasingly larger number of brain voxels at late ages, which makes it increasingly difficult to preserve both the global and the local structures. In this case, t-SNE tends to focus more on retaining the local structure due to the asymmetric nature of the KL divergence.
Clustering and comparison with neuroanatomy
Comparison of clustering results with the Reference Atlas annotations at developmental ontology Level 1
Comparison of clustering results with the Reference Atlas annotations at developmental ontology Level 3
Comparison of clustering results with the Reference Atlas annotations at developmental ontology Level 5
We can observe from Table 3 that the results from low-dimensional representations computed by t-SNE are much more consistent with neuroanatomy than those from the original representations at Level 1. On average, the performance measured by NMI and S-index has been more than doubled, and that by adjusted rand index has been increased from 0.0985 to 0.3855. On the other hand, the results from PCA-reduced data are similar to those by the original data. This is consistent with the visualization results that PCA-reduced data fail to separate voxels from different brain structures clearly at this level. We also observe that the results of PCA are similar to those by the original data for measures NMI, ARI, and purity. For S-index, these two sets of results are not similar. This might indicate that S-index measures class-to-cluster consistency in a different way than other measures. As has been mentioned in Section “Clustering”, S-index penalizes class-to-cluster relationships that are overlapping, but that are not pure subset relationships . The other three measures are not specifically designed to capture such relationship.
At Levels 3 and 5, we can observe from Tables 4 and 5 that, on average, the clustering results based on the t-SNE reduced data are more consistent with the neuroanatomy than those by the original data. In addition, the t-SNE results are more consistent with the neuroanatomy than those by PCA for measures NMI, ARI, and purity. The PCA-reduced data give better performance than the original and the t-SNE reduced data for measure S-index. This again indicates that S-index measures consistency in a different way compared with the other three measures. We can conclude from the above results that, although t-SNE gives better visualization results than PCA at all levels, the clustering results based on PCA-reduced data could yield higher consistency with the neuroanatomy than those based on t-SNE for certain measure. These results are consistent with the results reported in .
Dimensionality reduction by t-SNE and PCA
We observe that t-SNE gives the best results in terms of preserving both the local and the global structures in the high-dimensional gene expression space in comparison with PCA. We also observe that when the data sets are very large, such as those in late developmental ages of the Allen Developing Mouse Brain Atlas, preserving both the local and the global structures might be very hard or even impossible. In these cases, t-SNE tries to preserve local structures at the price of losing some global structures. This tradeoff is achieved by giving different costs to different types of errors in computing the mapping. In particular, because KL divergence is not symmetric, different types of mismatches contribute differently to the overall cost. A large cost will be induced if distant map points are used to represent nearby original data points. This large cost will ensure that the local structures are faithfully preserved. In contrast, a relatively small cost is incurred if distant original data points are mapped to nearby map points. Hence, a small cost will be incurred if the global structures are not preserved accurately. This asymmetric property makes t-SNE especially useful in reducing and visualizing large-scale brain data sets in comparison to other traditional techniques, which preserve either the global or the local structures.
Longitudinal zones versus transversal segments
In developmental neuroanatomy, two primary models have been proposed to explain the neural plate and tube regionalization based on gene expression and morphological information . These are the topographic “columnar” model , and the topological “segmental” model known as the “prosomeric model” [24-26, 39]. Recent experimental data have shown that the prosomeric model is more consistent with morphological and molecular evidences. This leads to the adoption of this model in the Allen Developing Mouse Brain Reference Atlas. The columnar model mainly focuses on dividing the neural plate and tube along the longitudinal dimension, while the segmental model favors division into transversal domains. In the prosomeric model (Figure 1), the developing nervous system is divided into a grid-like pattern of longitudinal and transversal histogenetic domains. Along the longitudinal axis, four zones, known as the floor plate, basal plate, alar plate, and roof plate, are specified by DV patterning mechanisms. Along the transversal axis, the AP patterning signals subdivide the brain wall into a constant set of segments known as neuromeres.
Manifold structures in developmental gene expression
We have observed that clustering of the low-dimension representations generated by t-SNE leads to more consistent results with neuroanatomy than those by the original and the PCA-reduced representations. This might indicate that the original gene expression data lie on a low-dimensional manifold in the high-dimensional space. In addition, a general trend that we have observed in comparing the clustering results with neuroanatomy is that clustering using the low-dimensional representations gives very significant performance improvement at Level 1 in comparison to those by the original and the PCA-reduced representations. This improvement decreases as we move to Level 3 and Level 5. Such trend is consistent with our hypothesis that the original gene expression data lie on a manifold in the high-dimensional space, because the Level 1 structures are simpler and thus are easier to capture by low-dimensional representations than those at Level 3 and Level 5. Hence, embedding of the simple manifold into low-dimensional space facilitates the faithful characterization of the underlying structures. On the other hand, reducing relatively complex manifold structures to low-dimensional space might not lead to better representations.
We employ global computational analysis to study the relationship between gene expression patterns and neuroanatomy in the developing mouse brain. To enable visual explorations, we map the high-dimensional ISH gene expression data to low-dimensional space by preserving both the local and the global structures. This unsupervised, data-driven mapping of spatial gene expression data leads to low-dimensional representations that can be easily visualized. Our results show that the developmental neuroanatomy is largely preserved in the low-dimensional gene expression data space. To provide quantitative results, we cluster both the original high-dimensional data and the low-dimensional mapped data and compare the results with the developmental neuroanatomy. Our results show that the clusters in the low-dimensional space are more consistent with developmental neuroanatomy than those in the high-dimensional space.
In this work, the data set at each developmental age is analyzed separately. Since development is a continuous process, it would be interesting to map and cluster the data by incorporating temporal smoothness constraints [40, 41]. We will explore time-varying dimensionality reduction and clustering algorithms in the future. Our results have shown that, although majority of the voxels are mapped to locations that are consistent with their anatomical annotations, there do exist some exceptions. We will investigate these cases in the future.
We thank the Allen Institute for Brain Science for making the Allen Developing Mouse Brain Atlas data available. We thank Chinh Dang, David Feng, Terri Gilbert, Michael Hawrylycz, Luis Puelles, and Carol Thompson for assistance in interpreting the data and results. This work was supported by research grants from the National Science Foundation (DBI-1147134) and Old Dominion University Office of Research.
- Swanson LW: Brain Architecture: Understanding the Basic Plan, 2nd edition. 2011, New York: Oxford University PressView ArticleGoogle Scholar
- Swanson LW: Brain Maps: Structure of the Rat Brain, 3rd EDITION. 2003, San Diego: Academic Press, 3rdGoogle Scholar
- Sporns O: Networks of the Brain. 2010, Cambridge: The MIT PressGoogle Scholar
- Bullmore E, Sporns O: The economy of brain network organization. Nat Rev Neurosci. 2012, 13 (5): 336-349.PubMedGoogle Scholar
- Sporns O: From simple graphs to the connectome: Networks in neuroimaging. NeuroImage. 2012, 62 (2): 881-886. 10.1016/j.neuroimage.2011.08.085.View ArticlePubMedGoogle Scholar
- Rubinov M, Sporns O: Complex network measures of brain connectivity: uses and interpretations. NeuroImage. 2010, 52 (3): 1059-1069. 10.1016/j.neuroimage.2009.10.003.View ArticlePubMedGoogle Scholar
- Paxinos G, Franklin KB: The Mouse Brain in Stereotaxic Coordinates, 4th edition. 2012, San Diego: Academic PressGoogle Scholar
- Grange P, Hawrylycz M, Mitra PP: Computational neuroanatomy and co-expression of genes in the adult mouse brain, analysis tools for the Allen brain Atlas. Quant Biol. 2013, 1 (1): 91-100. 10.1007/s40484-013-0011-5. arXiv:1301.1730v1 Springer-VerlagView ArticleGoogle Scholar
- Honey CJ, Thivierge JP, Sporns O: Can structure predict function in the human brain?. NeuroImage. 2010, 52 (3): 766-776. 10.1016/j.neuroimage.2010.01.071.View ArticlePubMedGoogle Scholar
- Zalesky A, Cocchi L, Fornito A, Murray MM, Bullmore E: Connectivity differences in brain networks. NeuroImage. 2012, 60 (2): 1055-1062. 10.1016/j.neuroimage.2012.01.068.View ArticlePubMedGoogle Scholar
- Bohland JW, Wu C, Barbas H, Bokil H, Bota M, Breiter HC, Cline HT, Doyle JC, Freed PJ, Greenspan RJ, Haber SN, Hawrylycz M, Herrera DG, Hilgetag CC, Huang ZJ, Jones A, Jones EG, Karten HJ, Kleinfeld D, Kötter R, Lester HA, Lin JM, Mensh BD, Mikula S, Panksepp J, Price JL, Safdieh J, Saper CB, Schiff ND, Schmahmann JD, et al: A proposal for a coordinated effort for the determination of Brainwide Neuroanatomical connectivity in model organisms at a Mesoscopic scale. PLoS Comput Bio. 2009, 5 (3): e1000334-10.1371/journal.pcbi.1000334.View ArticleGoogle Scholar
- Bohland JW, Bokil H, Allen CB, Mitra PP: The brain atlas concordance problem: quantitative comparison of anatomical parcellations. PLoS ONE. 2009, 4 (9): e7200-10.1371/journal.pone.0007200.PubMed CentralView ArticlePubMedGoogle Scholar
- Watson C, Paxinos G, Puelles L: The Mouse Nervous System. 2011, San Diego: Academic PressGoogle Scholar
- Hidalgo-Sánchez M, Millet S, Bloch-Gallego E, Alvarado-Mallart RM: Specification of the meso-isthmo-cerebellar region: The Otx2/Gbx2, boundary. Brain Res Rev. 2005, 49 (2): 134-149. 10.1016/j.brainresrev.2005.01.010.View ArticlePubMedGoogle Scholar
- Ferran J, Sánchez-Arrones L, Sandoval J, Puelles L: A model of early molecular regionalization in the chicken embryonic pretectum. J Comp Neurol. 2007, 505 (4): 379-403. 10.1002/cne.21493.View ArticlePubMedGoogle Scholar
- Ferran J, de Oliveira ED, Merchán P, Sandoval J, Sánchez-Arrones L, Martínez-De-La-Torre M, Puelles L: Genoarchitectonic profile of developing nuclear groups in the chicken pretectum. J Comp Neurol. 2009, 517 (4): 405-451. 10.1002/cne.22115.View ArticlePubMedGoogle Scholar
- Bernard A, Sorensen SA, Lein ES: Shifting the paradigm: new approaches for characterizing and classifying neurons. Curr Opin Neurobiol. 2009, 19 (5): 530-536. 10.1016/j.conb.2009.09.010.View ArticlePubMedGoogle Scholar
- Hawrylycz MJ, Lein ES, Guillozet-Bongaarts AL, Shen EH, Ng L, Miller JA, van de Lagemaat LN, Smith KA, Ebbert A, Riley ZL, Abajian C, Beckmann CF, Bernard A, Bertagnolli D, Boe AF, Cartagena PM, Chakravarty MM, Chapin M, Chong J, Dalley RA, Daly BD, Dang C, Datta S, Dee N, Dolbeare TA, Faber V, Feng D, Fowler DR, Goldy J, Gregor BW, et al: An anatomically comprehensive atlas of the adult human brain transcriptome. Nature. 2012, 489 (7416): 391-399. 10.1038/nature11405.PubMed CentralView ArticlePubMedGoogle Scholar
- Allen Institute for BrainScience: Allen developing mouse brain atlas [Internet]. 2012, [http://developingmouse.brain-map.org]Google Scholar
- Lein ES, et al: Genome-wide atlas of gene expression in the adult mouse brain. Nature. 2007, 445 (7124): 168-176. 10.1038/nature05453.View ArticlePubMedGoogle Scholar
- Sunkin SM, Ng L, Lau C, Dolbeare T, Gilbert TL, Thompson CL, Hawrylycz M, Dang C: Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system. Nucleic Acids Res. 2013, 41 (D1): D996-D1008. 10.1093/nar/gks1042.PubMed CentralView ArticlePubMedGoogle Scholar
- Ng LL, Sunkin SM, Feng D, Lau C, Dang C, Hawrylycz MJ: Chapter seven -Large-Scale Neuroinformatics for In Situ hybridization data in the mouse brain. Bioinformatics of Behavior: Part 2 Volume 104 International Review of Neurobiology. Edited by: Haendel M A, Chesler E J, Chesler E J , Haendel M A . 2012, San Diego: Academic Press, 159-182.View ArticleGoogle Scholar
- van der Maaten, Hinton GE: Visualizing high-dimensional data using t-SNE. J Mach Learn Res. 2008, 9: 2579-2605.Google Scholar
- Puelles L, Amat JA, Martinez-de-la Torre M: Segment-related, mosaic neurogenetic pattern in the forebrain and mesencephalon of early chick embryos: I. Topography of ache-positive neuroblasts up to stage HH18. J Comp Neurol. 1987, 266 (2): 247-268. 10.1002/cne.902660210.View ArticlePubMedGoogle Scholar
- Puelles L: A segmental morphological paradigm for understanding vertebrate forebrains. Brain Behav Evol. 1995, 46: 319-337. 10.1159/000113282.View ArticlePubMedGoogle Scholar
- Puelles L, Rubenstein JL: Forebrain gene expression domains and the evolving prosomeric model. Trends Neurosci. 2003, 26 (9): 469-476. 10.1016/S0166-2236(03)00234-0.View ArticlePubMedGoogle Scholar
- Allen Institute for BrainScience: Technical white paper informatics data processing for the allen developing mouse brain atlas. 2012, [http://developingmouse.brain-map.org/docs/InformaticsDataProcessing.pdf]Google Scholar
- Allen Institute for BrainScience: Technical white paper: Allen developing mouse brain reference atlas. 2012, [http://developingmouse.brain-map.org/docs/ReferenceAtlas.pdf]Google Scholar
- Burges CJC: Dimension reduction: a guided tour. Foundations Trends Mach Learn. 2010, 2 (4): 275-365.View ArticleGoogle Scholar
- van der Maaten, Postma EO, van den Herik: Dimensionality reduction: a comparative review. 2009, Tilburg University Technical Report, TiCC-TR 2009-005Google Scholar
- Hinton GE, Roweis ST: Stochastic neighbor embedding. Advances in Neural Information Processing Systems 15. 2003, Cambridge: MIT Press, 857-864.Google Scholar
- van der Maaten: Barnes-Hut-SNE. arXiv:1301.3342. 2013Google Scholar
- Bohland JW, Bokil H, Pathak SD, Lee CK, Ng L, Lau C, Kuan C, Hawrylycz M, Mitra PP: Clustering of spatial gene expression patterns in the mouse brain and comparison with classical neuroanatomy. Methods. 2010, 50 (2): 105-112. 10.1016/j.ymeth.2009.09.001.View ArticlePubMedGoogle Scholar
- Allen Institute for BrainScience: Allen mouse brain atlas [Internet]. 2012, [http://mouse.brain-map.org/]Google Scholar
- Manning CD, Raghavan P, Schütze H: Introduction to Information Retrieval. 2008, New York: Cambridge University PressView ArticleGoogle Scholar
- Allen Institute for BrainScience: Allen brain atlas API. 2012, [http://www.brain-map.org/api/index.html]Google Scholar
- Watson C, Kirkcaldie M, Paxinos G: The Brain: An Introduction to Functional Neuroanatomy. 2010, San Diego: Academic PressGoogle Scholar
- Alvarez-Bolado G, Rosenfeld MG, Swanson LW: Model of forebrain regionalization based on spatiotemporal patterns of POU-III homeobox gene expression, birthdates, and morphological features. J Compar Neur. 1995, 355 (2): 237-295. 10.1002/cne.903550207.View ArticleGoogle Scholar
- Bulfone A, Puelles L, Porteus M, Frohman M, Martin G, Rubenstein J: Spatially restricted expression of Dlx-1, Dlx-2 (Tes-1), Gbx-2, and Wnt-3 in the embryonic day 12.5 mouse forebrain defines potential transverse and longitudinal segmental boundaries. J Neurosci. 1993, 13 (7): 3155-3172.PubMedGoogle Scholar
- Ji S, Zhang W, Liu J: A sparsity-inducing formulation for evolutionary co-clustering. Proceedings of the Eighteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining New York: Association for Computing Machinery. 2012, 334-342.Google Scholar
- Zhang W, Ji S, Zhang R: Evolutionary soft co-clustering. Proceedings of the 2013 SIAM International Conference on Data Mining Philadelphia: Society for Industrial and Applied Mathematics. 2013, 121-129.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.