- Methodology article
- Open Access
Morphometic analysis of TCGA glioblastoma multiforme
© Chang et al; licensee BioMed Central Ltd. 2011
- Received: 21 September 2011
- Accepted: 20 December 2011
- Published: 20 December 2011
Our goals are to develop a computational histopathology pipeline for characterizing tumor types that are being generated by The Cancer Genome Atlas (TCGA) for genomic association. TCGA is a national collaborative program where different tumor types are being collected, and each tumor is being characterized using a variety of genome-wide platforms. Here, we have developed a tumor-centric analytical pipeline to process tissue sections stained with hematoxylin and eosin (H&E) for visualization and cell-by-cell quantitative analysis. Thus far, analysis is limited to Glioblastoma Multiforme (GBM) and kidney renal clear cell carcinoma tissue sections. The final results are being distributed for subtyping and linking the histology sections to the genomic data.
A computational pipeline has been designed to continuously update a local image database, with limited clinical information, from an NIH repository. Each image is partitioned into blocks, where each cell in the block is characterized through a multidimensional representation (e.g., nuclear size, cellularity). A subset of morphometric indices, representing potential underlying biological processes, can then be selected for subtyping and genomic association. Simultaneously, these subtypes can also be predictive of the outcome as a result of clinical treatments. Using the cellularity index and nuclear size, the computational pipeline has revealed five subtypes, and one subtype, corresponding to the extreme high cellularity, has shown to be a predictor of survival as a result of a more aggressive therapeutic regime. Further association of this subtype with the corresponding gene expression data has identified enrichment of (i) the immune response and AP-1 signaling pathways, and (ii) IFNG, TGFB1, PKC, Cytokine, and MAPK14 hubs.
While subtyping is often performed with genome-wide molecular data, we have shown that it can also be applied to categorizing histology sections. Accordingly, we have identified a subtype that is a predictor of the outcome as a result of a therapeutic regime. Computed representation has become publicly available through our Web site.
- Random Forest
- Nuclear Size
- Renal Clear Cell Carcinoma
- Genomic Association
- Computational Pipeline
Brief review of analysis of H&E images
A comprehensive review of techniques for the analysis of the H&E sections is beyond the scope of this paper. However, a brief review can be found in . From our perspective, three key concepts have been introduced to establish the trend and direction of the research community: (I) one group of researchers have focused on tumor grading through either accurate or rough nuclear segmentation  followed by computing cellular organization [5, 6] and classification. In some cases, tumor grading has been associated with recurrence, progression, and invasion carcinoma (e.g., breast DCIS) , but such an association is highly dependent on tumor heterogeneity and mixed grading (e.g., presence of more than one grade), which offers significant challenge to the pathologists as mixed grading appears to be present in 50% of patients . A recent study indicates that detailed segmentation and multivariate representation of nuclear features from H&E stained sections can predict DCIS recurrence [9, 10] in patients with more than one nuclear grade. In this study, nuclei in the H&E stained samples were manually segmented and a multidimensional representation was computed for differential analysis between the cohorts. The significance of this particular study is that it has been repeated with the same quantitative outcome. In other related studies, image analysis of nuclear features has been found to provide quantitative information that can contribute to diagnosis and prognosis values for carcinoma of the breast [11, 12], prostate , and colorectal mucosa . (II) The second group of researchers have focused on patch-based (e.g., region-based) analysis of tissue sections through means of supervised classification. These methods operate by representing each patch with color and texture features [15, 16] for training either a kernel or regression tree classifiers. A recent study evaluated and compared emerging techniques of sparse coding with kernel based methods (e.g., support vector machine, kernel discriminant analysis) on a GBM dataset to conclude that the kernel based method did equally as well, if not better, than sparse coding. Alternatively, some researchers have investigated how architectural features of tumor grades correlate with fractal dimensions . Fractal dimensions differ from topological dimensions and has been shown to have the potential to elucidate irregularities by assigning a gross scalar value for discriminating benign and malignant breast cells from fine needle aspiration . (III) A third group of researchers have suggested utilizing the detection of lymphocytes as a prognostic tool for breast cancer . Lymphocytes are part of the adaptive immune response and their presence has been correlated with nodal metastasis and HER2-positive breast cancer, ovarian , and GBM. These cells often respond in larger quantity, and they can be easily detected because of their constant size (e.g., approximately 7 micron in diameter) and high chromatin content.
Brief review of methods for nuclear segmentation
Complexities in delineating nuclear regions originate from both technical (e.g., non-uniform fixation and staining protocol, artifacts in a tissue section, non-uniform thickness in tissue sections) and biological (e.g., different cell types, overlapping compartment) variations. Present techniques have focused on adaptive thresholding followed by morphological operators [21, 22], fuzzy clustering [4, 23], level set method using gradient information [24–26], color separation followed by optimum thresholding and learning [27, 28], hybrid color and texture analysis that are followed by learning and unsupervised clustering , and representation of nuclei organization in tissue [30, 31] that is computed from either interactive segmentation or a combination of intensity, texture, and morphological operators. Some applications combine the above techniques. For example, in , iterative radial voting  was used to estimate seeds for the location of the nuclei and subsequently, the model interaction between neighboring nuclei with multiphase level set [34, 35]. It is also a common practice that through color decomposition, nuclear regions can be segmented using the same techniques that have been developed for fluorescence microscopy. In recent papers, we [36, 37] and others  have reviewed those techniques. However, none of these methods can effectively address analytical requirements of the tumor characterization. Thresholding and clustering assume constant chromatin content for the nuclei in the image. In practice, there is a wide variation in chromatin content. In addition, there is the issue with overlapping and clumping of the nuclei, and sometimes, due to the tissue thickness, they cannot be segmented. The method proposed in  aims to delineate overlapping nuclei through iterative radial voting , but seed detection can fail in the presence of wide variations in the nuclear size; thus, leading to fragmentation. We should also note that many of the techniques that have been developed for analysis of cell culture models, imaged through fluorescence microscopy, are applicable to the analysis of histology sections. Accordingly, methods have been developed to quantify a variety of endpoints using iterative voting [33, 39], geometric reasoning [40, 41], evolving fronts [35, 37, 42], and Gabor filter banks .
Having summarized the current state of computational histopathology, our objective is to use a large growing dataset of tumor sections and to identify intrinsic subtypes within this dataset. These subtypes can then be used for genomic association. In other words, we don't seek to build a system to mimic histological grading. To meet this objective, it is essential to develop a pipeline for processing a large scale dataset, to overcome technical variations, and to incorporate methods that are extensible to other tumor types. Our testbed consists of 344 sections of GBM, scanned with a 20 × objective in a bright field, which are typically 40,000-by-40,000 pixels.
Morphometric analysis and multidimensional profiling
We evaluated a number of nuclear segmentation methods that included level sets  or their variants using graph cut implementation, and integration of these methods with seed selection using geometric methods . But these techniques proved to be compute-intensive as a typical tissue section (of size 40k-by-40k pixels) would take roughly a week of processing time on a high end desktop computer. Our experience led to a design of a pipeline that will delineate nuclei and compute morphometric features with a superior computational throughput. The computational model was first validated against synthetic data, then tested on annotated tissue sections, and finally evaluated by a pathologist. Below, we summarize three major components of our methodology.
The pipeline maintains a local registry where consistency between images at TCGA (at the National Cancer Institute) and a local repository is constantly maintained, and new images are downloaded for processing. At present, NCI provides both frozen sections and those from paraffin embedded blocks. Although both types of images are registered and displayed through our system, only paraffin embedded blocks are processed. Each image is partitioned into strips of 1k-by-number of columns, then the strips are stored on the OME image server [47, 48].
Visualization of each large scale tissue section is realized through tiling and the utilization of Flash technology that enables a client to pan and zoom, similar to GoogleMaps™. Each image (of the order of 40k-by-40k pixels or higher) is partitioned into tiles of 256-by-256 pixels at different resolutions, and the tiles are then stored on a server. As the user drags and zooms on the image in the browser, the tiles are downloaded from the server and inserted into the browser page. Data and images are available through http://tcga.lbl.gov.
Each strip is subsequently partitioned into 1k-by-1k blocks, and blocks are submitted to a computer cluster for processing. The block size has been optimized for processing time and wait time in the queue. At the moment, the entire GBM data set of 344 images takes 4 days of processing. In addition to cluster-based computing, the computational methods of the previous section have a multithread implementation for a more efficient utilization of each computing node. Once each block is processed, computed features are imported into an imaging bioinformatics system, named BioSig [37, 49], for further analysis. Several java modules have been developed that run concurrently to access and update the database. The "Jobsubmitter" uses JSch (java version of ssh), and ExpectJ (java version of Expect) to drive shell scripts on the computing cluster. Computed representation (e.g., nuclear segmentation) can then be overlaid on the original image for quality control.
The backend of BioSig uses PostgreSQL (PG) and summarization of feature-based representation is performed through procedural programming. For high performance applications, PG server programming interface (SPI) enables the transparent transformation of SQL queries. This is a critical component since it adds flexibility for computing morphometric and organization features, normalizing them, and analyzing underlying representation in a new way that was not anticipated. This capability has proven to increase productivity by testing alternative representations without reprocessing the original images. Given the entire GBM (or other tumors) datasets, we have designed a four-step process to normalize each computed feature (e.g., nuclear size, texture, cellularity) for subtyping and genomic association, which is implemented through SPI: (i) each feature is represented as a density distribution per tissue; (ii) feature-based distribution for all tissues, within a tumor type, are combined to construct a global distribution; (iii) the global distribution is then re-binned so that each bin has a similar population of cells of a given feature-value, and (iv) local density distributions are then remapped to computed global bins of equal weight. The net result is that the morphological indices can then be compared, in context, by reporting a distribution function for each feature. These data are downloadable and can be visualized for each tissue section. The rationale for this simplified analysis is that given a large number of cells in a tissue section, classical clustering analysis (for quantization) can be computationally intractable (e.g., computing similarity matrices). In cases where multiple tissue sections exist for a single patient, an average distribution is computed and archived.
Subtyping and genomic association
Normalized representation of morphometric data are used for subtyping. Subtyping is based on consensus voting  by varying the number of subtypes and examining the similarity matrix. It has also been used in earlier papers for subtyping 2D and 3D cell culture morphologies [43, 51]. Two gene ranking algorithms of moderated F-statistic and random forests are used for genomic association. (i) Moderated F-statistic  utilizes the empirical Bayes method for assessing differential gene expression. In this method, the denominator mean squares (e.g., variance) are moderated across genes through the empirical Bayes approach. The net result is an improved statistical stability given the limited number of samples. The p-value is computed for each gene based on the moderated F-statistic, and then adjusted for multiple hypothesis testing. The adjustment is based on Benjamini and Hochberg's method to estimate the false discovery rate (FDR) . FDR controls the expected proportion of falsely rejected null hypotheses in multiple hypotheses testing to correct for multiple comparisons. The method is implemented through the R Limma package. The top genes that are differentially expressed between subtype 5 and others, with FDR adjusted p-value less than 0.06, are included in Additional file 1 as a heatmap. (ii) Random forest is an ensemble classifier that consists of many decision trees . In random forest, there are several policies for characterizing significance of each gene. One policy evaluates the decrease in classification accuracy by permutation values of a single gene between multiple samples . We used the R implementation of a random forest package , where the number of trees (ntree) is increased to 2000 to accommodate the original subset of genes (1740) that were used in an earlier TCGA publication . To insure the robustness and stability of gene selection, the process is repeated by averaging over 100 randomly generated forests.
The critical factors in our computational pipeline are the throughput, quality of segmentation and morphometric representation for subtyping, and genomic association. The throughput is significant since images need to be continuously processed with a newer version of the software with increased robustness. Presently, the total computational time for 344 large scale tissue sections (from 133 patients) is less than a week on a shared cluster. Because segmentation results are also important for quality control, a number of intermediate data are also released.
Data, intermediaries, and limitations
Since nuclear segmentation provides the basis for morphometric analysis, subtyping, and survival analysis, it is being released for visualization through our web site at http://tcga.lbl.gov, where users can pan and zoom through the images and overlay segmentation results on original images. The web site also enables exclusion of specific tissue sections for subtyping and genomic association. Computed representations and subtyping is also released through our web site to the community.
Three modules are tested in the computational pipeline: (i) segmentation, (ii) feature extraction, and (iii) subtyping. (i) We have created a subset of hand segmented images, which originate from a diverse set of tissue sections from TCGA GBM dataset. Even though most images are stained properly, the emphasis on this subset has been placed on blocks where the nuclear dye is heterogeneous. The recall and precision is at 78% and 65%, respectively; (ii) feature extraction and representation were tested against synthetic data with known ground truth; (iii) subtyping is evaluated qualitatively by displaying group similarity matrix.
Subtyping based tissue histology and survival analysis
Given that the therapeutic regime has increased life span for the subtype with extreme high cellularity, as shown in Figure 5E, we queried for its molecular marker through differential gene expression analysis as well as random forest. Both gene lists are provided in Additional file 1, and a more detailed discussion of the gene lists through random forest follows. We have analyzed the top 100 genes for pathway and subnetwork enrichment analysis through Pathway Studio. Pathway analysis reveals enrichment of immune response, such as NK-cell (Natural killer cell) and T-cell activation, and AP-1 signaling with p-value of less than 0.05. In support of these findings, the literature suggests that GBM expresses antigens that is recognized by the immune system to eliminate virus infected cells and GBMs [59, 60]. Tumor associated antigen (TAA) indicates that glioma cells can be recognized by the immune response, but this process is hindered by the tumor location and evasion strategies developed by GBM. AP-1 (JUN oncogene) is a transcription factor is responsible for high level regulation of IL-13Ra2 that is expressed in GBM cells , and is also a highly ranked gene in the TCGA gene tracker.
Comparison with prior art
It is important to note that another laboratory  has analyzed the same dataset. There are differences in the outcome and methodologies. For example, they have reported four subtypes in the GBM dataset. We suggest that the (i) addition of the cellularity index, (ii) utility of feature distributions as opposed to the feature means, (iii) selection of specific combination of features as opposed to all computed features, and (iv) absence of curation have been the deciding factors. Besides cell-based multidimensional representation, there are also differences in nuclear segmentation. It is difficult to assess the differences in segmentation in the absence of source code and computed results on a large dataset; however, color normalization (with respect to the gold standard) and separation of the touching nuclei has not been addressed in . These differences, especially curation, can have a significant impact on morphometric analysis. Finally, we have designed and built an open system, where algorithms and software are going through constant improvement, and computed representation and intermediaries are being made available for each version of the software.
We have developed an integrated pipeline to process large scale tissue sections for morphometric analysis. The data are downloaded from the NIH web site, partitioned into blocks, and then processed on a cluster. Computed representation is then transferred to a database where (i) data can be downloaded for molecular association, and (ii) computed information is overlaid on the original image and that through panning and zooming, quality control can be performed. Thus far, GBM and kidney data have become publicly available.
We have shown that through morphometric analysis and cellular organization of tissue histology of a large dataset, subtypes can be identified that are predictive of outcome as a result of therapeutic protocol. The main theme is that histological subtyping reveals intrinsic categories that are independent of supervised histological grades. In other words, TCGA's large curated dataset offers potential for revealing subtypes based on intrinsic properties of tissue signatures as opposed to the classical tumor grading (e.g., Gleason ranking in prostate cancer), practiced by pathologists. In this context, TCGA's histology database can provide a complementary repository for diagnostic and molecular underpinning for histological subtypes. Subsequently, molecular signature of a subtype can hypothesize a more effective targeted therapy. Our continued research focuses on addressing limitations that has been addressed in the Result section. Ultimately, we plan to develop a system that will process all tumor types.
This research was supported by the National Cancer Institute U24 CA1437991 and RO1CA140663 and the U.S. Department of Energy under contract number DE-AC02-05CH11231. The research utilized the Lawrencium computational cluster resource, provided by the IT Division at the Lawrence Berkeley National Laboratory, and the Center for Information Technology Research in the Interest of Society (CITRIS) at the University of California-Berkeley.
- Dalton L, Pinder S, Elston C, Ellis I, Page D, Dupont W, Blamey R: Histolgical gradings of breast cancer: linkage of patient outcome with level of pathologist agreements. Modern Pathology 2000, 13: 730–735. 10.1038/modpathol.3880126View ArticlePubMedGoogle Scholar
- Stupp R, Mason W, vanen Bent M, Weller M, Fisher B, Taphoorn M, Belanger K, Brandes A, Marosi C, Bogdahn U, et al.: Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. New England Journal of Medicine 2005, 352(10):987–996. 10.1056/NEJMoa043330View ArticlePubMedGoogle Scholar
- Demir C, Yener B: Automated cancer diagnosis based on histopathological images: a systematic survey. Rensselaer Polytechnic Institute; 2009.Google Scholar
- Latson L, Sebek N, Powell K: Automated cell nuclear segmentation in color images of hematoxylin and eosin-stained breast biopsy. Analytical and Quantitative Cytology and Histology 2003, 26(6):321–331.Google Scholar
- Doyle S, Feldman M, Tomaszewski J, Shih N, Madabhushu A: Cascade multi-class pairwise classifier (CASCAMPA) for normal, cancerous, and cancer confunder classes in prostate histology. In International Synposium on Biomedical Imaging: from nano to macro. IEEE; 2011:715–718.Google Scholar
- Basavanhally A, Xu J, Madabhushu A, Ganesan S: Computer-aided prognosis of ER+ breast cancer histopathology and correlating survival outcome with oncotype DX assay. In International Synposium on Biomedical Imaging: from nano to macro. IEEE; 2009:851–854.Google Scholar
- Kerlikowske K, Molinaro A, Cha I, et al.: Characteristics associated with recurrence among women with ductal carcinoma in situ treated by lmpectomy. Journal of the National Cancer Institute 2003, 95: 1692–1702. 10.1093/jnci/djg097View ArticlePubMedGoogle Scholar
- Miller N, Chapman J, Fish E: In situ duct carcinoma of the breast: clinical and histopathologic factors and association with recurrent carcinoma. Breast Journal 2001, 7: 292–302. 10.1046/j.1524-4741.2001.99124.xView ArticlePubMedGoogle Scholar
- Axelrod D, Miller N, Lickley H, Qian J, Christens-Barry W, Yuan Y, Fu Y, Chapman J: Effect of quantitative nuclear features on recurrence of ductal carcinoma in situ (DCIS) of breast. Cancer Informatics 2008, 4: 99–109.Google Scholar
- Chapman J, Miller N, Lickley H, Qian J, Christens-Barry W, Fu Y, Yuan Y, Axelrod D: Ductal carcinoma in situ of the breast (DCIS) with heterogeneity of nuclear grade: prognostic effects of quantitative nuclear assessment. BMC Cancer 2007., 7(174):Google Scholar
- Peinta K, Coffey D: Correlation of nuclear morphometry with progression of breast cancer. Cancer 1991, 68: 2012–2016. 10.1002/1097-0142(19911101)68:9<2012::AID-CNCR2820680928>3.0.CO;2-CView ArticleGoogle Scholar
- Mommers E, Poulin N, Sangulin J, Meiher C, Baak J, van Diest P: Nuclear cytometric changes in breast carcinogenesis. Journal of Pathology 2001, 193(1):33–39. 10.1002/1096-9896(2000)9999:9999<::AID-PATH744>3.0.CO;2-QView ArticlePubMedGoogle Scholar
- Veltri R, Khan M, Miller M, Epstein J, Mangold L, Walsh P, Partin A: Ability to predict metastasis based on pathology findings and alterations in nuclear structure of normal appearing and cancer peripheral zone epithelium in the prostate. Clinical Cancer Research 2004, 10: 3465–3473. 10.1158/1078-0432.CCR-03-0635View ArticlePubMedGoogle Scholar
- Verhest A, Kiss R, d'Olne D, Larsimont D, Salman I, de Launoit Y, Fourneau C, Pastells J, Pector J: Characterization of human colorectal mucosa, polyps, and cancers by means of computerized mophonuclear image analysis. Cancer 1990, 65: 2047–2054. 10.1002/1097-0142(19900501)65:9<2047::AID-CNCR2820650926>3.0.CO;2-4View ArticlePubMedGoogle Scholar
- Bhagavatula R, Fickus M, Kelly W, Guo C, Ozolek J, Castro C, Kovacevic J: Automatic identification and delineation of germ layer components in H&E stained images of teratomas derived from human and nonhuman primate embryonic stem cells. In International Synposium on Biomedical Imaging: from nano to macro. IEEE; 2010:1041–1044.Google Scholar
- Kong J, Cooper L, Sharma A, Kurk T, Brat D, Saltz J: Texture based image recognition in microscopy images of diffuse gliomas with multi-class gentle boosting mechanism. ICASSAP: 2010 2010, 457–460.Google Scholar
- Tambasco M, Magliocco A: Relationship between tumor grade and computed architectural complexity in breast cancer specimens. Human Pathology 2008, 39(5):740–746. 10.1016/j.humpath.2007.10.001View ArticlePubMedGoogle Scholar
- Dey P, Mohanty S: Fractal dimensions of breast lesions on cytology smears. Diagn Cytopathol 2003, 29: 85–87. 10.1002/dc.10324View ArticlePubMedGoogle Scholar
- Fatakdawala H, Xu J, Basavanhally A, Bhanot G, Ganesan S, Feldman F, Tomaszewski J, Madabhushi A: Expectation-maximization-driven geodesic active contours with overlap resolution (EMaGACOR): Application to lymphocyte segmentation on breast cancer histopathology. IEEE Transactions on Biomedical Engineering 2010, 57(7):1676–1690.View ArticlePubMedGoogle Scholar
- Zhang L, Conejo-Garcia J, Katsaros P, Gimotty P, Massobrio M, Regnani G, Makrigiannakis A, Gray H, Schlienger K, Liebman M, et al.: Intratumoral T cells, recurrence, and survival in epithelial ovarian cancer. New England Journal of Medicine 2003, 348(3):203–213. 10.1056/NEJMoa020177View ArticlePubMedGoogle Scholar
- Phukpattaranont P, Boonyaphiphat P: Color based segmentation of nuclear stained breast cancer cell images. ECTI Transactions on Electrical Engineering, and Communication 2007, 5(2):158–164.Google Scholar
- Ballaro B, Florena A, Franco V, Tegolo D, Tripodo C, Valenti C: An automated image analysis methodology for classifying megakaryocytes in chronic myelprliferative disorders. Medical Image Analysis 2008, 12: 703–712. 10.1016/j.media.2008.04.001View ArticlePubMedGoogle Scholar
- Land W, McKee D, Zhukov T, Song D, Qian W: A kernelised fuzzy-Support Vector Machine CAD system for the diagnosis of lung cancer from tissue images. International Journal of Functional Informatics and Personalised Medicine 2008, 1(1):26–52. 10.1504/IJFIPM.2008.018291View ArticleGoogle Scholar
- Bamford P, Lovell B: Unsupervise cell segmentation with active contours. Signal Process 1998, 71(2):203–213. 10.1016/S0165-1684(98)00145-5View ArticleGoogle Scholar
- Glotsos D, Spyridonos P, Cavouras D, Ravazoula P, Dadioti P, Nikiforidis G: Automated segmentation of routinely hematoxyli-eosin stained microscopic images by combining support vector machine, clustering, and active contour models. Anal Quant Cytol Histol 2004, 26(6):331–340.PubMedGoogle Scholar
- Fatakdawala H, Basavanhally A, Xu J, Bhanot G, Ganesan S, Feldman M, Tomaszewski J, Madabhushi A: Expectation maximization driven geodesic active contour: application to lymphocyte segmentation on digitized breast cancer histhopatholgy. International conference on bioinformatics and bioengineering 2009, 69–76.Google Scholar
- Cosatto E, Miller M, Graf H, Meyer J: Grading nuclear plemorphism on histological micrographs. International Conference on Pattern Recognition 2008, 1–4.Google Scholar
- Chang H, Defilippis RA, Tlsty TD, Parvin B: Graphical methods for quantifying macromolecules through bright field imaging. Bioinformatics 2009, 25(8):1070–1075. 10.1093/bioinformatics/btn426PubMed CentralView ArticlePubMedGoogle Scholar
- Datar M, Padfield D, Cline H: Color and texture based segmentation of molecular pathology images usING HSOMS. In International Symposium for Biomedical Imaging: from nano to maco. IEEE; 2008:292–295.Google Scholar
- Petushi S, Garcia F, Haber M, Katsinis C, Tozeren A: Large-scale computations on histology images reveal grade-differentiation parameters for breast cancer. BMC Medical Imaging 2006., 6(14):Google Scholar
- Doyle S, Agner S, Madabhushi A, Feldman M, Tomaszewski : Automated grading of breast cancer histipathology using spectral clustering with textural and architectural image features. International Symposium on Biomedical Imaging: from nano to macro 2008, 496–499.Google Scholar
- Bunyak F, Hafiane A, Palanippan K: Histopathology tissue segmentation by combining fuzzy clustering with multiphase vector level set. Advances in Experimental Medicine and Biology 2011, 696: 413–242. 10.1007/978-1-4419-7046-6_41View ArticlePubMedGoogle Scholar
- Parvin B, Yang Q, Han J, Change H, Rydberg B, Barcellos-Hoff MH: Iterative voting for inference of structural saliency and localization of subcellular structures. IEEE Transactions on Image Processing 2007., 16(3):Google Scholar
- Nath S, Palaniappan K, Bunyak F: Cell segmentation using coupled level sets and graph-vertex coloring. Medical Image Computing and Computed-assisted Intervention-Miccai: 2006 2006, 101–108.View ArticleGoogle Scholar
- Chang H, Parvin B: Multiphase level set for automated delineation of membrane-bound macromolecules. In International Symposium for Biomedical Imaging: from nano to macro. IEEE; 2010:165–168.Google Scholar
- Han J, Chang H, Yang Q, Groesser T, Barcellos-Hoff M, Parvin B: Multiscale iterative voting for differential analysis of stress response for 2D and 3D cell culture models. Journal of Microscopy 2010, 241(3):315–326.View ArticlePubMedGoogle Scholar
- Han J, Chang H, Andrarwewa K, Yaswen P, Barcellos-Hoff M, Parvin B: Multidimensional profiling of cell surface proteins and nuclear markers. IEEE Transactions on Computational Biology and Bioinformatics 2010, 7(1):80–90.View ArticlePubMedGoogle Scholar
- Coelho L, Shariff A, Murphy R: Nuclear Segmentation in Microscope Cell Images: A Hand-Segmented Dataset and Comparison of Algorithms. In International Symposium on Biomedical Imaging: from nano to macro. IEEE; 2009:690–693.Google Scholar
- Loss L, Bebis G, Parvin B: Iterative tensor voting for perceptual grouping if ill-defined curvilinear structures. IEEE Transactions on Medical Imaging 2011, 30(8):1503–1513.PubMed CentralView ArticlePubMedGoogle Scholar
- Wen Q, Chang H, Parvin B: A Delaunay triangulation approach for segmenting a clump of nuclei. In International Synposium on Biomedical Imaging: from nano to macro. IEEE; 2009:9–12.Google Scholar
- Raman S, Maxwell C, Barcellos-Hoff MH, Parvin B: Geometric approach to segmentation and protein localization in cell culture assays. Journal of Microscopy 2007, 225(Part 1):22–30.View ArticlePubMedGoogle Scholar
- Chang H, Yang Q, Parvin B: Segmentation of heterogeneous blob objects through voting and level set formulation. Pattern Recognition Letters 2007, 28(13):1781–1787. 10.1016/j.patrec.2007.05.008PubMed CentralView ArticlePubMedGoogle Scholar
- Han J, Chang H, Giricz O, Lee G, Baehner F, Gray J, Bissell M, Kenny P, Parvin B: Molecular Predictors of 3D Morphogenesis by Breast Cancer Cell Lines in 3D Culture. PLoS Computational Biology 2010, 6(2):e1000684. 10.1371/journal.pcbi.1000684PubMed CentralView ArticlePubMedGoogle Scholar
- Chan T, Vese L: Active contours without edges. IEEE Transactions on Image Processing 2001, 10(2):266–277. 10.1109/83.902291View ArticlePubMedGoogle Scholar
- Rabinovich A, Agarwal S, Larris C, Price J, Belongie S: Unsupervised color decomposition of histologically stained tissue samples. In Advances in Neural Information Processing Systems. MIT Press; 2003:667–674.Google Scholar
- Ruifork A, Johnston D: Quantification of histochemical staining by color decomposition. Anal Quant Cytol Histology 2001, 23(4):291–299.Google Scholar
- Swedlow J, Goldberg I, Brauner E, Sorger P: Informatics and quantitative analysis in biological imaging. Science 2003, 300: 100–102. 10.1126/science.1082602PubMed CentralView ArticlePubMedGoogle Scholar
- Goldberg I, Allan C, Burel JM, Creager A, Falconi H, Hochheiser H, Johnston J, Mellen J, Sorger P, Swedlow J: The open microscopy environment (OME) data model and xml files: open tools for informatics and quantitative analysis in biological images. Genome and Biology 2005, 6(5):R47. 10.1186/gb-2005-6-5-r47View ArticleGoogle Scholar
- Parvin B, Fontenay G, Yang Q, Barcellos-Hoff MH: BioSig: an imaging bioinformatics system for phenotypic analysis. IEEE Transactions on System, Man, and Cybernetics-Part B 2003, 33(5):814–824. 10.1109/TSMCB.2003.816929View ArticleGoogle Scholar
- Monti S, Tamayo P, Mesirov J, Golub T: Consensus clustering: A resampling-based method for class discovery and visualization of gene expression microarray data. Machine Learning 2003, 52(1–2):91–118.View ArticleGoogle Scholar
- Han J, Chang H, Fontenay G, Wang N, Gray J, Parvin B: Morphometric subtyping for a panel of breast cancer cell lines. In International Symposium on Biomedical Imaging: from Nano to Macro. IEEE; 2009:791–794.View ArticleGoogle Scholar
- Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl GenetT Mo B 2004, 3: A3.Google Scholar
- Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc B 1995, 57: 397–420.Google Scholar
- Breiman L: Random Forests. Machine Learning 2001, 45(1):5–32. 10.1023/A:1010933404324View ArticleGoogle Scholar
- Diaz-Uriarte R, Alvarez de Andres S: Gene selection and classification of microarray data using random forest. BMC Bioinformatics 2006, 7: 3. 10.1186/1471-2105-7-3PubMed CentralView ArticlePubMedGoogle Scholar
- Liaw A, Wiener M: Classification and Regression by randomForest. R News 2002, 2(3):18–22.Google Scholar
- Verhaak R, Hoadley K, Purdom E, Wang V, Qi Y, Wilkerson M, Miller C, Ding L, Golub T, Mesirov J, et al.: Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell 2010, 17: 98–110. 10.1016/j.ccr.2009.12.020PubMed CentralView ArticlePubMedGoogle Scholar
- Mantel N: Evaluation of survival data and two new rank order statistics arising in its consideration. Cancer Chemotherapy Reports 1966, 50(3):163–170.Google Scholar
- Johnson L, Sampson J: immunotherapy approaches for malignant glioma from 2007 to 2009. Curr Neurol Neurosi Rep 2010, 10(4):259–266. 10.1007/s11910-010-0111-9View ArticleGoogle Scholar
- Mitchell D, Sampson J: Toward effective immunotherapy for the treatment of malignant brain tumors. Neurotherapeutics 2009, 6(3):527–538. 10.1016/j.nurt.2009.04.003PubMed CentralView ArticlePubMedGoogle Scholar
- Wu A, Ericson K, Chao W, Low W: NFAT and AP1 are essential for the expression of a glioblastoma multiforme related IL-13Ra2 transcript. Cell Oncology 2010, 32(5–6):313–329.Google Scholar
- Lin B, Madan A, Yoon J, Fang X, Yan X, Kim T, Hwang D, Hood L, Foltz G: Massively parallel signature sequencing and bioinformatics analysis identifies up-regulation of TGFBI and SOX4 in human glioblastoma. PLoS One 2010., 5(4):Google Scholar
- Kazanietz M: Protein Kinase C in cancer signaling and therapy. Humana Press; 2010.View ArticleGoogle Scholar
- Martin P, JHussanini I: PKC eta as a therapeutic target in glioblastoma multiforme. Expert Opin Ther Targets 2005, 9(2):299–313. 10.1517/14728184.108.40.2069View ArticlePubMedGoogle Scholar
- Keyse S: Stress response: methods and protocols. Totowa, New Jersey: Humana press; 2000.View ArticleGoogle Scholar
- Cooper L, Kong J, Wang F, Kurk T, Moreno C, Brat D, Saltz J: Morphological Signatures and Genomic Correlates in Glioblastoma. In International Symposium on Biomedical Imaging: from nano to macro. IEEE; 2011:791–794.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.