ChIP-on-chip significance analysis reveals ubiquitous transcription factor binding

Margolin, Adam A; Palomero, Teresa; Ferrando, Adolfo A; Califano, Andrea; Stolovitzky, Gustavo

doi:10.1186/1471-2105-8-S8-S2

Volume 8 Supplement 8

Highlights from the Third International Symposium for Computational Biology (ISCB) Student Council Symposium at the Fifteenth Annual International Conference on Intelligent Systems for Molecular Biology (ISMB)

Oral presentation
Open access
Published: 20 November 2007

Adam A Margolin^1,2,3,
Teresa Palomero^2,
Adolfo A Ferrando^2,
Andrea Califano^1,2 &
Gustavo Stolovitzky^3

BMC Bioinformatics volume 8, Article number: S2 (2007)

Background

ChIP-on-chip technology provides a genome-scale view of transcription factor (TF)/target interactions and a systems-level window into transcriptional regulatory networks. However, while many studies have used ChIP-on-chip data effectively to discover new TF targets, statistical methods have so far failed to provide an accurate model that separates signals caused by experimental noise from those caused by true biological variation. As a result, the technology has not yet been leveraged to provide high-confidence predictions of the full range of interactions.

Method

This paper presents a novel method to accurately model the significance of binding events measured by ChIP-on-chip data. For each arrayed probe representing a genomic segment, a ChIP-on-chip microarray measures intensity levels for two channels: the IP channel, which is enriched in genomic fragments bound by an immunoprecipitated TF, and the whole-cell extract (WCE) channel, which represents random genomic fragments. Statistical significance is inferred by computing the conditional probability $p(M \mid A)$, where $M = \log_2\left(\frac{IP}{WCE}\right)$ is the log ratio and $A = \frac{\log_2(IP) + \log_2(WCE)}{2}$ is the average log intensity (Fig. 1). A kernel density estimation procedure is used to calculate the joint probability $p(M, A)$, and for each average intensity value $A$, the mean of the null distribution (i.e. the distribution for unbound probes) is inferred as $\hat{M}_A = \underset{M}{\arg\max}\; p(M \mid A)$. The portion of $p(M \mid A)$ with $M < \hat{M}_A$ is then reflected about $\hat{M}_A$ to yield the inferred null distribution, which is symmetric by construction and is used to assign statistical significance scores. Probes from replicate experiments and probes whose genomic locations lie within the fragmentation length (~500 bp) of one another are integrated to produce a single significance score for each genomic region.
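The reflection step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several choices are assumptions made only for illustration: conditioning on $A$ is approximated by equal-count bins of $A$ instead of a joint kernel density estimate of $p(M, A)$, the mode in each bin is found with a one-dimensional Gaussian KDE, and the p-value is an empirical upper-tail probability under the mirrored sample. The function names `ma_transform` and `reflected_null_pvalues` and the parameters `n_bins` and `grid_size` are hypothetical.

```python
import numpy as np
from scipy.stats import gaussian_kde


def ma_transform(ip, wce):
    """Return (M, A): log2 ratio and average log2 intensity of the two channels."""
    log_ip = np.log2(np.asarray(ip, dtype=float))
    log_wce = np.log2(np.asarray(wce, dtype=float))
    return log_ip - log_wce, (log_ip + log_wce) / 2.0


def reflected_null_pvalues(m, a, n_bins=10, grid_size=512):
    """One-sided p-values for enrichment under a mirrored null distribution.

    Assumption: p(M | A) is approximated within equal-count bins of A.
    """
    m = np.asarray(m, dtype=float)
    a = np.asarray(a, dtype=float)
    pvals = np.full(m.shape, np.nan)

    # Assign each probe to an equal-count bin of A.
    edges = np.quantile(a, np.linspace(0.0, 1.0, n_bins + 1))
    bin_idx = np.clip(np.searchsorted(edges, a, side="right") - 1, 0, n_bins - 1)

    for b in range(n_bins):
        sel = bin_idx == b
        mb = m[sel]
        if mb.size < 5 or np.ptp(mb) == 0.0:
            continue  # too little data to estimate a density; leave NaN

        # Estimate the mode of M in this bin: M_hat = argmax_M p(M | A).
        grid = np.linspace(mb.min(), mb.max(), grid_size)
        mode = grid[np.argmax(gaussian_kde(mb)(grid))]

        # Mirror the probes at or below the mode about the mode to build the null sample.
        left = mb[mb <= mode]
        null = np.sort(np.concatenate([left, 2.0 * mode - left]))

        # Empirical upper-tail probability P(null >= M), with add-one smoothing.
        n_ge = null.size - np.searchsorted(null, mb, side="left")
        pvals[sel] = (n_ge + 1.0) / (null.size + 1.0)

    return pvals


if __name__ == "__main__":
    # Synthetic intensities for illustration only (not data from the study).
    rng = np.random.default_rng(1)
    a_sim = rng.uniform(6.0, 14.0, 2000)
    m_sim = rng.normal(0.0, 0.3, 2000)
    m_sim[:100] += 2.0  # simulated bound probes
    m, a = ma_transform(2.0 ** (a_sim + m_sim / 2.0), 2.0 ** (a_sim - m_sim / 2.0))
    p = reflected_null_pvalues(m, a)
    print("median p (simulated bound):", np.median(p[:100]))
    print("median p (simulated unbound):", np.median(p[100:]))
```

Using only the probes below the mode to define the null relies on the idea that true binding shifts $M$ upward, so the left half of the distribution is assumed to be dominated by unbound probes. Combining scores across replicates and neighbouring probes is not covered by this sketch.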

Results

The method is tested on six different ChIP-on-chip arrays representing replicate experiments for three different TFs (NOTCH1, MYC and HES1). For each experiment, this analysis reveals an order of magnitude more genomic binding events than are detected by traditional methods, predicting several thousand interactions for each TF and suggesting previously unappreciated complexity of transcriptional regulatory networks. Several independent experiments provide evidence for the validity of these predictions. First, biochemical validation of more than 20 predicted targets by gene-specific ChIP and qPCR confirms the accuracy of the false discovery rate statistics computed by the method. Second, binding site enrichment analysis indicates that the strength of binding site signals is maintained over several thousand promoters. Finally, gene expression analysis reveals a coordinated downregulation of gene expression across the entire range of predicted NOTCH1-bound genes upon NOTCH1 inhibition in cell lines, indicating that a large percentage of bound genes are also functionally regulated by NOTCH1.
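The abstract does not state how the false discovery rate statistics are computed. As a point of reference only, the sketch below shows the standard Benjamini-Hochberg adjustment, which converts a set of p-values into FDR estimates (q-values). It is a swapped-in, widely used procedure and is not claimed to be the one used by the authors; the function name `benjamini_hochberg` is hypothetical.

```python
import numpy as np


def benjamini_hochberg(pvals):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in the input order."""
    p = np.asarray(pvals, dtype=float)
    n = p.size
    order = np.argsort(p)

    # Scale the k-th smallest p-value by n / k.
    scaled = p[order] * n / np.arange(1, n + 1)

    # Enforce monotonicity: each q-value is the minimum of the scaled values at or above its rank.
    q_sorted = np.minimum.accumulate(scaled[::-1])[::-1]

    q = np.empty(n)
    q[order] = np.clip(q_sorted, 0.0, 1.0)
    return q


if __name__ == "__main__":
    # Illustrative p-values only (not results from the study).
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

Under this procedure, calling every region with a q-value below a threshold such as 0.05 is expected to yield roughly that proportion of false positives among the calls, which is the kind of statement that targeted ChIP and qPCR validation can check.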

Author information

Authors and Affiliations

Department of Biomedical Informatics, Columbia University, New York, NY, 10032, USA
Adam A Margolin & Andrea Califano
Joint Centers for Systems Biology, Columbia University, New York, NY, 10032, USA
Adam A Margolin, Teresa Palomero, Adolfo A Ferrando & Andrea Califano
Systems Biology Group, IBM Research, Yorktown Heights, NY, 10598, USA
Adam A Margolin & Gustavo Stolovitzky

Corresponding author

Correspondence to Adam A Margolin.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Margolin, A.A., Palomero, T., Ferrando, A.A. et al. ChIP-on-chip significance analysis reveals ubiquitous transcription factor binding. BMC Bioinformatics 8 (Suppl 8), S2 (2007). https://doi.org/10.1186/1471-2105-8-S8-S2
