Skip to main content

Table 1 Comparison of different regions of sequence conservation in the muscle related genes. (A) Number of experimentally verified TFBSs found in the conserved regions of different sizes. The Wasserman conserved regions were taken from Wasserman et al. [15]; note that the regions were identified using a Bayesian alignment method between human and mouse, while the upstream regions that we call conserved are from the UCSC Bioinformatics Site [29,30]. The 1, 2, 5 kb conserved regions were generated using the blastz alignment of the human (hg16) and mouse (mm4) taken from the UCSC Bioinformatics Site. The name of each TF family is listed next to the number of known sites taken from Wasserman et al. [14]. The number of sites found within the CRMs is listed next. The number of sites found in the sequences generated by SequenceExtractor is listed according to the length of the region. (B) Number of sites matching the corresponding TFBS motif, at a Pearson correlation coefficient threshold of 0.6, within each of the regions shown in (A). (C) Relative enrichment of each TFBS motif in the specified regions, normalized to the CRMS. Specifically, the enrichment is calculated by dividing the frequency of the motif in the region of interest by its frequency in the CRMs.

From: Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data

(A) Experimentally verified transcription factor binding sites.

TF

Known Sites

Wasserman Conserved Sites

1 kb Upstream Conserved Sites

2 kb Upstream Conserved Sites

5 kb Upstream Conserved Sites

Mef2

21

16

11

11

11

Myf

24

18

14

14

14

Sp1

24

19

16

16

16

SRF

16

12

10

10

10

Tef

9

9

6

6

6

(B) Sites matching PWM.

TF

 

Wasserman Conserved Sites

1 kb Upstream Conserved Sites

2 kb Upstream Conserved Sites

5 kb Upstream Conserved Sites

Mef2

 

25

13

15

17

Myf

 

78

36

44

91

Sp1

 

79

43

54

90

SRF

 

31

22

22

32

Tef

 

10

7

8

13

(C) Relative enrichment of PWM matches in sequence windows.

TF

 

Wasserman Conserved Sites

1 kb Upstream Conserved Sites

2 kb Upstream Conserved Sites

5 kb Upstream Conserved Sites

Mef2

 

1.00

1.16

1.00

0.57

Myf

 

1.00

1.03

0.94

0.97

Sp1

 

1.00

1.21

1.14

0.95

SRF

 

1.00

1.59

1.19

0.86

Tef

 

1.00

1.56

1.34

1.08