Skip to main content

Table 1 Motif factors and p values of TATAAAAG and TATATAAG and their extension sequences on S1000 and S2000.

From: Frequency distribution of TATA Box and extension sequences on human promoters

Sequences

S1000

S2000

p

TATAAAAG

19

12.3

<1e-16

G TATAAAAG

18

8.5

<1e-16

GG TATAAAAG

8

8

2.3e-10

C TATAAAAG

11

10

<1e-16

TC TATAAAAG

8

6

1.6e-11

TATAAAAGC

26

21

<1e-16

TATAAAAGCA

7

9

8.6e-8

TATAAAAGG

8.3

7.2

1.9e-15

TATAAAAGGC

8

12

1.4e-12

TATAAAAGGG

8

9

2.0e-12

TATATAAG

9

7.3

<1e-16

G TATATAAG

17

15

<1e-16

GG TATATAAG

6

8

<1e-16

C TATATAAG

9

8

8.3e-14

TATATAAGG

13.5

11

<1e-16

TATAAAAAGG

8

8

4.0e-12

  1. TATA extension sequences which are statistically significant mainly extend from two TATA elements: TATAAAAG and TATATAAG. Table 1 gives the motif factors and p values for these two TATA elements and fourteen TATA extension sequences. P values are calculated based on the human promoters of length 1000 bp. In Table 1, bases of italic bold font are the extension bases.