Skip to main content

Table 1 Motif factors and p values of TATAAAAG and TATATAAG and their extension sequences on S1000 and S2000.

From: Frequency distribution of TATA Box and extension sequences on human promoters

Sequences S1000 S2000 p
TATAAAAG 19 12.3 <1e-16
G TATAAAAG 18 8.5 <1e-16
GG TATAAAAG 8 8 2.3e-10
C TATAAAAG 11 10 <1e-16
TC TATAAAAG 8 6 1.6e-11
TATAAAAGC 26 21 <1e-16
TATAAAAGCA 7 9 8.6e-8
TATAAAAGG 8.3 7.2 1.9e-15
TATAAAAGGC 8 12 1.4e-12
TATAAAAGGG 8 9 2.0e-12
TATATAAG 9 7.3 <1e-16
G TATATAAG 17 15 <1e-16
GG TATATAAG 6 8 <1e-16
C TATATAAG 9 8 8.3e-14
TATATAAGG 13.5 11 <1e-16
TATAAAAAGG 8 8 4.0e-12
  1. TATA extension sequences which are statistically significant mainly extend from two TATA elements: TATAAAAG and TATATAAG. Table 1 gives the motif factors and p values for these two TATA elements and fourteen TATA extension sequences. P values are calculated based on the human promoters of length 1000 bp. In Table 1, bases of italic bold font are the extension bases.