Skip to main content

Table 1 SIDD is the most distinct variable that differentiates promoter from non-promoter sequences

From: Promoter prediction and annotation of microbial genomes based on DNA sequence and structural responses to superhelical stress

 

Promoter region

 

vs.

 

Coding region

CON region

SIDD

1.0308*10-76 a/ 4.0961*10-72 b

1.0398*10-46 a/ 2.5736*10-44 b

Curvature

2.4277*10-15 a/ 5.3737*10-14 c

5.3170*10-5 a/ 1.5965*10-5 c

Deformation

7.116*10-63 a/ 1.0000b

1.2783*10-31 a/ 0.8567 b

Thermo-Stability

5.5028*10-42 a/ 0.4981 b

1.0527*10-14 a/ 0.9997 b

-10 motif

1.1882*10-74 d

2.6299*10-30 d

  1. Each value in the table is the probability that the two distributions are the same, as found using the Kolmogorov-Smirnov two sample test.
  2. a, sum of the values of the variables in the sequences; b, minimum value of the variable in the sequences; c, maximum value of the variable in the sequences; d, sum of the -10 motif scores of the sequences