Table 4 The performance of AL-BP-MD-ME-MS on a dataset of shuffled margin sequences a).

From: EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences

                    nPC                    nSn                    nSp
Margin size (nt)    100    200    400      100    200    400      100    200    400
Original set b)     0.288  0.254  0.197    0.328  0.292  0.234    0.416  0.360  0.270
Shuffled set c)     0.317  0.255  0.187    0.340  0.275  0.201    0.481  0.375  0.266

a) Artificially shuffled sequences of 100, 200, and 400 nt are attached to both sides of the known sites in the ECRDB61B data set. The dinucleotide (di-mer) frequency statistics used to generate the shuffled margin sequences are taken from intergenic regions of the E. coli genome.
b) The performance on the original ECRDB61B-100, 200, and 400 sets.
c) The performance on the data sets with shuffled margin sequences.
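
Footnote a) does not spell out how the shuffled margins are sampled. As an illustration only, the sketch below treats the dinucleotide counts of a background sequence as first-order Markov transition weights and draws margins of a chosen length from them. The function names (`dimer_counts`, `generate_margin`, `attach_margins`), the pseudocount, and the toy background string are assumptions made for this example; they are not taken from the paper.

```python
import random
from collections import defaultdict

ALPHABET = "ACGT"

def dimer_counts(background):
    """Count overlapping dinucleotides in a background sequence."""
    counts = defaultdict(lambda: defaultdict(int))
    for first, second in zip(background, background[1:]):
        if first in ALPHABET and second in ALPHABET:
            counts[first][second] += 1
    return counts

def generate_margin(counts, length, rng):
    """Sample a sequence using dinucleotide counts as transition weights.

    Assumption: a first-order Markov chain with a +1 pseudocount, so that
    nucleotides unseen in the background never create a dead end.
    """
    if length <= 0:
        return ""
    # First nucleotide: proportional to how often it starts a dinucleotide.
    start_weights = [sum(counts[a].values()) + 1 for a in ALPHABET]
    seq = rng.choices(ALPHABET, weights=start_weights)
    while len(seq) < length:
        row = counts[seq[-1]]
        weights = [row[b] + 1 for b in ALPHABET]
        seq.append(rng.choices(ALPHABET, weights=weights)[0])
    return "".join(seq)

def attach_margins(site, counts, margin, rng):
    """Flank a known site with generated margins of `margin` nt on each side."""
    left = generate_margin(counts, margin, rng)
    right = generate_margin(counts, margin, rng)
    return left + site + right

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed for reproducibility
    # Toy background; the paper uses E. coli intergenic regions instead.
    counts = dimer_counts("ACGTACGTTTGACAGGCTAATCG")
    for margin in (100, 200, 400):
        seq = attach_margins("TTGACA", counts, margin, rng)
        print(margin, len(seq))
```

With margins of 100, 200, and 400 nt, each generated sequence has length 2 × margin plus the site length, and the site sits at the same offset as in the original data set, so only the flanking context changes between the two rows of the table.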