
Table 3 Real datasets in the mESC data

From: SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets

| Dataset | Motif | (l, d) | t | q |
| --- | --- | --- | --- | --- |
| c-Myc | GCACGTGGC | (9, 2) | 3422 | 0.60 |
| CTCF | CCACCAGGGGGCG | (13, 4) | 39,601 | 0.58 |
| Esrrb | GGTCAAGGTCA | (11, 3) | 21,644 | 0.54 |
| Klf4 | GGGTGTGGC | (9, 2) | 10,872 | 0.61 |
| Nanog | CCTTGTCATGC | (11, 3) | 10,342 | 0.26 |
| n-Myc | GCACGTGGC | (9, 2) | 7181 | 0.57 |
| Oct4 | CATTGTTATGCAAAT | (15, 5) | 3775 | 0.29 |
| Smad1 | CCTTTGTTATGCA | (13, 4) | 1126 | 0.36 |
| Sox2 | CATTGTTATGCAAAT | (15, 5) | 4525 | 0.39 |
| STAT3 | TTCCCGGAA | (9, 2) | 2546 | 0.61 |
| Tcfcp2l1 | CCGGTTCAAACCG | (13, 4) | 26,907 | 0.29 |
| Zfx | GCTAGGCCGCG | (11, 3) | 10,336 | 0.49 |
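
To make the columns concrete, here is a minimal sketch of how one row could be checked. It assumes the usual quorum planted motif search reading of the symbols, which the table itself does not spell out: l is the motif length, d the maximum number of mismatches allowed per occurrence, t the number of sequences, and q the minimum fraction of sequences that must contain an occurrence. The function names and the three toy sequences are hypothetical and are not taken from the mESC data.

```python
def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def occurs(motif: str, seq: str, d: int) -> bool:
    """True if some l-mer of seq is within Hamming distance d of motif."""
    l = len(motif)
    return any(
        hamming(motif, seq[i:i + l]) <= d
        for i in range(len(seq) - l + 1)
    )


def satisfies_quorum(motif: str, seqs: list[str], d: int, q: float) -> bool:
    """True if the motif occurs in at least a fraction q of the sequences."""
    hits = sum(occurs(motif, s, d) for s in seqs)
    return hits / len(seqs) >= q


# Toy input (hypothetical, not real ChIP-seq reads), using the c-Myc row:
# motif GCACGTGGC with (l, d) = (9, 2) and q = 0.60.
motif = "GCACGTGGC"
toy_seqs = [
    "TTGCACGTGGCAA",  # contains the motif exactly
    "AAGCTCGTGGATT",  # contains GCTCGTGGA, two mismatches from the motif
    "ACGTACGTACGTA",  # no 9-mer within two mismatches
]
quorum_met = satisfies_quorum(motif, toy_seqs, d=2, q=0.60)
```

Here the motif occurs in two of the three toy sequences, so the fraction 2/3 meets the quorum of 0.60. On the real datasets t runs into the tens of thousands, and this brute-force scan only illustrates the definition; it is not the SamSelect algorithm.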