Skip to main content

Table 2 Real datasets selected from the ENCODE TF ChIP-seq data

From: SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets

Dataset

Motif

(l, d)

t

q

egr1

CCGCCCCCGCA

(11, 3)

15,400

0.68

elf1

AACCCGGAAGT

(11, 3)

8611

0.54

hnf4

GGGTCAAAGTCCA

(13, 4)

11,045

0.53

myc

ACCACGTGCTC

(11, 3)

4542

0.49

nfy

ACTAACCAATCAG

(13, 4)

9781

0.44

sp1

GGGGCGGGG

(9, 2)

14,779

0.52

srf

TGACCATATATGGTC

(15, 5)

4903

0.36

yy1

CGGCCATCT

(9, 2)

2077

0.49