Skip to main content

Table 5 Top 10 discovered motifs after alignment postprocessing

From: Discovering motifs that induce sequencing errors

Rank

Context

FER

[%]

RER

[%]

ERD

[%]

1

ACGGCGGT

26.1

0.5

25.6

2

GTGGCGGT

25.1

0.7

24.4

3

GCGGCGGT

22.9

0.7

22.2

4

GTGGCTGT

22.4

0.6

21.8

5

ATGGCGGT

21.2

1.0

20.3

6

NCGGCGGT

20.0

0.7

19.3

7

GTGGCTTG

20.2

1.2

19.0

8

GNGGCGGT

19.2

0.7

18.5

9

GCGGCTGT

18.8

0.7

18.1

10

ACGGCTGT

18.6

0.8

17.7

  1. Top 10 (based on ERD) contexts on dataset GAIIx-bs with (q, n) = (8, 4) after GATK postprocessing and duplicate removal.