Skip to main content

Table 1 Total CPU time and alignment quality for DIALIGN (D) and DIALIGN anchored with CHAOS (C+D) applied to a set of 42 pairs of genomic sequences from human and mouse [12]. CHAOS was run with varying cutoff parameters. Lower cutoff values for CHAOS produced higher numbers of anchor points resulting in a decreased search space for the final DIALIGN alignment procedure thus leading to improved running time but slightly decreased alignment quality. The average number of anchor points per kilobase is shown (anc./kb). Score is the total numerical score of all produced DIALIGN alignments, i.e. the sum of the scores of the segment pairs in the alignments. As a rough measure of the biological quality of the produced alignments, we compared local sequence similarities identified by DIALIGN and CHAOS to known protein-coding regions. Here, Sn, Sp and AC are sensitivity, specificity and approximate correlation, respectively. For the D and C+D results, DIALIGN was evaluated by comparing all segment pairs contained in the alignment to annotated exons.

From: Fast and sensitive multiple alignment of large genomic sequences

program

cutoff

anc./kb

CPU

%CPU

score

%score

Sn

Sp

AC

D

  

179,001

100.0

54,214

100.0

83

40

57

C+D

35

1.4

14,334

8.0

53,839

99.3

83

40

57

C+D

30

1.7

11,717

6.5

53,820

99.2

83

40

57

C+D

25

2.1

11,485

6.4

53,654

98.9

83

40

57

C+D

20

2.8

8,964

5.0

53,642

98.9

83

40

57

C+D

15

4.2

7,404

4.1

53,208

98.1

82

41

57

C+D

10

6.5

6,696

3.7

52,684

97.1

82

41

57