Table 3 Summary of assembly of paralogous Arabidopsis genes

From: SRAssembler: Selective Recursive local Assembly of homologous genomic regions

| Arabidopsis target | Total paralogs in clade | Complete paralogs | Complete gene bodies | Gene bodies >50% assembled |
|---|---|---|---|---|
| At1g01040.2 | 6 | 1 (17%) | 1 (17%) | 1 (17%) |
| At1g01050.1 | 6 | 4 (67%) | 4 (67%) | 5 (83%) |
| At1g01090.1 | 3 | 1 (33%) | 1 (33%) | 1 (33%) |
| At1g01230.1 | 2 | 1 (50%) | 2 (100%) | 2 (100%) |
| At1g01560.2 | 29 | 3 (10%) | 5 (17%) | 17 (59%) |
| At1g01620.1 | 37 | 12 (32%) | 14 (38%) | 17 (46%) |
| At1g01750.2 | 12 | 9 (75%) | 10 (83%) | 10 (83%) |
| At1g01820.1 | 5 | 3 (60%) | 3 (60%) | 3 (60%) |
| At1g01910.1 | 3 | 1 (33%) | 1 (33%) | 3 (100%) |
| At1g01940.1 | 30 | 3 (10%) | 6 (20%) | 12 (40%) |
| At1g01950.3 | 64 | 1 (2%) | 1 (2%) | 10 (16%) |
| At1g02130.1 | 63 | 22 (35%) | 30 (48%) | 40 (63%) |
| At1g02140.1 | 1 | 1 (100%) | 1 (100%) | 1 (100%) |
| At1g02500.1 | 4 | 3 (75%) | 4 (100%) | 4 (100%) |
| At1g02560.1 | 10 | 2 (20%) | 3 (30%) | 3 (30%) |
| At1g02780.1 | 4 | 3 (75%) | 3 (75%) | 3 (75%) |
| At1g02830.1 | 3 | 3 (100%) | 3 (100%) | 3 (100%) |
| At1g03190.1 | 6 | 1 (17%) | 1 (17%) | 1 (17%) |
| At1g03475.1 | 2 | 2 (100%) | 2 (100%) | 2 (100%) |
| At1g03630.1 | 5 | 3 (60%) | 3 (60%) | 3 (60%) |

  1. Nineteen out of the 20 Arabidopsis gene “targets” have at least one annotated paralog. SRAssembler was able to completely assemble at least one additional paralog for 13 of those targets. In many cases in which the complete paralog was not assembled, contigs still covered a significant fraction of the gene body (region from the start codon to the stop codon). If further investigation of a clade is desired, the final contigs could be used as starting queries for new SRAssembler runs.
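
The tallies in the note above can be checked against the rows of Table 3 with a few lines of Python. This is only an illustrative sketch, not part of SRAssembler: the `TABLE_3` list and the `percent` helper are hypothetical names introduced here, and the check assumes that each count in the table includes the target gene itself, so that "at least one additional paralog" corresponds to a count greater than 1.

```python
# Rows transcribed from Table 3:
# (target, total paralogs in clade, complete paralogs,
#  complete gene bodies, gene bodies >50% assembled)
TABLE_3 = [
    ("At1g01040.2", 6, 1, 1, 1),
    ("At1g01050.1", 6, 4, 4, 5),
    ("At1g01090.1", 3, 1, 1, 1),
    ("At1g01230.1", 2, 1, 2, 2),
    ("At1g01560.2", 29, 3, 5, 17),
    ("At1g01620.1", 37, 12, 14, 17),
    ("At1g01750.2", 12, 9, 10, 10),
    ("At1g01820.1", 5, 3, 3, 3),
    ("At1g01910.1", 3, 1, 1, 3),
    ("At1g01940.1", 30, 3, 6, 12),
    ("At1g01950.3", 64, 1, 1, 10),
    ("At1g02130.1", 63, 22, 30, 40),
    ("At1g02140.1", 1, 1, 1, 1),
    ("At1g02500.1", 4, 3, 4, 4),
    ("At1g02560.1", 10, 2, 3, 3),
    ("At1g02780.1", 4, 3, 3, 3),
    ("At1g02830.1", 3, 3, 3, 3),
    ("At1g03190.1", 6, 1, 1, 1),
    ("At1g03475.1", 2, 2, 2, 2),
    ("At1g03630.1", 5, 3, 3, 3),
]


def percent(count, total):
    """Share of the clade as a whole-number percentage, as shown in the table."""
    return round(100 * count / total)


# Assumption: counts include the target itself, so "has a paralog" means total > 1
# and "an additional paralog was completely assembled" means complete > 1.
targets_with_paralogs = [row[0] for row in TABLE_3 if row[1] > 1]
targets_with_extra_complete = [row[0] for row in TABLE_3 if row[2] > 1]

print(len(TABLE_3))                      # 20 targets
print(len(targets_with_paralogs))        # 19
print(len(targets_with_extra_complete))  # 13
print(percent(17, 29))                   # 59, matching the At1g01560.2 row
```

Under that assumption the counts agree with the note: 19 targets with at least one annotated paralog, and 13 with at least one additional completely assembled paralog.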