
Table 6. Genomes annotated de novo with GALBA, using the reference protein sets listed in Additional file 1: Table S1, as use cases that demonstrate the applicability of GALBA

From: Galba: genome annotation with miniprot and AUGUSTUS

| Species | Assembly | Size (Gbp) | nSeqs | N50 (nt) | BUSCO C (%) | RM (%) |
|---|---|---|---|---|---|---|
| Vespula vulgaris | GCA_014466185.1 | 0.18 | 35 | 8,304,510 | 94.9 | 19.5 |
| Vespula germanica | GCA_014466195.1 | 0.18 | 133 | 8,396,154 | 93.6 | 19.9 |
| Vespula pensylvanica | GCA_014466175.1 | 0.18 | 225 | 8,532,720 | 96.2 | 19.4 |
| Polistes dominula | GCA_001465965.1 | 0.21 | 1,483 | 1,625,592 | 95.7 | 48.1 |
| Balaenoptera bonaerensis | GCA_000978805.1 | 2.23 | 421,444 | 20,082 | 54.1 | 34.0 |
| Eubalaena japonica | GCA_004363455.1 | 2.69 | 1,353,963 | 39,813 | 74.9 | 43.3 |
| Inia geoffrensis | GCA_004363515.1 | 2.60 | 1,213,610 | 26,707 | 67.7 | 43.8 |
| Kogia breviceps | GCA_004363705.1 | 2.76 | 1,252,072 | 28,812 | 66.1 | 41.3 |
| Phocoena phocoena | GCA_004363495.1 | 2.70 | 1,331,158 | 115,969 | 85.9 | 44.7 |
| Platanista gangetica | GCA_004363435.1 | 2.67 | 1,098,790 | 23,933 | 59.1 | 44.7 |
| Ziphius cavirostris | GCA_004364475.1 | 3.15 | 3,758,276 | 3,608 | 39.9 | 45.1 |
| Coix aquatica | GCA_009725075.1 | 1.62 | 2,012 | 148,397,812 | 97.8 | 83.3 |

nSeqs: number of sequences in the assembly; BUSCO C: percentage of BUSCOs detected as complete; RM: percentage of repeat-masked nucleotides in the assembly.