Skip to main content

Table 1 Species selected for this work with reference to the respective abbreviation and identification of genome sequence data by accession number for bacteria or genome assembly project for eukaryotes

From: Linear-time computation of minimal absent words using suffix array

Species

Abbreviation

Genome reference

Bacteria

  

Bacillus anthracis strain Ames

Ba

NC003997

Bacillus subtilis strain 168

Bs

NC000964

Escherichia coli strain K-12 substrain MG1655

Ec

NC000913

Haemophilus influenzae strain Rd KW20

Hi

NC000907

Helicobacter pylori strain 26695

Hp

NC000915

Lactobacillus casei strain BL23

Lc

NC010999

Lactococcus lactis strain Il1403

Ll

NC002662

Mycoplasma genitalium strain G37

Mg

NC000908

Staphylococcus aureus strain N315

Sa

NC002745

Streptococcus pneumoniae strain CGSP14

Sp

NC010582

Xanthomonas campestris strain 8004

Xc

NC007086

Eukaryotes

  

Arabidopsis thaliana (thale cress)

At

AGI release 7.2

Drosophila melanogaster (fruit fly)

Dm

FlyBase release 5

Homo sapiens (human)

Hs

build 38

Mus musculus (mouse)

Mm

build 38