Skip to main content

Table 2 Genomes from the NCBI Genome database for second data set.

From: Swiftly Computing Center Strings

Species name

Refseq

Genes

PC

Aquifex aeolicus

NC_000918

1580

1529

Clostridium acetobutylicum ATCC 824

NC_003030

3843

3671

Corynebacterium glutamicum ATCC 13032

NC_003450

3073

2993

Deinococcus radiodurans R1 chromosome 1,

NC_001263

2687

2629

Deinococcus radiodurans R1 chromosome 2

NC_001264

369

268

Fusobacterium nucleatum

NC_003454

2125

2063

Listeria innocua Clip11262

NC_003212

3065

2968

Mesorhizobium loti

NC_002678

6804

674

Mycoplasma genitalium

NC_000908

524

475

Mycoplasma pneumoniae

NC_000912

733

689

Mycoplasma pulmonis

NC_002771

815

782

Mycobacterium tuberculosis CDC1551

NC_002755

4293

4189

Ralstonia solanacearum, megaplasmid

NC_003296

1684

1676

Ralstonia solanacearum

NC_003295

3503

3437

Rickettsia conorii str. Malish 7

NC_003103

1414

1374

Salmonella typhimurium LT2

NC_003197

4620

4423

Staphylococcus aureus subsp. aureus N315

NC_002745

2664

2583

Synechocystis sp. PCC 6803

NC_000911

3229

3179

Thermotoga maritima

NC_000853

1928

1858

Ureaplasma urealyticum

NC_011374

695

646

Bacillus halodurans C-125

NC_002570

4170

4065

Bacillus subtilis

NC_014479

4170

4062

Borrelia burgdorferi

NC_001318

890

851

Buchnera sp. APS

NC_002528

607

564

Campylobacter jejuni

NC_008787

1707

1653

Caulobacter crescentus

NC_002696

3819

3737

Chlamydia pneumoniae

NC_000922

1122

1052

Chlamydia trachomatis

NC_000117

940

895

Escherichia coli O157:H7

NC_002695

5371

5229

Escherichia coli str. K-12 substr. MG1655

NC_000913

4493

4149

Haemophilus influenzae Rd

NC_000907

1789

1657

Helicobacter pylori 26695

NC_000915

1627

1573

Helicobacter pylori str. J99

NC_000921

1534

1488

Lactococcus lactis

NC_002662

2425

2321

Xylella fastidiosa

NC_002488

2838

2766

Neisseria meningitidis serogroup B str. MC58

NC_003112

2225

2063

Pasteurella multocida PM70

NC_002663

2092

2015

Pseudomonas aeruginosa PA01

NC_002516

5669

5566

Rickettsia prowazekii str. Madrid E

NC_000963

888

835

Streptococcus pneumoniae

NC_012467

2254

2073

Streptococcus pyogenes str. SF370 serotype M1

NC_002737

1810

1696

Treponema pallidum

NC_000919

1095

1036

Vibrio cholerae chromosome 1

NC_012668

2897

2768

Vibrio cholerae chromosome 2

NC_012667

1013

1004

Neisseria meningitidis serogroup A str. Z2491

NC_003116

2065

1909

Mycobacterium leprae str. TN

NC_002677

2770

1605

  1. Genomes from the NCBI Genome database used for detection of approximate gene clusters to generate biological instances of the center string problem. 'Refseq' is the reference sequence from NCBI Genome database, 'PC' the number of protein-coding genes.