Skip to main content

Table 5 PhyloSophos mapping status for scientific names (n = 59,570) found in four NP occurrence databases

From: PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science, and its application on natural product (NP) occurrence database processing

Scientific name mapping status

COL

EOL

GBIF

NCBI

Exactly matched without correction

43,279

35,812

46,385

37,236

Exactly matched with simple correction

3,500

2,823

3,773

2,197

Recursive mapping for scientific names which are exactly matched using other taxonomic references

1,351

9,595

198

4,154

Nearest taxon mapping for scientific names which are exactly matched using other taxonomic references

2,892

2,907

784

7,553

Matched with edit distance-based correction

3,079

3,108

3,106

2,943

Nearest mapping for strains & species affinis

2,529

2,523

2,553

2,531

Latin declension correction

490

439

527

480

Partial mapping

1,494

1,516

1,494

1,615

Unmapped

956

847

750

861