Table 6 PhyloSophos mapping status for scientific names found in the individual NP occurrence databases, using NCBI taxonomy as the reference system

From: PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science, and its application on natural product (NP) occurrence database processing

| Scientific name mapping status | COCONUT | IMPPAT | LOTUS | NPASS |
|---|---:|---:|---:|---:|
| Exactly matched without correction | 15,617 | 3,591 | 30,338 | 17,178 |
| Exactly matched with simple correction | 1,994 | 0 | 13 | 376 |
| Recursive mapping for scientific names which are exactly matched using other taxonomic references | 2,304 | 111 | 735 | 2,595 |
| Nearest taxon mapping for scientific names which are exactly matched using other taxonomic references | 2,724 | 299 | 5,597 | 2,950 |
| Matched with edit distance-based correction | 2,432 | 8 | 74 | 694 |
| Nearest mapping for strains & species affinis | 1,750 | 0 | 0 | 814 |
| Latin declension correction | 96 | 0 | 0 | 390 |
| Partial mapping | 1,296 | 1 | 38 | 349 |
| Unmapped | 315 | 0 | 8 | 553 |
| (Total) | 28,528 | 4,010 | 36,803 | 25,899 |
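
The counts in Table 6 can be checked arithmetically. The following is a minimal Python sketch that re-adds the nine status rows per database and compares the sum with the reported (Total) row, then computes the percentage of names left unmapped. The names `COUNTS`, `REPORTED_TOTALS`, `column_total`, and `unmapped_percent` are illustrative only and are not part of PhyloSophos; the numbers are transcribed from the table.

```python
# Counts transcribed from Table 6, in table row order:
# exact, simple correction, recursive, nearest taxon, edit distance,
# strains & sp. aff., Latin declension, partial, unmapped.
COUNTS = {
    "COCONUT": [15617, 1994, 2304, 2724, 2432, 1750, 96, 1296, 315],
    "IMPPAT": [3591, 0, 111, 299, 8, 0, 0, 1, 0],
    "LOTUS": [30338, 13, 735, 5597, 74, 0, 0, 38, 8],
    "NPASS": [17178, 376, 2595, 2950, 694, 814, 390, 349, 553],
}

# The (Total) row as reported in the table.
REPORTED_TOTALS = {
    "COCONUT": 28528,
    "IMPPAT": 4010,
    "LOTUS": 36803,
    "NPASS": 25899,
}


def column_total(counts):
    """Sum of all status rows for one database."""
    return sum(counts)


def unmapped_percent(counts):
    """Percentage of names in the 'Unmapped' row (last entry)."""
    return 100.0 * counts[-1] / column_total(counts)


if __name__ == "__main__":
    for db, counts in COUNTS.items():
        total = column_total(counts)
        status = "ok" if total == REPORTED_TOTALS[db] else "MISMATCH"
        print(f"{db}: total={total} ({status}), "
              f"unmapped={unmapped_percent(counts):.2f}%")
```

Here "unmapped" refers only to the final status row; whether partial mappings should also be counted as unresolved is a judgment call this sketch does not make.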