From: A molecular structure matching approach to efficient identification of endogenous mammalian biochemical structures
Dataset
Number of Compounds
BioSM
BioSMXpress
HMDB Compounds
2,329
88%
91%
Plant Metabolites
2,416
73%
72%
Drug Compounds
3,282
62%
58%
PubChem Compounds
46,203
35%
25%
Synthetic Compounds
374,509
33%
36%