From: Ontology-driven indexing of public datasets for translational bioinformatics
Resource | Number of elements | Resource local size (Mb) | Number of direct annotations (mgrep results) | Total number of 'useful'¹ annotations | Average number of annotating concepts |
---|---|---|---|---|---|
PubMed (subset) | 1050000 | 146.1 | 30822190 | 174840027 | 763 |
ArrayExpress | 3371 | 3.6 | 502122 | 1849224 | 525 |
ClinicalTrials.gov | 50303 | 99 | 16108580 | 48796501 | 824 |
Gene Expression Omnibus | 2085 | 0.7 | 165539 | 772608 | 359 |
ARRS GoldMiner (subset) | 1155 | 0.5 | 134229 | 662687 | 564 |
TOTAL | 1106914 | 249.9 | 47732660 | 226921047 | 461.5 (avg) |
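
The TOTAL row can be cross-checked against the per-resource rows with a few lines of Python. This is only a minimal sketch: the figures are transcribed from the table above, and the names `rows`, `reported_total`, and `column_sums` are hypothetical helpers introduced for illustration. The last column is left out because the table does not state how the reported average is aggregated.

```python
# Figures transcribed from the table above, per resource:
# (number of elements, local size in Mb, direct annotations, 'useful' annotations)
rows = {
    "PubMed (subset)": (1050000, 146.1, 30822190, 174840027),
    "ArrayExpress": (3371, 3.6, 502122, 1849224),
    "ClinicalTrials.gov": (50303, 99.0, 16108580, 48796501),
    "Gene Expression Omnibus": (2085, 0.7, 165539, 772608),
    "ARRS GoldMiner (subset)": (1155, 0.5, 134229, 662687),
}

# TOTAL row as reported in the table (average column omitted).
reported_total = (1106914, 249.9, 47732660, 226921047)


def column_sums(table):
    """Sum each column across resources; round to one decimal to absorb float noise."""
    return tuple(round(sum(column), 1) for column in zip(*table.values()))


if __name__ == "__main__":
    sums = column_sums(rows)
    for label, got, expected in zip(
        ("elements", "size (Mb)", "direct annotations", "useful annotations"),
        sums,
        reported_total,
    ):
        status = "matches" if got == expected else "differs from"
        print(f"{label}: computed {got} {status} reported {expected}")
```

Summing the columns this way keeps the check independent of any assumption about how the annotations were produced; it only tests the internal consistency of the table.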