
Table 1 Size of KaBOB

From: KaBOB: ontology-based semantic integration of biomedical databases

 

| subset | imported OBOs: # triples | imported OBOs: size .owl (GB) | ICE records: # triples | ICE records: size .nt.gzip (GB) | generated (rules and id sets): # triples | generated (rules and id sets): size .nt.gzip (GB) | KaBOB total: # triples | KaBOB total: size (GB) |
|---|---|---|---|---|---|---|---|---|
| human only | 13,830,676 | 1.5 | 144,489,737 | 2.0 | 7,615,547 | 0.2 | 165,935,960 | 3.6 |
| human +7 major model organisms | 13,830,676 | 1.5 | 369,027,022 | 4.9 | 34,968,305 | 0.7 | 417,826,003 | 7.1 |
| all organisms | 13,830,676 | 1.5 | 9,584,033,541 | 126 | n/a | n/a | n/a | n/a |

  1. Lists the size of the various collections of RDF generated in the KaBOB build process, recorded as number of triples and size on disk. The first three major columns cover the imported OBOs, the ICE records (output of the file parsers), and the generated triples (output of the rules and ID merging). The fourth column is the sum of the first three. The rows represent subsets of the KaBOB data based on the organisms included: human only, human plus seven major model organisms (listed in the paper), and all organisms combined. Due to the scale of the final subset, its data are currently incomplete.
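
The statement that the fourth column is the sum of the first three can be checked directly against the triple counts in the table. The following is a minimal sketch, not part of the KaBOB build; the dictionary keys and the helper name component_sum are hypothetical and chosen only for illustration. The "all organisms" row is left out because its generated and total values are n/a.

```python
# Triple counts transcribed from Table 1 (complete rows only).
# Key names are illustrative assumptions, not identifiers from KaBOB.
rows = {
    "human only": {
        "imported_obos": 13_830_676,
        "ice_records": 144_489_737,
        "generated": 7_615_547,
        "kabob_total": 165_935_960,
    },
    "human +7 major model organisms": {
        "imported_obos": 13_830_676,
        "ice_records": 369_027_022,
        "generated": 34_968_305,
        "kabob_total": 417_826_003,
    },
}


def component_sum(row):
    """Add the three component triple counts for one subset."""
    return row["imported_obos"] + row["ice_records"] + row["generated"]


for subset, row in rows.items():
    computed = component_sum(row)
    status = "matches" if computed == row["kabob_total"] else "differs from"
    print(f"{subset}: computed {computed:,} {status} reported {row['kabob_total']:,}")
```

The check covers triple counts only. The on-disk sizes are reported to one decimal place and in different serializations (.owl versus .nt.gzip), so the size columns should be expected to add up only approximately.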