Figure 5
From: BIOZON: a system for unification, management and analysis of heterogeneous biological data

Data integration. Individual elements d from source databases are translated to their Biozon representation by the transformation function T_D. The graph ∑ that results from integrating these elements contains only non-redundant objects, which merges the data from disparate sources into a cohesive whole. As shown, six records from GenPept, SwissProt, BIND and DIP are translated into Biozon graph form. Each record is transformed into a set of objects and descriptors. Identical proteins from a SwissProt record and a GenPept record are instantiated as a single non-redundant protein object P1 on the graph. Similarly, a second pair of records is mapped to a single object P2. As a result, the two interaction objects, one from BIND and one from DIP, are mapped to the same object I1.
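
The merging described in this caption can be illustrated with a minimal sketch. It is not Biozon's implementation: the class `Graph`, its methods, the accession strings and the sequences are all hypothetical. The sketch assumes that two protein records are identical when their sequences match exactly, and that two interaction records are identical when they link the same pair of protein objects.

```python
from dataclasses import dataclass, field


@dataclass
class Graph:
    """Toy integrated graph: non-redundant objects plus per-source descriptors."""
    proteins: dict = field(default_factory=dict)      # sequence -> protein object id
    interactions: dict = field(default_factory=dict)  # frozenset of protein ids -> interaction id
    descriptors: list = field(default_factory=list)   # (object id, source, accession)

    def add_protein(self, source, accession, sequence):
        # Assumption: identity is the exact sequence, so identical proteins
        # from different sources resolve to one object.
        pid = self.proteins.setdefault(sequence, f"P{len(self.proteins) + 1}")
        # The source-specific record survives as a descriptor on that object.
        self.descriptors.append((pid, source, accession))
        return pid

    def add_interaction(self, source, accession, pid_a, pid_b):
        # Assumption: an interaction is identified by its unordered pair of
        # protein objects, so the argument order does not matter.
        key = frozenset((pid_a, pid_b))
        iid = self.interactions.setdefault(key, f"I{len(self.interactions) + 1}")
        self.descriptors.append((iid, source, accession))
        return iid


# Six hypothetical records: two per protein, two for the interaction.
graph = Graph()
p1 = graph.add_protein("SwissProt", "sp-1", "MKTAYIAK")
p1_again = graph.add_protein("GenPept", "gp-1", "MKTAYIAK")
p2 = graph.add_protein("SwissProt", "sp-2", "MSDNELQR")
p2_again = graph.add_protein("GenPept", "gp-2", "MSDNELQR")
i1 = graph.add_interaction("BIND", "bind-1", p1, p2)
i1_again = graph.add_interaction("DIP", "dip-1", p2_again, p1_again)
```

Because both protein pairs collapse to P1 and P2 first, the BIND and DIP interaction records end up with the same key and therefore the same interaction object, while all six source records remain attached as descriptors.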