From: BIOZON: a system for unification, management and analysis of heterogeneous biological data
Document type | Representation | Atomic units |
---|---|---|
protein sequence | string | amino acids |
nucleic acid sequence | string | nucleic acids |
protein family | set | proteins |
pathway | set | protein families |
domain | ordered pair | sequence coordinates |
domain family | set | domains |
interaction | set | proteins, nucleic acids |
descriptor | text | characters |
structure | list | 3D coordinates |
unigene cluster | set | nucleic acids (ESTs) |