From: MICA: desktop software for comprehensive searching of DNA databases
File Element | Generic Storage Requirement (bytes) | Storage Requirement for Chromosome 1 (bytes) |
---|---|---|
Sequence Segment | ||
A. Segment Format | 1 | 1 |
B. Segment Size | 4 | 4 |
C. Sequence Properties | 1 | 1 |
D. DNA Sequence | L | 245,522,847 (234 MB) |
SEGMENT TOTAL | 6 + L | 245,522,853 (234 MB) |
Index Segment | ||
E. Segment Format | 1 | 1 |
F. Segment Size | 4 | 4 |
G. Index Properties | 1 | 1 |
H. Chunk Counts Summary | 4^(K+1) | K = 4: 1,024 (1 KB); K = 6: 16,384 (16 KB) |
I. Degenerate K-mer Count | 4 | 4 |
J. N-Stretch Count (S) | 4 | 4 |
K. Chunk Data Array | (4^K * C + number of nondegenerate K-mers) * 2 | K = 4: 447,573,936 (427 MB); K = 6: 476,350,748 (454 MB) |
L. Degenerate Data Array | (number of partially degenerate K-mers) * (4 + K) | K = 4: 1,752 (1.7 KB); K = 6: 3,650 (3.6 KB) |
M. N-Stretch Data Array | 8S | 296 |
SEGMENT TOTAL | Typically about 2L bytes. | K = 4: 447,577,022 (427 MB); K = 6: 476,371,092 (454 MB) |
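
The segment totals in the table can be checked by summing the per-element sizes. The following is a minimal sketch of that arithmetic, not MICA's own code: the function and variable names are hypothetical, and because the table does not list the chunk count C or the K-mer counts for chromosome 1, the sizes of the three data arrays are taken directly from the table as bytes.

```python
# Sketch of the storage arithmetic in the table above.
# All names here are illustrative assumptions, not part of MICA.

# Fixed-size index elements: E (1) + F (4) + G (1) + I (4) + J (4) bytes.
FIXED_INDEX_BYTES = 1 + 4 + 1 + 4 + 4


def sequence_segment_bytes(length: int) -> int:
    """Elements A-D: 6 header bytes plus one byte per base (6 + L)."""
    return 6 + length


def chunk_counts_summary_bytes(k: int) -> int:
    """Element H: 4^(K+1) bytes."""
    return 4 ** (k + 1)


def chunk_data_array_bytes(k: int, chunks: int, nondegenerate_kmers: int) -> int:
    """Element K: (4^K * C + number of nondegenerate K-mers) * 2 bytes."""
    return (4 ** k * chunks + nondegenerate_kmers) * 2


def degenerate_data_array_bytes(k: int, partially_degenerate_kmers: int) -> int:
    """Element L: (number of partially degenerate K-mers) * (4 + K) bytes."""
    return partially_degenerate_kmers * (4 + k)


def n_stretch_data_array_bytes(n_stretches: int) -> int:
    """Element M: 8 bytes per N-stretch (8S)."""
    return 8 * n_stretches


def index_segment_bytes(k: int, chunk_data: int, degenerate: int, n_stretch: int) -> int:
    """Elements E-M, given the byte sizes of the three data arrays."""
    return (FIXED_INDEX_BYTES + chunk_counts_summary_bytes(k)
            + chunk_data + degenerate + n_stretch)


def to_mb(n_bytes: int) -> int:
    """Round to whole megabytes, taking 1 MB = 2^20 bytes as the table appears to."""
    return round(n_bytes / 2 ** 20)


# Chromosome 1 values copied from the table (bytes).
CHR1_LENGTH = 245_522_847
CHR1_ARRAYS = {
    4: {"chunk_data": 447_573_936, "degenerate": 1_752, "n_stretch": 296},
    6: {"chunk_data": 476_350_748, "degenerate": 3_650, "n_stretch": 296},
}

if __name__ == "__main__":
    seq = sequence_segment_bytes(CHR1_LENGTH)
    print(f"sequence segment: {seq:,} bytes ({to_mb(seq)} MB)")
    for k, arrays in CHR1_ARRAYS.items():
        total = index_segment_bytes(k, **arrays)
        print(f"index segment, K = {k}: {total:,} bytes ({to_mb(total)} MB)")
```

Run as a script, this reproduces the SEGMENT TOTAL rows: 245,522,853 bytes for the sequence segment, and 447,577,022 and 476,371,092 bytes for the index segment at K = 4 and K = 6.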