Skip to main content

Advertisement

Table 1 Synoptic report of the loading procedure

From: GIDL: a rule based expert system for GenBank Intelligent Data Loading into the Molecular Biodiversity database

DIVISION   MIN MAX AVERAGE Standard Deviation # ENTRIES
VRL SequenceLength (bp) 9 1181404 1097 4206 814122
  FeatureTableLength (byte) 716 887303 2504 3556  
  # Features 2 2894 4.5 8.8  
  # Statements 22 27367 52.1 94.5  
  ParsingTime (ms) 1 1925 6.5 10.0  
  ReasoningTime (ms) 72 173912 95.6 235.3  
  InsertTime (ms) 361 531333 759.3 6472.2  
  TotalTime (ms) 446 531467 861.5 6491.5  
INV SequenceLength (bp) 7 3291871 1536 17715 959065
  FeatureTableLength (byte) 710 1536275 2222 6264  
  # Features 2 4033 4.5 22.0  
  # Statements 24 42815 51.9 197.9  
  ParsingTime (ms) 1 11044 6.1 24.4  
  ReasoningTime (ms) 67 391901 101.6 1020.6  
  InsertTime (ms) 503 1702407 670.2 4689.0  
  TotalTime (ms) 573 2098246 778.0 5406.3  
PLN SequenceLength (bp) 2 3439086 2378 16960 1365360
  FeatureTableLength (byte) 821 1844292 2397 7106  
  # Features 2 4113 4.6 17.5  
  # Statements 23 33032 51.4 185.8  
  ParsingTime (ms) 1 5546 6.3 17.4  
  ReasoningTime (ms) 73 5803323 107.5 5633.7  
  InsertTime (ms) 423 401933 559.0 1518.5  
  TotalTime (ms) 501 6206793 672.8 6315.6  
EST SequenceLength (bp) 7 1770 659 193 1035087
  FeatureTableLength (byte) 1358 4798 2563 351  
  # Features 2 2 2 0  
  # Statements 25 34 30.1 1.1  
  ParsingTime (ms) 3 233 6.1 2.8  
  ReasoningTime (ms) 73 1089 83.3 10.9  
  InsertTime (ms) 372 38746 499 959.1  
  TotalTime (ms) 454 38859 588.4 950.6  
  1. This table shows, in a synoptic view, the main parameters of the loading procedure to populate the Molecular Biodiversity Database. These numbers refer to entries coming from four GenBank divisions (VRL, INV, PLN and EST). For each of these sets, the first four rows describe some aspects of the GenBank entries, while the other four ones refer to parameters measured during the loading procedure. The VRL and INV sets were loaded by using the topology shown in Figure 5(a), while the other sets were loaded by using the topology shown in Figure 5(b). See text for a complete discussion of this topic.