Skip to main content

Table 1 Synoptic report of the loading procedure

From: GIDL: a rule based expert system for GenBank Intelligent Data Loading into the Molecular Biodiversity database

DIVISION

 

MIN

MAX

AVERAGE

Standard Deviation

# ENTRIES

VRL

SequenceLength (bp)

9

1181404

1097

4206

814122

 

FeatureTableLength (byte)

716

887303

2504

3556

 
 

# Features

2

2894

4.5

8.8

 
 

# Statements

22

27367

52.1

94.5

 
 

ParsingTime (ms)

1

1925

6.5

10.0

 
 

ReasoningTime (ms)

72

173912

95.6

235.3

 
 

InsertTime (ms)

361

531333

759.3

6472.2

 
 

TotalTime (ms)

446

531467

861.5

6491.5

 

INV

SequenceLength (bp)

7

3291871

1536

17715

959065

 

FeatureTableLength (byte)

710

1536275

2222

6264

 
 

# Features

2

4033

4.5

22.0

 
 

# Statements

24

42815

51.9

197.9

 
 

ParsingTime (ms)

1

11044

6.1

24.4

 
 

ReasoningTime (ms)

67

391901

101.6

1020.6

 
 

InsertTime (ms)

503

1702407

670.2

4689.0

 
 

TotalTime (ms)

573

2098246

778.0

5406.3

 

PLN

SequenceLength (bp)

2

3439086

2378

16960

1365360

 

FeatureTableLength (byte)

821

1844292

2397

7106

 
 

# Features

2

4113

4.6

17.5

 
 

# Statements

23

33032

51.4

185.8

 
 

ParsingTime (ms)

1

5546

6.3

17.4

 
 

ReasoningTime (ms)

73

5803323

107.5

5633.7

 
 

InsertTime (ms)

423

401933

559.0

1518.5

 
 

TotalTime (ms)

501

6206793

672.8

6315.6

 

EST

SequenceLength (bp)

7

1770

659

193

1035087

 

FeatureTableLength (byte)

1358

4798

2563

351

 
 

# Features

2

2

2

0

 
 

# Statements

25

34

30.1

1.1

 
 

ParsingTime (ms)

3

233

6.1

2.8

 
 

ReasoningTime (ms)

73

1089

83.3

10.9

 
 

InsertTime (ms)

372

38746

499

959.1

 
 

TotalTime (ms)

454

38859

588.4

950.6

 
  1. This table shows, in a synoptic view, the main parameters of the loading procedure to populate the Molecular Biodiversity Database. These numbers refer to entries coming from four GenBank divisions (VRL, INV, PLN and EST). For each of these sets, the first four rows describe some aspects of the GenBank entries, while the other four ones refer to parameters measured during the loading procedure. The VRL and INV sets were loaded by using the topology shown in Figure 5(a), while the other sets were loaded by using the topology shown in Figure 5(b). See text for a complete discussion of this topic.