Table 2 Barcode datasets description.

From: Alignment-free analysis of barcode sequences by means of compression-based methods

| Dataset | # Species | # Specimens | % Sequences with undefined bases | Sequence length (bp) |
|---|---|---|---|---|
| ABSMC | 46 | 72 | 1.3% | 650-657 |
| AECI | 30 | 30 | 0.0% | 605-679 |
| AGFDO | 22 | 22 | 0.0% | 901 |
| AGFSU | 42 | 48 | 2.0% | 633-639 |
| AGLUO | 38 | 46 | 2.1% | 630 |
| AGWEB | 33 | 33 | 87.0% | 900 |
| ARCPU | 28 | 52 | 5.0% | 625-658 |
| BACX | 74 | 119 | 2.5% | 616-657 |
| BCUB | 30 | 108 | 0.9% | 657 |
| BLSPA | 86 | 86 | 4.0% | 604-658 |
| BRBP | 17 | 106 | 0.0% | 658 |
| BSHMT | 22 | 141 | 5.6% | 645 |
| CNLVA | 33 | 73 | 5.0% | 625-658 |
| DLTC | 40 | 67 | 1.5% | 689-1821 |
| DSALA | 12 | 44 | 11.0% | 649-651 |
| DSANA | 14 | 274 | 0.0% | 652 |
| DSFCH | 17 | 173 | 3.4% | 620-650 |
| FBLGO | 44 | 122 | 2.4% | 580-658 |
| FBLOT | 34 | 64 | 3.0% | 419-658 |
| GBFBA | 27 | 27 | 7.0% | 669 |
| GZPSE | 23 | 78 | 7.7% | 601-658 |
| JDWAM | 103 | 226 | 8.8% | 620-650 |
| JTB | 53 | 225 | 0.4% | 658-899 |
| MHTRI | 13 | 108 | 3.7% | 620-650 |
| MJMSL | 76 | 198 | 4.5% | 559-658 |
| Onychophora | 52 | 210 | 0.9% | 451-884 |
| PLOCE | 33 | 102 | 0.0% | 620-660 |
| RDMYS | 6 | 37 | 32.0% | 636 |
| SIBHI | 38 | 85 | 0.0% | 650-694 |
| WXYZ | 9 | 34 | 3.0% | 650-680 |

The main features of the 30 barcode datasets used in our experimental tests.
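
The four descriptive columns of Table 2 could be derived from a set of labelled sequences roughly as follows. This is a minimal sketch, not the authors' procedure: the `Record` type and the `summarize` function are hypothetical names. It assumes that an "undefined base" is any character other than A, C, G, or T, and that the percentage counts sequences containing at least one such character.

```python
from collections import namedtuple

# Hypothetical container: one barcode specimen with its species label.
Record = namedtuple("Record", ["species", "sequence"])

def summarize(records):
    """Return the four descriptive columns of Table 2 for one dataset."""
    lengths = [len(r.sequence) for r in records]
    # Assumption: any symbol outside A, C, G, T counts as an undefined base.
    with_undefined = sum(
        1 for r in records if set(r.sequence.upper()) - set("ACGT")
    )
    shortest, longest = min(lengths), max(lengths)
    return {
        "species": len({r.species for r in records}),
        "specimens": len(records),
        # Share of sequences that contain at least one undefined base.
        "pct_undefined": round(100.0 * with_undefined / len(records), 1),
        # A single value when all sequences have the same length.
        "length": str(shortest) if shortest == longest
                  else f"{shortest}-{longest}",
    }

# Toy input, not taken from any of the 30 datasets.
toy = [
    Record("sp1", "ACGT"),
    Record("sp1", "ACNT"),
    Record("sp2", "ACGTA"),
    Record("sp3", "ACGT"),
]
summary = summarize(toy)
```

In a dataset such as AGWEB, where 87.0% of sequences carry undefined bases, the choice of how to treat those symbols matters far more than in datasets where the figure is 0.0%.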