Skip to main content

Table 4 The composition of the entire data set

From: LTPConstraint: a transfer learning based end-to-end method for RNA secondary structure prediction

 

Samples

Train

Test

Validate

Rfam_128

21487

17190

2149

2148

Rfam_512

8762

7010

876

876

5SrRNA

10788

8630

1079

1079

tRNA

9245

7396

925

924

PDB

669

535

67

67

SPR

622

498

62

62

grpI

2135

1708

214

213

RNP

466

373

47

46

SRP

959

767

96

96

telomerase

37

30

3

4

tmRNA

637

510

64

63

  1. The column of samples represents the total number of a data set. The column train represents the data volume of the training set, and similarly test and validate represent the data volume of the test set and validation set