Table 2 Categorical FASTA files of transcripts after data processing

From: IIMLP: integrated information-entropy-based method for LncRNA prediction

| Transcript types | GRCh37 ncRNAs | GRCh37 PCTs | GRCh38 ncRNAs | GRCh38 PCTs |
| --- | --- | --- | --- | --- |
| After removing short transcripts | 24,513 | 94,830 | 28,628 | 94,527 |
| After deduplication | 21,965 | 41,134 | 24,863 | 41,200 |
| After data balancing | 21,965 | 21,965 | 24,863 | 24,863 |
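
The three processing stages in Table 2 (length filtering, deduplication, balancing) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' pipeline: the function name `process_transcripts`, the `min_len` cutoff, exact-sequence deduplication, and random undersampling of the larger class are all hypothetical choices, since the table does not specify how each step was carried out.

```python
import random


def process_transcripts(ncrnas, pcts, min_len, seed=0):
    """Hypothetical sketch: filter short sequences, deduplicate, balance classes.

    ncrnas, pcts: lists of sequence strings (e.g. read from FASTA files).
    min_len: assumed length cutoff; the table does not state the value used.
    """

    def clean(seqs):
        # Stage 1: drop sequences shorter than the assumed cutoff.
        kept = [s for s in seqs if len(s) >= min_len]
        # Stage 2: remove exact duplicates while preserving first-seen order
        # (assumption: duplicates are identical sequence strings).
        return list(dict.fromkeys(kept))

    nc, pc = clean(ncrnas), clean(pcts)

    # Stage 3: balance by randomly undersampling both classes to the size of
    # the smaller one (assumption: random undersampling).
    n = min(len(nc), len(pc))
    rng = random.Random(seed)
    return rng.sample(nc, n), rng.sample(pc, n)


# Toy usage with made-up sequences
toy_nc = ["AAAA", "AAAA", "CCCCC", "GG"]
toy_pc = ["TTTT", "TTTTT", "ACGTA", "ACGTA", "T", "GGGGG"]
nc_out, pc_out = process_transcripts(toy_nc, toy_pc, min_len=4)
```

In this sketch the balanced size is the smaller of the two deduplicated class sizes, which matches the pattern in the table: for both assemblies the balanced counts equal the deduplicated ncRNA counts (21,965 and 24,863).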