Skip to main content

Table 2 Categorical FASTA files of transcripts after data processing

From: IIMLP: integrated information-entropy-based method for LncRNA prediction

Transcripts types GRCh37 ncRNAs GRCh37 PCTs GRCh38 ncRNAs GRCh38 PCTs
After removing short 24,513 94,830 28,628 94,527
After deduplication 21,965 41,134 24,863 41,200
After data balancing 21,965 21,965 24,863 24,863