Skip to main content

Table 1 Datasets description: Three omics are provided for each dataset, respectively DNA gene expression, miRNA and Methylation

From: Data integration by fuzzy similarity-based hierarchical clustering

  #Cases DNA miRNA Methy Multi-Omics
Dataset - ORI LN RF ORI LN RF ORI LN RF ORI LN RF
AML 170 20531 2000 1997 5000 2000 1999 705 558 553 26236 4558 4529
BIC 621 20531 2000 2000 5000 2000 2000 1046 891 854 26577 4891 4854
COAD 220 20531 2000 2000 5000 2000 2000 705 613 591 26236 4613 4590
GBM 274 12042 2000 2000 5000 2000 2000 534 534 534 17576 4534 4534
KIRC 183 20531 2000 1999 5000 2000 1999 1046 796 754 26577 4796 4752
LIHC 367 20531 2000 2000 5000 2000 2000 1046 852 826 26577 4852 4366
LUSC 341 20531 2000 2000 5000 2000 2000 1046 878 850 26577 4878 4850
SKCM 448 20531 2000 2000 5000 2000 2000 1046 901 874 26577 4901 4874
OV 287 20531 2000 2000 5000 2000 2000 705 616 600 26236 4616 4600
SARC 257 20531 2000 2000 5000 2000 2000 1046 838 805 26577 4838 4805
  1. The number of features at each variable selection method is shown. ORI: Original variable dimension, LN: Logarithm and normalisation and, RF: Random Forest based on Mean Decrease Gini index