Skip to main content

Table 4 Gene expression microarray datasets used in this study.

From: A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification

Task & dataset Number of classes Number of genes Number of samples Prediction task
Dx-Alizadeh 3 4026 62 Diffuse large B-cell lymphoma, follicular lymphoma, chronic lymphocytic leukemia
Dx-Alon 2 2000 62 Colon tumors and normal tissues
Dx-Armstrong 3 11225 72 AML, ALL and mixed-lineage leukemia (MLL)
Dx-Bhattacharjee 5 12600 203 4 lung cancer types and normal tissues
Dx-Golub 3 5327 72 Acute myelogenous leukemia (AML), acute lymphoblastic leukemia (ALL) B-cell and ALL T-cell
Dx-Khan 4 2308 83 Small, round blue cell tumors of childhood
Dx-Nutt 4 10367 50 4 malignant glioma types
Dx-Pomeroy 5 5920 90 5 human brain tumor types
Dx-Ramaswamy 26 15009 308 14 various human tumor types and 12 normal tissue types
Dx-Ramaswamy2 2 13247 76 Metastatic and primary tumors
Dx-Shipp 2 5469 77 Diffuse large B-cell lymphomas and follicular lymphomas
Dx-Singh 2 10509 102 Prostate tumor and normal tissues
Dx-Staunton 9 5726 60 9 various human tumor types
Dx-Su 11 12533 174 11 various human tumor types
Px-Beer 2 7129 86 Lung adenocarcinoma survival
Px-Bhattacharjee 2 12600 62 Lung adenocarcinoma 4-year survival
Px-Iizuka 2 7070 60 Hepatocellular carcinoma 1-year recurrence-free survival
Px-Pomeroy 2 7129 60 Medulloblastoma survival
Px-Rosenwald 2 7399 240 Non-Hodgkin lymphoma survival
Px-Veer 2 24188 97 Breast cancer 5-year metastasis-free survival
Px-Veer2 3 24188 115 Breast cancer 5-year metastasis-free survival, metastasis within 5 years, germline BRCA1 mutation
Px-Yeoh 2 12240 233 Acute lymphocytic leukemia relapse-free survival
  1. The reference paper for each dataset is provided in the Additional File 3.