
Table 1 Characteristics of microarray datasets used in this study

From: MiningABs: mining associated biomarkers across multi-connected gene expression datasets

| Sample types | Dataset serial numbers | GEO accession numbers | Platform types | N/T | A/D | # of distinct genes in a platform | Avg length of sequences (Avg ± SD) | Source of samples | References |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ESCC | 1-1 | GSE23400 | Affymetrix HG-U133A | 53/53 | 20,133/22,283 | 12,633 | 250 ± 22 | China | [5] |
| | 1-2 | GSE23400 | Affymetrix HG-U133B | 51/51 | 14,110/22,477 | 9,256 | 250 ± 22 | | |
| | 1-3 | GSE20347 | Affymetrix HG-U133A_2 | 17/17 | 20,133/22,277 | 12,633 | 250 ± 22 | China | [6] |
| | 1-4 | GSE29001 | Affymetrix HG-U133A_2 | 12/12 | 20,133/22,277 | 12,633 | 250 ± 22 | China | [7] |
| HCC | 2-1 | GSE14520 | Affymetrix HG-U133A_2 | 19/22 | 20,133/22,277 | 12,633 | 250 ± 22 | China | [8] |
| | 2-2 | GSE14520 | Affymetrix HT_HG-U133A | 210/225 | 20,429/22,277 | 12,743 | 440 ± 105 | | |
| | 2-3 | GSE17856 | Agilent 014850 | 44/43 | 20,772/25,073 | 14,312 | 60 ± 0†† | Japan | [9] |
  1. ESCC: esophageal squamous cell carcinoma; HCC: hepatocellular carcinoma; N: # of normal samples; T: # of tumor samples; A: # of available probes matched with distinguishable gene IDs in a platform; D: # of downloaded probes contained in a platform; Avg: average; SD: standard deviation; †: Affymetrix probe set-matched target sequence; ††: Agilent spotted sequence.