Skip to main content

Table 5 Example datasets considered for each simulation scenario

From: G-bic: generating synthetic benchmarks for biclustering

 

Dataset Context

Description

Dimensions

Size

1

Gene Expression

Arabidopsis [60]

Genes \(\times\) Conditions

\(21031 \times 351\)

2

Recommendation Systems

MovieLens-20M [19, 61]

Users \(\times\) Movies

\(138000 \times 27000\)

3

Text Mining

Reuters-21578 [62]

Terms \(\times\) Documents

\(29930\times 21578\)

4

Clinical Data

PMSI2013 [6]

Patients \(\times\) Clinical Data

\(49231\times 7941\)

5

Spatio-Temporal data

fMRI time series [12]

Brain Regions \(\times\) Time

\(30 \times 150\)