Skip to main content

Table 1 The lists of TADs identified by the seven different algorithms in Fig. 4

From: ClusterTAD: an unsupervised machine learning approach to detecting topologically associated domains of chromosomes from Hi-C data

Algorithm

K = 4

K = 5

K = 6

a

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (27,30)}.

b

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (27,30)}.

c

{(1,8), (9,14), (15,20), and (21,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (15,20), (21,25), and (26,30)}.

d

{(1,8), (9,14), (15,20), and (21,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (15,20), (21,25), and (26,30)}.

e

{{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (15,20), (21,25), and (26,30)}.

f

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (15,20), (21,25), and (26,30)}.

g

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (26,30)}.

{(1,8), (9,14), (15,20), (21,25), and (27,30)}.

  1. The table contains the lists of TADs extracted for K = 4, K = 5 and K = 6 (from left, middle to right) by the seven algorithms: (a) HC-eulcidean, (b) KM-eulidean, (c) HC-pearson, (d) KM-pearson, (e) HC-cityblock, (f) KM-cityblock, and (g) EM. HC denotes the hierarchical clustering algorithm, KM the K-means algorithm, and EM the expectation maximization algorithm. HC-euclidean denotes the combination of the hierarchical clustering algorithm and the Euclidean distance metric. A TAD is represented as {start, end}, where “start” is the TAD start region, and “end” is the TAD end region. The best TAD set for the synthetic data is {(1, 8), (9, 14), (15, 20), (21, 25), and (26, 30)}