 Research
 Open Access
 Published:
Exploring the 2D and 3D structural properties of topologically associating domains
BMC Bioinformatics volume 20, Article number: 592 (2019)
Abstract
Background
Topologically associating domains (TADs) are genomic regions with varying lengths. The interactions within TADs are more frequent than those between different TADs. TADs or subTADs are considered the structural and functional units of the mammalian genomes. Although TADs are important for understanding how genomes function, we have limited knowledge about their 3D structural properties.
Results
In this study, we designed and benchmarked three metrics for capturing the threedimensional and twodimensional structural signatures of TADs, which can help better understand TADs’ structural properties and the relationships between structural properties and genetic and epigenetic features. The first metric for capturing 3D structural properties is radius of gyration, which in this study is used to measure the spatial compactness of TADs. The mass value of each DNA bead in a 3D structure is novelly defined as one or more genetic or epigenetic feature(s). The second metric is folding degree. The last metric is exponent parameter, which is used to capture the 2D structural properties based on TADs’ HiC contact matrices. In general, we observed significant correlations between the three metrics and the genetic and epigenetic features. We made the same observations when using H3K4me3, transcription start sites, and RNA polymerase II to represent the mass value in the modified radiusofgyration metric. Moreover, we have found that the TADs in the clusters of depleted chromatin states apparently correspond to smaller exponent parameters and larger radius of gyrations. In addition, a new objective function of multidimensional scaling for modelling chromatin or TADs 3D structures was designed and benchmarked, which can handle the DNA beadpairs with zero HiC contact values.
Conclusions
The web server for reconstructing chromatin 3D structures using multiple different objective functions and the related source code are publicly available at http://dna.cs.miami.edu/3DChrom/.
Background
Revealing the threedimensional (3D) structures of chromosomes has been an important and challenging research topic. Fluorescence in situ hybridization (FISH) can measure the presence of specific DNA sequences, but provides limited spatial distance information; therefore, we cannot massively model chromatin 3D structures using FISH data. In comparison, the chromosome conformation capture (3C) technique [1] can capture physical interactions between two genome loci; the chromosome conformation captureonchip (4C) technique [2, 3] can capture the interactions between one genomic site and any genomic sites; and the chromosome conformation capture carboncopy (5C) technique [4] can detect chromatin interactions among a set of genomic loci. Particularly, the recent populationcell HiC [5] technique, which can capture chromatin interactions across the entire genome, provides us an opportunity to more accurately explore the relationships between the spatial organizations of genomes and genomic functions [6,7,8,9]. A populationcell HiC contact profile usually is generated from millions of cells.
Various computational methods using HiC data as input to infer chromatin 3D structures have been developed. Duan et al. [10] firstly introduced metric multidimensional scaling (MDS) for reconstructing chromatin 3D structures. After that, Varoquaux et al. [11] and Ay et al. [9] designed new objective functions of metric MDS to model the 3D structures of chromatins based on experimental and simulated HiC data. Hu et al. [8] developed a Bayesian approach (BACH) based on an assumption that HiC contact counts follow a Poisson distribution. After removing systematic biases in the HiC data, they used Markov chain Monte Carlo simulations to infer chromosomal 3D structures. Zhang et al. developed ChromSDE [12] that infers 3D structures of chromosomes by semidefinite programming and uses golden section search to find the best parameter for converting HiC contacts into spatial target distances. While the abovementioned studies are based on populationcell HiC data, it is important to notice the emergence of singlecell HiC technique revealing celltocell variabilities and related method that can reconstruct chromosomal 3D structures based on singlecell HiC data [13, 14].
Topologically associating domains (TADs) have been identified as selfinteracting genome regions detected from the normalized HiC contact matrices [7]. The HiC contacts within TADs are apparently larger and more enriched than those between two successive TADs. Some factors are enriched at the boundary regions of TADs [7] including RNA polymerase II, insulator binding protein CTCF, and H3K4me3. It is also found that TADs are conserved across different cells or cell lines and even among different species, indicating that TADs may be the structural units of genomes [7].
Hu et al. [8] defined a structural metric named HD ratio to quantify TAD structures’ elongation properties. They found that the elongated TADs usually had a large value of HD ratio. They also observed that HD ratios were positively correlated with selected genetic and epigenetic features [8]. However, the limitation of HD ratio is that it can only measure the elongation properties but not the compactness or folding degree in the 3D space, both of which are significant structural properties.
We benchmarked three measures to gauge TADs’ structural properties, which are based on radius of gyration [15], folding degree [16], and exponent parameter [5]. We have observed that the structural properties of TADs are highly correlated with multiple genetic and epigenetic features. We observed that small values of radius of gyration and folding degree usually correspond to a strong enrichment of CTCF, which plays an important role in organizing the 3D structures of chromosomes [17], whereas the exponent parameter usually has a significant positive correlation with the intensities of CTCF. The 3D structures of TAD used in this study were generated by multidimensional scaling based on populationcell HiC data. We benchmarked and compared four objective functions for reconstructing chromosomal 3D structures including an objective function newly designed to model the cases with zero HiC contact value.
Methods
Resource and processing of HiC data, TAD definitions, and chromatin states information
We downloaded the HiC contact matrices at 40 kb resolution from http://chromosome.sdsc.edu/mouse/hic/download.html, which includes raw and normalized HiC matrices of each chromosome for mouse embryonic stem cells (mESC). The domaincaller software [7] was used to detect TADs on normalized HiC contact matrices at 40 kb resolution, resulting in 2040 TADs. The 3D structures of these 2040 TADs were reconstructed and used to explore the relationships between 3D DNA structures and genetic and epigenetic features.
We downloaded the lowresolution raw HiC reads from the GEO database with ID GSE35156 [7]. The low resolutions we used in our analysis are 400 kb, 500 kb, and 1 Mb. BWA [18] with default parameters was used to separately map pairedend HiC reads back to the mouse reference genome (build mm9). We removed multiple experimental artifacts defined in [19]: first, the PCR duplicates, resulting in multiple pairedend reads mapped to the same genomic location were removed using Picard (http://broadinstitute.github.io/picard/); second, we discarded the reads with mapping quality less than or equal to 10; and third, pairedend reads were discarded if the two ends were mapped to the same fragment (i.e., a genomic region defined by two successive restriction sites) or the sum of the distances between the mapping site of each end of a pairedend read and its corresponding nearest restriction cut site was larger than a certain threshold (500 bp suggested in [6]). We used the ICE normalization method [20] to normalize the raw HiC contact matrices at resolutions of 400 kb, 500 kb, and 1 Mb.
We downloaded the 14state annotations for mESC from [21]. These chromatin states were generated using ChromHMM [22]. For each TAD, we computed its fold enrichment of each state using the OverlapEnrichment function in ChromHMM [22]. For each pair of TADs, we calculated the absolute value of the Pearson’s correlation coefficient between their fold enrichments, which was used as their similarity scores. We used Spectral Clustering [23, 24] with predefined number of clusters equal to 20 to classify TADs based on their chromatinstate similarity scores.
Data for genetic and epigenetic features
We defined the gene density of a TAD as the number of transcription start sites (TSS) located within the TAD. The number of peaks existing in a TAD was used as the enrichment for both H3K4me3 (GEO, GSM723017) and RNA polymerase II (GEO, GSM723019). The enrichment for the other four features including CTCF (GEO, GSM723015), H3K27me3 (GEO, GSM1000089), H3K9me3 (GEO, GSM1000147), and H3K36me3 (GEO, GSM1000109), were defined as the average (after log2 scale) of all data values existing in a TAD. All enriched values for a TAD were normalized by the TAD's length.
Reconstructing the 3D structures of chromosomes and TADs using metric MDS
The HiC contact data were formulated as a contact matrix as in [5]. The DNA of the whole chromosome or TAD was evenly split into beads/bins with length equal to the resolution value. The entry c_{ij} in a HiC contact matrix C denotes the contact count between the i^{th} and j^{th} beads. All entries in the matrix C can be simply classified into two mutually exclusive sets: C_{1} = {(i, j) c_{ij} ≠ 0} and C_{2} = {(i, j) c_{ij} = 0}, that is, the first set C_{1} contains the bead pairs with the number of HiC contact larger than zero and the second set C_{2} contains the bead pairs having zero HiC contact.
To model the 3D structures of TADs and individual chromosomes by metric MDS, the HiC contact values between any two DNA beads are converted into a target spatial distance. The conversion was conducted using the following equation:
where α was set to 1/3 as in [11]; and β, the scale parameter [11], was set to 1.
We implemented and benchmarked the following three objective functions:
The d_{ij} and δ_{ij} are the real distance and target/objective distance between beads i and j, respectively. The first objective function [25] is from [11], whose numerator term is normalized by δ_{ij}^{2}.
The second one is from [12], in which the second term is a regularization term for maximizing the pairwise distances between two beads in the set C_{2}. The parameter γ in the second term was set to 0.01 [12].
We designed the third objective function based on the first two functions by introducing the second term, which was used to make the distances between any two beads in C_{2} as close to value R as possible. We tested two different ways of defining R: (1) as the sum of the edge weights along the shortest path found by the FloydWarshall algorithm [26] if we think of every DNA bead as a node in a graph and that every two DNA beads have an edge if they have >0 HiC contacts; the weight of each edge is modeled as the target distance; and (2) as the maximum target distance between any two bead pairs in C_{1}.
Radius of gyration
The reconstructed 3D structure of a TAD is represented using a nby3 matrix where n is the number of beads in a TAD and 3 indicates the 3D coordinates.
The centre of mass R_{C} of a TAD 3D structure is calculated by:
where m_{i} is the mass of the i^{th} bead and x_{i} is the 3D coordinates of the bead.
The radius of gyration R_{g} of a TAD is calculated as:
where M is all beads’ mass values normalized by the TAD length.
We first assumed the mass values of all beads in one TAD were the same. The unit value 1 was used to represent the mass of every bead, which makes the radius of gyration to only consider the spatial arrangement of nucleotide. This type of mass representation is labelled as “mass_unit” in this paper. We also considered gene density that is, the number of transcription start sites (TSS) as the mass representations. Figure 1 (a) and (b) show two TADs with the same length but with different 3D structures, which have different distributions of TSS. The 3D structure shown in Fig. 1(a) is dispersed but with four successive beads enriched with TSS concentrated in a small region of the TAD, whereas the structure shown in Fig. 1(b) is compacted but with three beads having one or two TSS spreading over the TAD. The illustration in Fig. 1(a) and (b) indicates that the compactness of all DNA beads can be different compared with the compactness of the DNA beads with a certain genetic feature. Therefore, the second mass representation “mass_TSS” was modeled as the number of TSS located within a bead plus the unit mass.
We defined another two mass representations “mass_or” and “mass_and”. “mass_or” defines the mass of a bead as 2 if we find one of the three features (H3K4me3, RNA polII, and TSS) in the bead, and 1 if we find none of them. “mass_and” defines the mass of a bead as the sum of 1 (indicating nucleotide), the total number of TSS, and the number of peaks for H3K4me3 and RNA polII.
Folding degree and exponent parameter
The Estrada index [16, 27] has been used as an effective metric for measuring the folding degree of protein tertiary structures. In this study, we used it to model the folding degree of the 3D structures of TADs. Every three consecutive DNA beads in a chain of beads (e.g., a TAD) can form a plane; and every two consecutive planes have a specific dihedral angle. The Estrada index is able to capture the folding degree of the entire chain while considering these dihedral angels. It generates the first, second, and third line graphs of a given graph G (Fig. 3(c)) and calculates the eigenvalues of a symmetric matrix from the third line graph to obtain the final Estrada index.
We also applied and benchmarked a twodimensional (based on HiC contact matrix) structural metric named exponent parameter [5].
Normalized HiC data between TADs with different lengths
Since the lengths of TADs are different, the HiC interaction frequencies between the Xistcontaining TAD (the TAD including the Xist gene) and each of the other TADs were calculated as the observed/expected numbers of HiC contact counts (in this way, the HiC contacts were normalized by the different lengths of TADs): as previously did in [5, 28], the expected contacts between two different TADs i and j were calculated by E_{ij} = R_{i} × R_{j} × N_{inter}, where R_{i} and R_{j} are the fractions of interTADs reads associated with i and j respectively, and N_{inter} is the total number of interTADs reads.
Results
Structural properties are correlated with genetic and epigenetic features
We calculated the Pearson’s correlation between each of the three measures, including radius of gyration, folding degree, and exponent parameter, and each of the seven features, including gene density, H3K36me3, H3K9me3, H3K27me3, H3K4me3, RNA polII, and CTCF. The results are shown in Table 1.
We observed that the radius of gyration is negatively correlated with the seven features; folding degree is also negatively correlation with the seven features, but not as significant as radius of gyration. However, exponent parameter of TADs’ 2D contact matrices has strong positive correlations with these features. All of the correlation values shown in Table 1 are statistically significant (all Pvalues < 10^{− 5}). We may conclude from the significant negative correlations between radius of gyration and CTCF that the compactness level of TADs in the 3D space are related to the enrichment level of CTCF. The Pearson’s correlations between the three measures and the seven features on each chromosome are shown in Fig. 2.
Relationships between structural properties and chromatin states
The 14 chromatin states we have considered in this study include: (1) transcription elongation, (2) weakly transcribed, (3) weak/poised enhancer 1, (4) active promoter 1, (5) strong enhancer 1, (6) active promoter 2, (7) strong enhancer 2, (8) weak/poised enhancer 2, (9) poised promoter, (10) repressed, (11) heterochromatin 1, (12) heterochromatin 2, (13) heterochromatin 3, and (14) insulator. We clustered all 2040 TADs based on their chromatinstate enrichment similarities (details see Methods). The clustering procedure generated 20 clusters of TADs. Based on the fold enrichment of each chromatin state for each of the 20 clusters (Fig. 3 (a)), we found that: (1) three clusters (i.e., clusters 2, 6, and 18) are depleted for almost all of the 14 chromatin states; (2) compared with the TADs in the other clusters, the TADs in the same three clusters have apparently smaller exponent parameters and larger radius of gyrations (Fig. 3 (b)).
Inference and evaluations of the reconstructed 3D structures
Evaluations of the objective functions
Three different objective functions were used to reconstruct the 3D structures of 20 chromosomes of mES individually at two different resolutions (1 Mb and 500 kb).
We plotted the distribution of the inferred Euclidean distances between any two beads with missing HiC contacts on the Xchromosome at 1 Mb and 500 kb resolutions in Fig. 4. Compared to the other objective functions, Eq. (4) with R equal to the maximum distance generates larger distances. Therefore, our Eq. 4 with R equal to the maximum distance works best if we assume that the two beads with missing HiC contacts should be spatially far away to each other. Our results shown in Fig. 4 indicates that Eq. (4) with R equal to the shortest path generates smaller Euclidean distances; and most of the Euclidean distances are smaller than one.
We used the Kabsch algorithm [29] to superimpose the reconstructed 3D structures from different objective functions; their minimum rootmeansquare deviations (RMSDs) were calculated to quantify the level of similarity, which are shown in Fig. 5 at two different resolutions (1 Mb and 500 kb). Compared to 1 Mb resolution, the values of RMSD are relatively larger at 500 kb resolution. An illustration of the superimposed 3D structures of chromosome 8 are shown in Fig. 5(c).
We show more evaluation results in Fig. 6 at two different resolutions (a, b, c, d for 1 Mb and e, f, g, h for 500 kb) from different perspectives: (a) and (e) show the Pearson’s correlations between target and inferred Euclidean distances among any two beads with nonzero HiC contacts; (b) and (f) show the plots of rootmeansquare errors (RMSEs) between target and inferred Euclidean distances among any two beads with nonzero HiC contacts; (c) and (g) show the Spearman’s correlations between inferred Euclidean distances and nonzero HiC contact counts, whereas (d) and (h) show the same correlations as in (c) and (g) but with all HiC contacts. These results indicate that the inferred distances with the three objective functions are consistent with HiC contacts; and Eq. (4) with R equal to the maximum distance generates slightly better results than R equal to the shortest path. Therefore, we used the 3D structures inferred by Eq. (4) with R equal to the maximum target distance in the other sections.
Chromatin 3D structure reconstruction and biochemistry validations with Xist lncRNA localization
We used metric MDS to reconstruct the 3D structures of TADs and the Xchromosome at two different resolutions. A twodimensional heat map of HiC contact matrix and the corresponding reconstructed 3D structures of four TADs on chromosome 6 in mESC are shown in Fig. 7 (a, b). The 3D structures of the four TADs we inferred using Eq. (4), illustrated by four different colors, do not overlap with each other but have their own territories in the 3D space, which fits the clear boundarydefined patterns in the 2D HiC contact map in Fig. 7 (a).
We also modelled the 3D structures of the Xchromosome at 1 Mb (Fig. 8 (a)) and 400 kb (Fig. 8 (b)) resolutions and further used Xist localization intensities to evaluate the inferred 3D structures. The reconstructed 3D structure of the Xchromosome at 400 kb resolution mapped with the Xist localization intensities after six hours of induction are shown in Fig. 8(b). It has been found in [30] that Xist transcripts more intensively localize in the DNA sites in spatial proximity to the Xist locus based on a high correlation (Pearson’s correlation r = 0.69) between Xist localization and HiC contact counts. If the 3D structure we inferred makes sense, a similar observation should be found.
We investigated this from the perspective of TADs (instead of DNA segments with the same length as did in [30]). We calculated the normalized HiC contacts between the Xistcontaining TAD and all other TADs (details see Methods). The Pearson’s correlation between the localization intensities of Xist transcripts (after one hour of the induction of Xist transcription [30]) and the normalized TADTAD HiC contacts is 0.79, observed from 76 TADs in the Xchromosome and based on our inferred 3D structure of the Xchromosome (Fig. 9 (a)). This finding fits the observation mentioned in [30], indicating the correctness of our inferred 3D structure of the Xchromosome.
Furthermore, we calculated the correlation between the localization intensities of Xist transcripts and the beadbead (the length of every bead is the same as the resolution value) spatial distances calculated based on our inferred 3D structure of the Xchromosome. First, we reconstructed the 3D structure of the Xchromosome at 1 Mb resolution (Fig. 8 (a)) and then mapped the Xist localization intensities onto the 3D structure. The Euclidean distances from the Xistcontaining bead to each of the other beads were calculated, which were then correlated with the Xist transcripts localization intensities (RAP values after one hour of the induction of Xist transcription). The Pearson’s correlation is r = − 0.67, see Fig. 9 (b), which is consistent with what the previous work has found [30].
Conclusions
In this study, we reconstructed the 3D structures of individual TADs and chromosomes of mESC at different resolutions using MDS methods and defined a new objective function to handle the cases with missing HiC contact.
We used three measures including exponent parameter, radius of gyration, and Estrada index to gauge the compactness and folding degree of the 3D structures of TADs and observed that the three measures are significantly correlated with seven genetic and epigenetic features including H3K36me3, H3K4me3, H3K27me3, H3K9me3, RNA PolII, gene density, and CTCF. Moreover, our novel radiusofgyrationbased measure can consider the enrichments of genetic and epigenetic features and achieve more significant negative correlation when the enrichment levels of gene density, H3K4me3, and RNA polII are in consideration. By analysing the chromatin states of TADs and conducting clustering of TADs based on chromatin states, we found that the TADs in the clusters of depleted chromatin states correspond to smaller exponent parameters and larger radius of gyration values.
In conclusion, TADs do have structural properties that significantly correlate with genetic and epigenetic features. The radiusofgyrationbased structural measure we designed can capture the structural properties of TADs and have a significant correlation with multiple genetic and epigenetic features.
Availability of data and materials
The web server of 3DChrom and source code are available at http://dna.cs.miami.edu/3DChrom/.
Abbreviations
 2D:

Two dimensional
 3D:

Three dimensional
 MDS:

Multidimensional scaling
 TAD:

Topologically associating domain
References
 1.
Dekker J, Rippe K, Dekker M, Kleckner N. Capturing chromosome conformation. Science. 2002;295(5558):1306–11.
 2.
Simonis M, Klous P, Splinter E, Moshkin Y, Willemsen R, de Wit E, van Steensel B, de Laat W. Nuclear organization of active and inactive chromatin domains uncovered by chromosome conformation capture–onchip (4C). Nat Genet. 2006;38(11):1348–54.
 3.
Zhao Z, Tavoosidana G, Sjölinder M, Göndör A, Mariano P, Wang S, Kanduri C, Lezcano M, Sandhu KS, Singh U. Circular chromosome conformation capture (4C) uncovers extensive networks of epigenetically regulated intraand interchromosomal interactions. Nat Genet. 2006;38(11):1341–7.
 4.
Dostie J, Richmond TA, Arnaout RA, Selzer RR, Lee WL, Honan TA, Rubio ED, Krumm A, Lamb J, Nusbaum C. Chromosome conformation capture carbon copy (5C): a massively parallel solution for mapping interactions between genomic elements. Genome Res. 2006;16(10):1299–309.
 5.
LiebermanAiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, et al. Comprehensive mapping of longrange interactions reveals folding principles of the human genome. Science. 2009;326(5950):289–93.
 6.
Yaffe E, Tanay A. Probabilistic modeling of hiC contact maps eliminates systematic biases to characterize global chromosomal architecture. Nat Genet. 2011;43(11):1059–65.
 7.
Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485(7398):376–80.
 8.
Hu M, Deng K, Qin Z, Dixon J, Selvaraj S, Fang J, Ren B, Liu JS. Bayesian inference of spatial organizations of chromosomes. PLoS Comput Biol. 2013;9(1):e1002893.
 9.
Ay F, Bunnik EM, Varoquaux N, Bol SM, Prudhomme J, Vert JP, Noble WS, Le Roch KG. Threedimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression. Genome Res. 2014;24(6):974–88.
 10.
Duan Z, Andronescu M, Schutz K, McIlwain S, Kim YJ, Lee C, Shendure J, Fields S, Blau CA, Noble WS. A threedimensional model of the yeast genome. Nature. 2010;465(7296):363–7.
 11.
Varoquaux N, Ay F, Noble WS, Vert JP. A statistical approach for inferring the 3D structure of the genome. Bioinformatics. 2014;30(12):i26–33.
 12.
Zhang Z, Li G, Toh KC, Sung WK: 3D chromosome modeling with semidefinite programming and HiC data. J Comput Biol. 2013;20(11):831–46.
 13.
Zhu H, Wang Z. SCL: a latticebased approach to infer threedimensional chromosome structures from singlecell hiC data. Bioinformatics. 2019. https://doi.org/10.1093/bioinformatics/btz181.
 14.
Nagano T, Lubling Y, Stevens TJ, Schoenfelder S, Yaffe E, Dean W, Laue ED, Tanay A, Fraser P. Singlecell hiC reveals celltocell variability in chromosome structure. Nature. 2013;502(7469):59–64.
 15.
Lobanov MY, Bogatyreva N, Galzitskaya O. Radius of gyration as an indicator of protein structure compactness. Mol Biol. 2008;42(4):623–8.
 16.
Estrada E. Characterization of the folding degree of proteins. Bioinformatics. 2002;18(5):697–704.
 17.
Phillips JE, Corces VG. CTCF: master weaver of the genome. Cell. 2009;137(7):1194–211.
 18.
Li H, Durbin R. Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics. 2009;25(14):1754–60.
 19.
Hu M, Deng K, Qin Z, Liu JS. Understanding spatial organizations of chromosomes via statistical analysis of hiC data. Quant Biol. 2013;1(2):156–74.
 20.
Imakaev M, Fudenberg G, McCord RP, Naumova N, Goloborodko A, Lajoie BR, Dekker J, Mirny LA. Iterative correction of hiC data reveals hallmarks of chromosome organization. Nat Methods. 2012;9(10):999–1003.
 21.
Bogu GK, Vizán P, Stanton LW, Beato M, Di Croce L, MartiRenom MA. Chromatin and RNA maps reveal regulatory long noncoding RNAs in mouse. Mol Cell Biol. 2015. https://doi.org/10.1128/MCB.0095515.
 22.
Ernst J, Kellis M. ChromHMM: automating chromatinstate discovery and characterization. Nat Methods. 2012;9(3):215.
 23.
Shi J, Malik J. Normalized cuts and image segmentation. IEEE T Pattern Anal. 2000;22(8):888–905.
 24.
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V. Scikitlearn: machine learning in python. J Mach Learn Res. 2011;12(Oct):2825–30.
 25.
Kruskal JB, Wish M: Multidimensional Scaling, Sage University Paper series on Quantitative Application in the Social Sciences, 07011. Beverly Hills and London: Sage Publications; 1978.
 26.
Floyd RW. Algorithm 97: shortest path. Commun ACM. 1962;5(6):345.
 27.
Estrada E. Characterization of 3D molecular structure. Chem Phys Lett. 2000;319(5):713–8.
 28.
Wang Z, Cao R, Taylor K, Briley A, Caldwell C, Cheng J. The properties of genome conformation and spatial gene interaction and regulation networks of normal and malignant human cell types. PLoS One. 2013;8(3):e58793.
 29.
Kabsch W. A discussion of the solution for the best rotation to relate two sets of vectors. Acta Crystallogr Sect A: Cryst Phys, Diffr, Theor Gen Crystallogr. 1978;34(5):827–8.
 30.
Engreitz JM, PandyaJones A, McDonel P, Shishkin A, Sirokman K, Surka C, Kadri S, Xing J, Goren A, Lander ES. The Xist lncRNA exploits threedimensional genome architecture to spread across the X chromosome. Science. 2013;341(6147):1237973.
Acknowledgements
Not applicable.
About this supplement
This article has been published as part of BMC Bioinformatics Volume 20 Supplement 16, 2019: Selected articles from the IEEE BIBM International Conference on Bioinformatics & Biomedicine (BIBM) 2018: bioinformatics and systems biology. The full contents of the supplement are available online at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume20supplement16.
Funding
This research was supported by the National Institutes of Health grant R15GM120650 to ZW and startup funding from the University of Miami to ZW. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. Publication costs were funded by the National Institutes of Health grant R15GM120650 to ZW.
Author information
Affiliations
Contributions
TL implemented the algorithms, generated the 3D structures, and analyzed the results. ZW advised the research. TL and ZW wrote the manuscript. Both authors have read and agreed with the content of the manuscript. Both authors read and approved the final manuscript.
Corresponding author
Correspondence to Zheng Wang.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Liu, T., Wang, Z. Exploring the 2D and 3D structural properties of topologically associating domains. BMC Bioinformatics 20, 592 (2019) doi:10.1186/s128590193083z
Published
DOI
Keywords
 HiC
 Chromosomal threedimensional structure
 Topologically associating domains
 3D structural properties of TADs