Skip to main content
Figure 3 | BMC Bioinformatics

Figure 3

From: Tandem machine learning for the identification of genes regulated by transcription factors

Figure 3

Overview of tandem machine learning. For each gene, the PWM representing binding sites for PXR/RXRα was used to scan the 10 kb region upstream of the transcription start site to generate a list of the location and strength of individual binding sites. This list was used to generate summary features, e.g., the total number of sites, total information content. It was also used as input for IDBC to generate clusters. A second set of summary features was extracted from the clustering obtained, e.g., total number of clusters, total information content within clusters. The combined list of features for each promoter region constituted a single data item for input to one of several machine-learning algorithm.

Back to article page