Skip to main content
Fig. 5 | BMC Bioinformatics

Fig. 5

From: Self-analysis of repeat proteins reveals evolutionarily conserved patterns

Fig. 5

The CLANS plot of the clustering of repeat proteins discovered in UniRef90. Dot plots for every protein chain in UniRef90 (downloaded Sept 17, 2018, N = 78915455 chains) were calculated and those proteins with significant signal were collected (nPROT = 13297656) and all possible pairwise Jaccard comparisons were made. These were then clustered using MCL and the medioid point was calculated for every cluster with 5 or more members (nCLUST = 10205) and the inter-medoid distances were used to generate the CLANS figure. Clusters are colored according to the frequency of low complexity regions (LCR) with more intense red indicating the presence of a higher fraction of chains with one or more LCR. Notably, these LCR tend to cluster in the same region of the CLANS plot. This is a 2D representation of a 3D CLANS plot

Back to article page