Fig. 5From: Self-analysis of repeat proteins reveals evolutionarily conserved patternsThe CLANS plot of the clustering of repeat proteins discovered in UniRef90. Dot plots for every protein chain in UniRef90 (downloaded Sept 17, 2018, N = 78915455 chains) were calculated and those proteins with significant signal were collected (nPROT = 13297656) and all possible pairwise Jaccard comparisons were made. These were then clustered using MCL and the medioid point was calculated for every cluster with 5 or more members (nCLUST = 10205) and the inter-medoid distances were used to generate the CLANS figure. Clusters are colored according to the frequency of low complexity regions (LCR) with more intense red indicating the presence of a higher fraction of chains with one or more LCR. Notably, these LCR tend to cluster in the same region of the CLANS plot. This is a 2D representation of a 3D CLANS plotBack to article page