Figure 6From: EnzML: multi-label prediction of enzyme classes using InterPro signaturesCross-evaluation on the UniRef reference sequences. Left panel: the reference sequences are derived from SwissProt⋈KEGG using UniRef100, UniRef90 or UniRef50 clusters. Right panel: number of protein instances, InterPro attributes and EC classes when the SwissProt⋈KEGG dataset is reduced to its UniRef representative sequences. The values in both panels are shown as difference to the corresponding value for the entire SwissProt⋈KEGG dataset. The full data is available in Additional file 7: all_cross_evaluation_results.csvBack to article page