Skip to main content

Table 2 Summary of explained and unexplained sites in the four experimental datasets of protein variation, having run PsychoProt with only physicochemical descriptors

From: Detection and sequence/structure mapping of biophysical constraints to protein variation in saturated mutational libraries and protein sequence alignments with a dedicated server

Protein HA RNP 1934 RNP 1968 TEM-1
Source of data Thyagarajan et al. eLife [7] Doud et al. Mol Biol Evol [19] Doud et al. Mol Biol Evol [19] From Deng et al. [4]
Sites explained by some descriptor 29.8 % 39.8 % 39.6 % 29.7 %
Unexplained of very low tolerance (k* < 4) 14.5 % 15.9 % 7.4 % 38 %
Unexplained of very high tolerance (k* > 16) 23.9 % 5.8 % 19.3 % 0.4 %
Unexplained of not extreme tolerance to substitutions 31.7 % 38.8 % 33.9 % 31.9 %