Table 1 Distribution of data in SKEMPI 2.0

From: Improve hot region prediction by analyzing different machine learning algorithms

Amino acid  Non-hot spots  Hot spots  All residues  Ratio of hot spots  Property of side chain
SER         143            21         164           0.128               Hydroxyl-containing
CYS         6              1          7             0.143               Sulfur-containing
GLN         107            29         136           0.213               Amide
THR         114            31         145           0.214               Hydroxyl-containing
PRO         42             17         59            0.288               Cyclic
ASN         101            41         142           0.289               Amide
GLY         48             20         68            0.294               Aliphatic
VAL         70             30         100           0.300               Aliphatic
GLU         154            68         222           0.306               Acidic
HIS         60             28         88            0.318               Basic aromatic
MET         23             11         34            0.324               Sulfur-containing
LYS         131            64         195           0.328               Basic
ARG         146            80         226           0.354               Basic
ASP         101            70         171           0.409               Acidic
LEU         48             41         89            0.419               Aliphatic
ILE         72             52         124           0.461               Aliphatic
PHE         51             55         106           0.519               Aromatic
TYR         75             104        179           0.581               Aromatic
TRP         21             50         71            0.704               Aromatic
All         1513           813        2326          0.350               none
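
The "Ratio of hot spots" column appears to be the number of hot spots divided by the total number of residues of that amino acid type (non-hot spots plus hot spots). The following is a minimal sketch that recomputes the ratio for a few rows of the table; the function name `hot_spot_ratio` and the `rows` dictionary are illustrative assumptions, not part of the original article.

```python
def hot_spot_ratio(non_hot: int, hot: int) -> float:
    """Fraction of residues of one amino acid type that are hot spots.

    Assumes ratio = hot / (non_hot + hot), as the table values suggest.
    """
    total = non_hot + hot
    if total == 0:
        raise ValueError("amino acid type has no residues")
    return hot / total


# (non-hot spots, hot spots) copied from four rows of Table 1
rows = {
    "SER": (143, 21),
    "PHE": (51, 55),
    "TYR": (75, 104),
    "TRP": (21, 50),
}

# Round to three decimals, matching the precision used in the table
ratios = {aa: round(hot_spot_ratio(n, h), 3) for aa, (n, h) in rows.items()}

for aa, ratio in ratios.items():
    print(f"{aa}: {ratio:.3f}")
```

The same calculation can be used to check any other row against its listed ratio.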