Skip to main content

Table 10 The HI rules learnt to identify 1MPP are shown in English translation.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

PDB 1MPP Pepsin (Renin).
HIall  
A protein is homologous if
a1 it has the classification 'eukaryota' and
  it has the PROSITE pattern 'PS00141'.
HIseq  
A protein is homologous if
s1 it has it has a level 10 serine-serine pair content and
  it has it has a level 10 glycine-serine pair content and
  it has in the 8th decile of predicted β-strands a strand of length level 9
or  
s2 it has a molecular weight of level 7 and
  it has in the 9th decile of predicted coils a coil of length level 1
or  
s3 it has it has a level 2 histidine content and
  it has in the 7th decile of predicted secondary structures a β-strand of lengthlevel 5 and
  it has in the 7th decile of predicted secondary structures a coil of lengthlevel 5 and
  it has in the 4th decile of predicted secondary structures a β-strand of length level 6.