Skip to main content

Table 9 The HI rules learnt to identify IMLA are shown in English translation. The secondary structure elements along the sequence are ordered into ten equal groups (deciles). The 1st decile are the 10% of elements near the N-teminal and the 10th decile at the C-terminal.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

PDB 1MLA Malonyl-Co-enzyme A Acyl Carrier Protein Transacylase

HI all

 

A protein is homologous if

a1

it has the word 'synthase' in its description line and

 

it is in the 10th decile of predicted secondary structures a coil of length level 4.

HI seq

 

A protein is homologous if

s1

it has in the 10th decile of predicted α-helices a helix of length level 3 and

 

it has in the 10th decile of predicted β-strands a strand of length level 1.