Table 4. The list of the 83 features used in the study.

From: A novel scoring function for discriminating hyperthermophilic and mesophilic proteins with application to predicting relative thermostability of protein mutants

| Protein feature | Number of features | Source |
| --- | --- | --- |
| Sequence length (L) | 1 | In-house script |
| Count and composition of amino acids | 40 | In-house script |
| Number and percentage of positive, negative and all charged residues, as well as the net charges | 8 | In-house script |
| Number and percentage of small (T and D), tiny (G, A, S and P), aromatic (F, H, Y and W), aliphatic, hydrophobic and polar residues | 12 | In-house script |
| Number and percentage of residues that can form hydrogen bonds through their side chains | 2 | In-house script |
| Number of sulfur atoms | 1 | In-house script |
| Average solubility of amino acids in aqueous solution at room temperature | 1 | ** |
| Average of the maximum solvent accessible surface area (ASA) of each amino acid | 1 | Eisenhaber[50] |
| Predicted isoelectric point (pI) of the protein and the average pI over all residues (pIa) | 2 | ProtParam[51] |
| Instability index and instability class | 2 | ProtParam[51] |
| Aliphatic index | 1 | ProtParam[51] |
| GRAVY (grand average of hydropathy) index | 1 | ProtParam[51] |
| Composition of the predicted secondary structure residues | 3 | Psipred[52] |
| Predicted percentages of buried/exposed residues | 2 | Accpro[40] |
| Overall length and percentage of all coils, rem465 and hotloop regions | 6 | disEMBL[53] |

** Obtained from The Merck Index, 12th ed., Merck & Co., Inc., Whitehouse Station, NJ (1996).
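To make the simpler rows of the table concrete, the following is a minimal Python sketch of how the sequence-derived features might be computed. It is not the authors' in-house script, which is not shown here. The function name `basic_features`, the feature keys, the choice of K and R as positive and D and E as negative residues, and the way the 8 charge features are split are all assumptions made for illustration. The aliphatic index follows the standard Ikai formula and GRAVY uses the Kyte-Doolittle hydropathy scale, which are the definitions ProtParam reports; the study itself obtained these two values from ProtParam rather than from a script like this one.

```python
from collections import Counter

# The 20 standard amino acids, in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Assumption: Lys/Arg count as positive and Asp/Glu as negative.
# The table does not say whether His is treated as charged.
POSITIVE = "KR"
NEGATIVE = "DE"

# Kyte-Doolittle hydropathy values, used for the GRAVY index.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def basic_features(sequence):
    """Return a dict of simple sequence features (hypothetical helper)."""
    seq = sequence.upper()
    length = len(seq)
    if length == 0:
        raise ValueError("sequence must not be empty")

    counts = Counter(seq)
    feats = {"length": length}

    # Count and composition (percentage) of each amino acid: 40 features.
    for aa in AMINO_ACIDS:
        feats[f"count_{aa}"] = counts[aa]
        feats[f"pct_{aa}"] = 100.0 * counts[aa] / length

    # Charged-residue features: one plausible split of the 8 in the table.
    n_pos = sum(counts[aa] for aa in POSITIVE)
    n_neg = sum(counts[aa] for aa in NEGATIVE)
    feats["n_positive"] = n_pos
    feats["pct_positive"] = 100.0 * n_pos / length
    feats["n_negative"] = n_neg
    feats["pct_negative"] = 100.0 * n_neg / length
    feats["n_charged"] = n_pos + n_neg
    feats["pct_charged"] = 100.0 * (n_pos + n_neg) / length
    feats["net_charge"] = n_pos - n_neg
    feats["pct_net_charge"] = 100.0 * (n_pos - n_neg) / length

    # Aliphatic index (Ikai): X(Ala) + 2.9 * X(Val) + 3.9 * (X(Ile) + X(Leu)),
    # where X is the mole percentage of each residue.
    feats["aliphatic_index"] = (
        feats["pct_A"]
        + 2.9 * feats["pct_V"]
        + 3.9 * (feats["pct_I"] + feats["pct_L"])
    )

    # GRAVY: sum of hydropathy values divided by sequence length.
    # Non-standard residues contribute 0 to the sum in this sketch.
    feats["gravy"] = sum(KYTE_DOOLITTLE.get(aa, 0.0) for aa in seq) / length

    return feats
```

Calling `basic_features("AKDE")`, for example, returns a dictionary with a `length` of 4, a `net_charge` of -1 and 25% composition for each of the four residues present. The predicted features in the table (secondary structure, burial and disorder) depend on external predictors and are not covered by this sketch.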