Skip to main content

Table 2 Feature index used in this study.

From: Learning to predict expression efficacy of vectors in recombinant protein production

Feature Type

Description

#(Feature)

Nucleotide

≤3-mer

84

Nucleotide

nt Seq Length

1

Nucleotide

GC Content

1

Code Preference

Codon Adaptation Index

1

Amino Acid

Wilkinson and Harrison (1991)

6

Amino Acid

Idicula-Thomas et al. (2006)

444

Amino Acid

isoelectric point

1

Amino Acid

peptide statistics

8

PTMs

Plewczynski et al. (2005)

71

 

Total

617

  1. 617 features were extracted from an entire recombinant fusion protein. They were divided into two groups with respect to nucleotide or protein levels. The first 87 features were generated from nucleic acid sequences of entire recombinant fusion genes. The rest 530 features were retrieved from protein sequences.