Table 1 Percentage of input space covered by training instances for various alphabet sizes (CN feature)

From: Automated Alphabet Reduction for Protein Datasets

Number of letters    Ratio
2                    100%
3                    97.8%
4                    57.6%
5                    11.3%
20                   3.1e-7
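
The table does not state how the ratio is computed. As a rough sketch, assume it is the number of distinct training instances divided by the number of possible inputs, alphabet_size ** window_length, for fixed-length windows of letters. The function names, the toy windows, and the letter grouping below are hypothetical and are not taken from the paper.

```python
def reduce_alphabet(window, mapping):
    """Rewrite a window of letters using a letter -> group mapping."""
    return "".join(mapping[letter] for letter in window)


def coverage_ratio(instances, alphabet_size):
    """Fraction of the possible input space hit by at least one instance.

    Assumes every instance is a string of the same length and that the
    input space is every string of that length over the alphabet.
    """
    instances = list(instances)
    if not instances:
        return 0.0
    window = len(instances[0])
    if any(len(x) != window for x in instances):
        raise ValueError("all instances must have the same length")
    distinct = len(set(instances))
    return distinct / alphabet_size ** window


# Toy data (hypothetical): length-2 windows over a 4-letter alphabet.
windows = ["AC", "CA", "DE", "ED"]
full = coverage_ratio(windows, alphabet_size=4)        # 4 distinct / 16

# Hypothetical reduction to 2 letters: {A, C} -> X and {D, E} -> Y.
mapping = {"A": "X", "C": "X", "D": "Y", "E": "Y"}
reduced_windows = [reduce_alphabet(w, mapping) for w in windows]
reduced = coverage_ratio(reduced_windows, alphabet_size=2)  # 2 distinct / 4

print(full, reduced)  # prints 0.25 0.5
```

Under this assumption, shrinking the alphabet shrinks the input space exponentially in the window length, which is consistent with the trend in the table: the smaller the alphabet, the larger the share of the space that the training instances cover.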