Table 3 The performance of optimum feature subsets on four training sets using five-fold cross-validation

From: Prediction of bioluminescent proteins by using sequence-derived features and lineage-specific scheme

| Lineage | Number | Sensitivity | Specificity | Accuracy | MCC | AUC |
|---|---|---|---|---|---|---|
| General | 199 | 0.732 ± 0.010 | 0.949 ± 0.022 | 0.841 ± 0.006 | 0.698 ± 0.018 | 0.883 ± 0.007 |
| Bacteria | 174 | 0.832 ± 0.012 | 0.943 ± 0.016 | 0.888 ± 0.006 | 0.780 ± 0.013 | 0.920 ± 0.010 |
| Eukaryota | 204 | 0.667 ± 0.053 | 0.833 ± 0.053 | 0.750 ± 0.026 | 0.510 ± 0.054 | 0.806 ± 0.015 |
| Archaea | 129 | 0.825 ± 0.061 | 0.900 ± 0.094 | 0.863 ± 0.047 | 0.733 ± 0.095 | 0.917 ± 0.019 |

  1. The reported results are those obtained by maximizing the MCC value of prediction on the corresponding dataset.
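
For readers who want to see how the threshold-dependent columns of the table relate to one another, the following is a minimal sketch that computes sensitivity, specificity, accuracy and the Matthews correlation coefficient (MCC) from the four confusion-matrix counts, using the standard textbook definitions. The function name `binary_metrics` and the example counts are hypothetical and are not taken from the paper; AUC is omitted because it requires ranked prediction scores rather than counts.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Compute standard binary-classification metrics from confusion counts.

    tp/fn: positives predicted correctly/incorrectly
    tn/fp: negatives predicted correctly/incorrectly
    Assumes at least one positive and one negative sample.
    """
    sensitivity = tp / (tp + fn)            # true positive rate
    specificity = tn / (tn + fp)            # true negative rate
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only (not data from the paper).
metrics = binary_metrics(tp=80, fn=20, tn=90, fp=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Because MCC uses all four counts, it penalizes a classifier that does well on one class at the expense of the other, which is one reason it is a common selection criterion.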