Skip to main content

Table 3 The results of different models on 10-fold cross-validation test and independent test

From: Rigorous assessment and integration of the sequence and structure based features to predict hot spots

Features

Testing

P

R

F1

AUC

ASA

10-fold

0.58

0.65

0.61

0.62

 

Test set

0.60

0.61

0.61

0.57

BC

10-fold

0.66

0.33

0.43

0.67

 

Test set

0.71

0.40

0.51

0.58

Phy

10-fold

0.63

0.51

0.55

0.67

 

Test set

0.59

0.53

0.56

0.58

ECS

10-fold

0.58

0.27

0.34

0.51

 

Test set

0.74

0.18

0.28

0.68

SE

10-fold

0.57

0.53

0.54

0.60

 

Test set

0.59

0.61

0.60

0.52

PSSM

10-fold

0.65

0.54

0.58

0.65

 

Test set

0.64

0.48

0.55

0.66

Phy+PSSM+ECS+SE

10-fold

0.65

0.65

0.65

0.68

 

Test set

0.69

0.68

0.68

0.68

Phy+ASA+BC

10-fold

0.65

0.60

0.61

0.70

 

Test set

0.62

0.70

0.66

0.62

Phy+ASA+BC+PSSM+ECS+SE

10-fold

0.66

0.68

0.66

0.72

 

Test set

0.65

0.63

0.64

0.66

  1. ASA denotes accessible surface area; BC denotes biochemical contacts; Phy means physicochemical features; ECS denotes evolutionary conservation score; SE means sequence entropy; and PSSM is the abbreviation of Position-Specific Scoring Matrix.