BMC Bioinformatics

Table 1 The features selected by the iterative genetic search algorithm with evaluation by Logistic Regression (LR), Bayesian Network (BN), Functional Tree (FT), REP Tree (RT), and Alternating Decision Tree (AT)

From: Transient protein-protein interface prediction: datasets, features, algorithms, and the RAD-T predictor

Feature	Count	LR	BN	FT	RT	AT
relSESA	10	□∙	□∙	□∙	□∙	□∙
esolv	6	□	□∙	∙		□∙
Density	6	∙		∙	□∙	□∙
ePot	5		□∙	□	□	∙
Scorecons	5	□∙	∙	□	□
rate4site	5	□	∙	∙	□
Disorder	5	□∙		∙	□	□
B-Factor	4		∙	□∙		□
Roughness	4	□∙	□∙
Hydro	3	∙		□	□
Protrusion	3	□∙			□
Propensity	3	□		□	∙
Curvature	2		□∙

A white box (□) indicates that the feature was selected when the algorithm was tested on all proteins using leave-one-out cross-validation; a black circle (∙) indicates that the feature was selected when the algorithm was tested on the NI1 subset. Count is the total number of times a feature was selected across the five classifiers and the two datasets tested.
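
As an illustration of how Count relates to the markers, the following minimal sketch tallies the symbols for three rows of the table. It assumes the column positions in the extracted table are faithful to the published one; the names `rows` and `selection_count` are hypothetical and are not taken from the article.

```python
# Markers copied from three rows of Table 1; columns are LR, BN, FT, RT, AT.
# "□" = selected on all proteins, "∙" = selected on the NI1 subset,
# "" = not selected by that classifier on either dataset.
rows = {
    "relSESA": ["□∙", "□∙", "□∙", "□∙", "□∙"],
    "esolv":   ["□", "□∙", "∙", "", "□∙"],
    "Density": ["∙", "", "∙", "□∙", "□∙"],
}


def selection_count(cells):
    """Count every marker (□ or ∙) across the five classifier columns."""
    return sum(cell.count("□") + cell.count("∙") for cell in cells)


counts = {feature: selection_count(cells) for feature, cells in rows.items()}
print(counts)  # {'relSESA': 10, 'esolv': 6, 'Density': 6}
```

The tallies agree with the Count column for these three rows. With five classifiers and two datasets, the maximum possible Count is 10, which only relSESA reaches.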


ISSN: 1471-2105
