Table 1 Data transformations

From: Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology

| Method | Feature Vector Representation | Data Used |
| --- | --- | --- |
| P | $\langle \phi_s(\alpha^{k}, Q) \rangle$ | Primary Sequence |
| PS | $\langle \phi_s(\alpha^{k_1}, Q_p),\ \phi_s(\{C, E, H\}^{k_2}, Q_s) \rangle$ | Primary Sequence and Secondary Structure |
| PF | $\langle \phi_s(\alpha^{k}, Q),\ \phi_f^{C,E}(Q) \rangle$ | Primary Sequence and Frequency Priors |
| PH | $\langle \phi_s(\alpha^{k}, Q),\ \phi_H(Q) \rangle$ | Primary Sequence and Homology |
| PSH | $\langle \phi_s(\alpha^{k_1}, Q_p),\ \phi_s(\{C, E, H\}^{k_2}, Q_s),\ \phi_H(Q) \rangle$ | Primary Structure, Secondary Structure, Homology |
| BLAST | BLAST | Homology |

  1. A summary of the six data processing methods used to identify and classify DNA repair proteins.
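
To make the feature-vector notation more concrete, the following is a minimal sketch of how the P and PS representations could be computed. It assumes that φ_s denotes a k-mer count (spectrum-style) map over the given alphabet, which the table itself does not state. The function names, the 20-letter amino acid alphabet, and the example inputs are illustrative assumptions, not the authors' implementation.

```python
from collections import Counter
from itertools import product

# Assumed alphabets: the 20 standard amino acids for the primary sequence,
# and the three secondary-structure states {C, E, H} named in the table.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
SS_STATES = "CEH"


def spectrum_features(sequence, alphabet, k):
    """Count occurrences of every length-k word over `alphabet` in `sequence`.

    Returns a list with one entry per possible k-mer, in lexicographic
    order of `alphabet`. This is an assumed reading of phi_s(alphabet^k, Q).
    """
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    return [counts["".join(word)] for word in product(alphabet, repeat=k)]


def ps_features(primary, secondary, k1, k2):
    """Concatenate primary-sequence and secondary-structure k-mer counts,
    mirroring the PS row: <phi_s(alpha^k1, Q_p), phi_s({C,E,H}^k2, Q_s)>.
    """
    return (spectrum_features(primary, AMINO_ACIDS, k1)
            + spectrum_features(secondary, SS_STATES, k2))


if __name__ == "__main__":
    # Hypothetical toy inputs, chosen only to show the vector layout.
    vector = ps_features("MKVLAAGK", "CCHHHHEC", k1=1, k2=2)
    print(len(vector))  # 20 single-residue counts followed by 9 two-state counts
```

Under the same assumption, the PH and PSH rows would append a homology-derived component φ_H(Q) to these vectors. That component is not sketched here because the table does not define it.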