Table 1 Data transformations

From: Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology

| Method | Feature Vector Representation | Data Used |
| --- | --- | --- |
| P | $\langle \phi_s(\alpha^{k}, Q) \rangle$ | Primary Sequence |
| PS | $\langle \phi_s(\alpha^{k_1}, Q_p),\ \phi_s(\{C, E, H\}^{k_2}, Q_s) \rangle$ | Primary Sequence and Secondary Structure |
| PF | $\langle \phi_s(\alpha^{k}, Q),\ \phi_f^{C,E}(Q) \rangle$ | Primary Sequence and Frequency Priors |
| PH | $\langle \phi_s(\alpha^{k}, Q),\ \phi_H(Q) \rangle$ | Primary Sequence and Homology |
| PSH | $\langle \phi_s(\alpha^{k_1}, Q_p),\ \phi_s(\{C, E, H\}^{k_2}, Q_s),\ \phi_H(Q) \rangle$ | Primary Structure, Secondary Structure, Homology |
| BLAST | BLAST | Homology |
  1. A summary of the six data processing methods used to identify and classify DNA repair proteins.
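
As an illustration of how a concatenated feature vector such as the PS row might be assembled, here is a minimal sketch. It assumes that $\phi_s$ denotes a k-spectrum map (counts of every length-k word over an alphabet), which the table itself does not define. The function names `spectrum_features` and `ps_features`, the 20-letter amino acid alphabet, and the example sequences are hypothetical and are not taken from the article.

```python
from collections import Counter
from itertools import product

# Assumed alphabets: 20 standard amino acids and the three
# secondary-structure states {C, E, H} named in the table.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
SS_STATES = "CEH"


def spectrum_features(sequence, alphabet, k):
    """Count each length-k word over `alphabet` in `sequence`.

    Returns a list with one count per possible word, in the fixed
    order produced by itertools.product(alphabet, repeat=k).
    """
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    return [counts["".join(word)] for word in product(alphabet, repeat=k)]


def ps_features(primary, secondary, k1, k2):
    """Concatenate the primary-sequence and secondary-structure spectra
    (an assumed reading of the PS row)."""
    return (spectrum_features(primary, AMINO_ACIDS, k1)
            + spectrum_features(secondary, SS_STATES, k2))


if __name__ == "__main__":
    # Toy inputs, invented for illustration only.
    vec = ps_features("ACDA", "CEEH", k1=1, k2=2)
    print(len(vec))  # 20 amino-acid counts followed by 9 two-state counts
```

Under the same assumption, the PH and PSH rows would append a homology-derived component $\phi_H(Q)$ to this vector; how that component is computed is not stated in the table, so it is omitted here.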