Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease
BMC Bioinformatics volume 11, Article number: O4 (2010)
Enzyme catalysis is involved in numerous biological processes, and the disruption of enzymatic activity has been implicated in human disease. Despite this functional importance, various aspects of catalytic reactions are not completely understood, such as the mechanics of reaction chemistry and the geometry of catalytic residues within active sites. The computational prediction of catalytic residues therefore has the potential to identify novel catalytic pockets, aid in the design of more efficient enzymes, and predict the molecular basis of disease.
We proposed a new kernel-based algorithm for the prediction of catalytic residues, and functional sites in general, in protein structures [1]. The method relies upon explicit modelling of similarity between residue-centred neighbourhoods in protein structures. Specifically, we start by constructing oriented structural neighbourhoods and then partition the neighbourhood volume into small cells. The similarity between two structural neighbourhoods is accumulated from their similarities in each cell. The kernel function is a product of three kernels, each addressing a separate aspect of protein function: (i) the geometric kernel addresses the shape similarity, (ii) the chemical kernel addresses the similarity in physicochemical properties, and (iii) the evolutionary kernel addresses the similarity of conservation patterns for the residues in two structural neighbourhoods. Our approach was favourably evaluated against two of the leading alternative approaches, FEATURE [2] and GBT [3], as shown in Table 1. The new algorithm was used to identify known mutations associated with inherited disease whose molecular mechanism might be predicted to operate specifically through the loss or gain of catalytic residues. It should therefore provide a viable approach to identifying the molecular basis of disease in which the loss or gain of function is not caused solely by the disruption of protein stability. Our analysis suggests that both loss and gain of catalytic residues are actively involved in human inherited disease.
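The cell-wise accumulation and the product of three kernels described above can be illustrated with a minimal sketch. It is not the published implementation: the abstract does not give the functional forms, so the Gaussian (RBF) form of each component kernel, the cubic cells, the cell size, the `gamma` parameters, and the names `to_cells`, `rbf` and `neighbourhood_kernel` are all assumptions made for illustration. Neighbourhoods are assumed to be already oriented, with coordinates expressed relative to the central residue.

```python
import math
from collections import defaultdict

CELL_SIZE = 2.0  # hypothetical cell edge length (angstroms); not from the source


def to_cells(residues, cell_size=CELL_SIZE):
    """Bin the residues of an already oriented neighbourhood into cubic cells.

    Each residue is a tuple (xyz, chem, cons): coordinates relative to the
    central residue, a physicochemical feature vector, and a conservation score.
    """
    cells = defaultdict(list)
    for xyz, chem, cons in residues:
        key = tuple(math.floor(c / cell_size) for c in xyz)
        cells[key].append((xyz, chem, cons))
    return cells


def rbf(u, v, gamma):
    """Gaussian similarity between two equal-length vectors (assumed form)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))


def neighbourhood_kernel(res_a, res_b, g_geo=0.5, g_chem=1.0, g_evo=1.0):
    """Sum, over cells occupied in both neighbourhoods, of the product of
    geometric, chemical and evolutionary similarities of residue pairs."""
    cells_a, cells_b = to_cells(res_a), to_cells(res_b)
    total = 0.0
    for key in cells_a.keys() & cells_b.keys():
        for xa, ca, ea in cells_a[key]:
            for xb, cb, eb in cells_b[key]:
                k_geo = rbf(xa, xb, g_geo)        # shape similarity
                k_chem = rbf(ca, cb, g_chem)      # physicochemical similarity
                k_evo = rbf((ea,), (eb,), g_evo)  # conservation similarity
                total += k_geo * k_chem * k_evo
    return total


# Toy neighbourhoods with made-up values, for illustration only.
site_a = [((0.5, 0.5, 0.5), (1.0, 0.0), 0.9), ((3.0, 0.5, 0.5), (0.0, 1.0), 0.2)]
site_b = [((0.6, 0.4, 0.5), (0.8, 0.1), 0.7)]
print(neighbourhood_kernel(site_a, site_b))
```

Under these assumptions, only residues that fall in the same cell of the two oriented neighbourhoods contribute, so neighbourhoods with no shared occupied cell have similarity zero. A kernel of this kind could then be supplied to any kernel-based classifier as a precomputed similarity matrix.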
Our kernel method for functional site prediction based on protein structures compares favourably with established methods on the same data set using the same evaluation procedure. The results from applying our catalytic residue predictor to disease mutations indicate that both loss and gain of catalytic residues are actively involved in human inherited disease.
1. Xin F, Myers S, Li Y, Cooper D, Mooney S, Radivojac P: Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease. Bioinformatics 2010, 26(16):1975–1982. 10.1093/bioinformatics/btq319
2. Wu S, Liang MP, Altman RB: The SeqFEATURE library of 3D functional site models: comparison to existing methods and applications to protein function annotation. Genome Biol 2008, 9: R8. 10.1186/gb-2008-9-1-r8
3. Gutteridge A, Bartlett GJ, Thornton JM: Using a neural network and spatial clustering to predict the location of active sites in enzymes. J Mol Biol 2003, 330(4):719–734. 10.1016/S0022-2836(03)00515-1
Xin, F., Myers, S., Li, Y.F. et al. Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease. BMC Bioinformatics 11 (Suppl 10), O4 (2010). https://doi.org/10.1186/1471-2105-11-S10-O4
- Structural Neighbourhood
- Kernel Method
- Enzyme Catalysis
- Catalytic Residue
- Functional Site