Coordination characterization of zinc metalloproteins
© Yao et al; licensee BioMed Central Ltd. 2014
Published: 29 September 2014
Zinc metalloproteins play a key role in all three domains of life. Prior studies have explored the use of zinc coordination characteristics for the prediction of functions in metalloproteins. However, prediction methods have been hampered by the inaccuracy in characterizing the zinc coordination environment. In order to minimize the impact of this factor, we improved our method to better characterize zinc coordination.
Materials and methods
We developed a systematic and fully automatic method to characterize the zinc coordination environment. Zinc metalloproteins were acquired from the worldwide Protein Data Bank (wwPDB). The overall zinc coordination environment was characterized in terms of coordination geometries, coordinating ligands, bidentation status, as well as side chain dihedral angles. In addition to the three major coordinations, tetrahedral, trigonal bipyramidal, and octahedral, we also examined four minor coordinations, trigonal planar, trigonal pyramidal, square planar, and square pyramidal. We calculated the number of binding ligands for each zinc site using criteria derived from an analysis of ligand-zinc bond lengths. Zinc sites with four, five, and six ligands were considered independently. K-means was used to differentiate the geometries based on angle statistics. Random forest was then applied to explore the interactions within different geometries.
We can automatically cluster the coordination of zinc metalloproteins into major and minor geometries. The consideration of four additional coordination geometries enables the detection of potential specific coordination errors like missing ligands within wwPDB entries. Bidentation usually introduces an abnormal small angle, which makes the coordination unsuitable for any of the theoretical geometries and thus should be examined separately. Random forest, which excels at integrating both categorical and non-categorical information, has revealed some interesting intra- and inter-cluster connections in our dataset. The better classification and characterization of zinc coordination should help us gain insight into specific functional tendencies of zinc metalloproteins.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.