Introduction
The functional properties of a protein are strongly dependent on its structural conformation. The primary question addressed in this work is how to determine structural and therefore functional similarity from 3D protein structures. The approach we take relies on protein structure alignment, which elucidates functional protein relationships that are not depicted by the sequence.
Most current structural alignment tools are based on geometric properties of protein structures, and do not incorporate the effect of physical properties responsible for protein shape and similarity. The effect of physical features, such as hydrogen bonding, bond and torsion angles, hydropathy, and solvent accessibility, are not well understood and hence not used while developing structure alignment tools. In this work, we exploit physical properties in protein structural alignment algorithms. In particular, we incorporate dominant physical features in one of the exact protein structure alignment tools, CMOS [1], which is based on the contact map overlap maximization (MAX-CMO) formulation of protein structure alignment. CMOS is a state-of-the-art tool, providing fast structural alignments that are in excellent agreement with the SCOP fold family classification. An implementation of CMOS is accessible at http://eudoxus.cheme.cmu.edu/cmos/cmos.html.