Insights into mechanism kinematics for protein motion simulation

Background The high demanding computational requirements necessary to carry out protein motion simulations make it difficult to obtain information related to protein motion. On the one hand, molecular dynamics simulation requires huge computational resources to achieve satisfactory motion simulations. On the other hand, less accurate procedures such as interpolation methods, do not generate realistic morphs from the kinematic point of view. Analyzing a protein’s movement is very similar to serial robots; thus, it is possible to treat the protein chain as a serial mechanism composed of rotational degrees of freedom. Recently, based on this hypothesis, new methodologies have arisen, based on mechanism and robot kinematics, to simulate protein motion. Probabilistic roadmap method, which discretizes the protein configurational space against a scoring function, or the kinetostatic compliance method that minimizes the torques that appear in bonds, aim to simulate protein motion with a reduced computational cost. Results In this paper a new viewpoint for protein motion simulation, based on mechanism kinematics is presented. The paper describes a set of methodologies, combining different techniques such as structure normalization normalization processes, simulation algorithms and secondary structure detection procedures. The combination of all these procedures allows to obtain kinematic morphs of proteins achieving a very good computational cost-error rate, while maintaining the biological meaning of the obtained structures and the kinematic viability of the obtained motion. Conclusions The procedure presented in this paper, implements different modules to perform the simulation of the conformational change suffered by a protein when exerting its function. The combination of a main simulation procedure assisted by a secondary structure process, and a side chain orientation strategy, allows to obtain a fast and reliable simulations of protein motion.


Background
To address functional requirements or interact with other biological molecules, proteins undergo structural changes of variable degree, varying between distinct overall conformations, of which only some are usually determined experimentally (i.e the activated and inactivated forms of an enzyme). This is caused by the difficulties in the obtention of X-ray quality diffracting crystals, and consequently, it limits the knowledge on the dynamic behavior of the biomolecular machinery in important biological http://www.biomedcentral.com/1471-2105/ 15/184 Recently, and thanks to the information available related to protein science, new approaches have been proposed in the literature [5,6] to simulate protein motion. These approaches are not based on either quantum mechanisms, or biology related roots, but deal with mechanism and robot kinematics principles [5]. One of the main advantages of these new methodologies is their small computational cost. One of the first applied methods is Probabilistic Roadmap Method (PRM) [6][7][8][9][10][11]. This method consists in discretizing the protein configurational space. Then, each position is evaluated against a scoring function (force field, empirical, etc) and it is considered either correct or incorrect. Once every position has been checked it is possible to trace a path connecting neighboring positions to obtain the protein motion. The PRM is used on a wide variety of protein motion studies. In [12,13] it is proposed to use this approach to the simulation of ligand-protein interaction. In those works, it is proposed to consider the degrees of freedom of the ligand, as well as some degrees of freedom of the protein (mostly side chains related degrees of freedom). In [6], a similar approach, considering only the ligand degrees of freedom, is proposed. In [14], it is studied how restrictions to the possible motions of the protein backbone affect the search algorithms used for PRM. Although this approach yields quite good results, the need of computing all possible configurations of the protein structure makes this procedure computationally costly, especially for big proteins.
One important approach used for protein simulation, is related to the Normal Mode Analysis (NMA) implemented in mechanisms and robotics. This analysis provides information related to vibrational modes of mechanisms, useful for the dynamic analysis of their structure [15]. This approach, computationally much less expensive than PRM, may be applied to protein structures [16,17]. Using this methodology, information related to possible movements of a protein structure around its current configuration is obtained. Thus, although computationally less expensive, the information provided is not complete. New procedures combine both NMA and PRM to obtain large conformational changes in proteins [18]. In this approach NMA results are used to guide the PRM algorithms and reduce their computational cost. Kinetostatic compliance method [19,20] makes use of several kinematic theories to simulate protein motion. Firstly, it takes advantage of zero notation [21] to simplify protein structure definition during the procedure. Secondly, It also implements ball and rods modelization, considering both bond lengths and angles as constant. To execute the simulation process the protein potential energy field is transformed into equivalent forces and torques applied to the protein chain. Basing on the applied torques, it calculates the dihedral angle increments for the next step of the simulation.
In this paper, we present a new methodology based on our previous works [22,23] for protein motion simulation. The objective of the procedure is to morph a protein from one known configuration to another known one, providing reliable and quick information in relation to protein kinematics and movement with a very low computational cost, (low enough to be used on a normal PC). The simulation procedure presented in this paper is composed of four independent strategies. The first one consists in a normalization procedure aimed to homogenize equivalent bond distances and angles in the protein structure [22]. The second one is a main simulation procedure entrusted to advance in the simulation obtaining valid structures [23]. A third strategy is a procedure intended to reduce protein's potential energy by changing side chains orientation. And the final one detects secondary structures among the protein chain. The novelty of the proposed procedure consists, on the one hand, in the implementation of the side chain orientation strategy and the secondary structure detection method, and on the other hand, in the simultaneous combination of the aforementioned four strategies. Consequently, the approach provides a computationally efficient simulation tool for protein motion simulation. To validate the results, three indicators are measured through the simulation process: (i) backbone atom root mean square deviation to compare obtained structures global similarity, (ii) Ramachandran plots, to ensure proteins biological nature and (iii) protein's potential energy to verify that no steric clashes have occurred during the simulation [24].

Preparing protein structure for kinematics modeling
Ball and rods models provide model structures valid to apply mechanism theorems for protein simulation [21]. Most protein structure models whose target is protein simulation incorporate some simplifications. Cα meshes are used in [25][26][27] to produce a reduced model with an acceptable computational cost. Rigid bond and angle approaches are used in the same way in [28,29]. Normal mode analysis (NMA), usually mixes rigid bonds with springs to produce the structure needed for modal analysis [17,30]. Side chains are also simplified in various ways to reduce the computational cost associated to them. In [8] it is proposed to treat side chains as spheres filling an equivalent space. In [28], the authors propose to adjust the size of side chains to reduce its influence on the simulation, and later on, resize them progressively.
In this paper we propose an all atom model, based on ball and rods approach, in which some simplifications are considered to reduce the overall computational cost. In particular, Protein degrees of freedom are reduced to rotations around the dihedral angles. Every other possible atom movement resulting from bond stretching or non-http://www.biomedcentral.com/1471-2105/15/184 proper torsions is despised. Besides the relative position among peptide plane atoms is maintained constant during the simulation process. The peptide bond angle ω value is also limited to 0°or 180°. Regarding side chains, according to the proposed all atom model, every atom should be taken into account. In relation to side chains' degrees of freedom, to reduce the computational cost of the process, only the rotation around the Cα − Cβ bond is considered.
As previously stated, the procedure simulates protein motion between two known structures. Most structural data for protein simulation come from experimentation. Thus, these data need to be compared with the proposed model to verify that all the hypotheses are fulfilled. One of the major drawbacks of trying to use kinematic theories to protein simulation is the difference between proteins and mechanisms structures. In mechanism kinematics the linkages do not change their structural form or characteristics during the movement, unless the objective of the mechanism lies precisely in that requisite [31]. However we must bear in mind that proteins are composed by atoms that atoms that are bonded by electromagnetic and covalent forces, thus, in the case of proteins, there is no need to maintain constant neither bond length nor bond angles during the conformational change.
Therefore a normalization procedure is applied to experimental data with the purpose of homogenizing equivalent bond distances and angles in the protein structure. The approach is based on two different normalization processes, peptide plane normalization and bond length normalization.
Peptide planes are normalized to get exactly 0°or 180°p eptide bond angles. This complies with the proposed hypothesis of rigid peptide planes among the simulation [23]. The objective of normalizing the peptide planes is to assign to ω i the angles 0°or 180°, as it is proposed in the kinematic model. To do this, Cα i and Cα i+1 atoms' coordinates are fixed. Then, using the least square method, the mean plane is calculated using the local coordinates of and Cα i+1 atoms. Fixing Cα i and Cα i+1 atoms' coordinates allows to maintain protein backbone continuity during the normalization process, as the normalization of one peptide plane does not alter the one that has been already normalized.
Local reference systems to determine relative locations of amino acids atoms have also been used in [32,33]. Here, Cα i is selected as the origin of the local coordinate system. The straight line connecting Cα i and Cα i+1 defines u i axis direction. w i axis direction is defined by the perpendicular direction to the plane containing Cα i and C i atoms and u i axis. Finally, v i completes the right-handed reference system. The advantage of using this local reference system, is that the middle plane will always contain u i axis, so the least square problem is reduced to the calculus of the slope of this plane. This is made by the following formula: where v j , w j are the local coordinates of C i , O i , N i+1 atoms and a is the slope of the plane to be calculated. To find the value of a, Eq. (1) is differentiated with respect to a obtaining the result in Eq. (2): Once the slope of the plane has been defined, atoms C i , O i and N i+1 are projected onto it (Figure 1).
In a second stage, distance constraints are applied to ensure that every bond length is identical between two same type of atoms. To apply these distance constraints, we need to define standard bond lengths. For this matter, reference bond lengths from the AMBER force field are used [34]. Distance constraints are sequentially applied from the fist atom of the protein chain to the last one. To apply the distance constraint, the formulation proposed in [35,36] is used. Thus, using j as a reference atom, the distance constraint is applied to the bond between j and i atoms according to the following expressions: where r 1 j and r 0 i represent the position vectors of the atoms j and i respectively before the application of the corresponding distance constraint, whereas r 1 i is the vector determining the new position of atom i. d ji defines the theoretical mean value of the bond length. Once the distance constraint has been applied to i atom, every subsequent atom of the protein chain is translated by r 1 i − r 0 i . As Eq. (3) defines a sphere of radius d ji it is necessary another constraint to univocally determine the position of i atom. This new condition determines that the new position fo i atom must lie on the straight line defined by the actual positions of i and j atoms. In Figure 2, the application of the distance constraint to C i (j atom) and N i+1 (i atom) is represented.
To test the viability of the normalization process, Type-C PPase protein from S. gordonii is used, which is supposed to suffer a conformational change from a "closed" (PDB 1k20) to an "open" (PDB 1k23) structure upon binding of pyrophosphate (PPi). SgPPase exists as an homodimer in solution, in which each subunit is composed of two distinct domains. The N-terminal domain consists of a six-stranded parallel b-sheet flanked by extended loops on one side and a helical subunit on the other one. In the absence of ligands, the C-terminal domains occlude the corresponding active sites. Binding of PPi or substrate analoges to the active site of the N-terminal domain presumably triggers a conformational change that liberates the inhibition imposed by the Cterminal region [37]. Three strategies have been tried, applying distance, planes and sequential (planes and next distance) normalization processes. The results for the normalization of the initial and final reference conformations of the movement are shown in the Table 1 where they are compared against the protein before the normalization. Table 1 shows that the normalization processes have not altered the global structure of the proteins, the higher root mean square deviation being only 0.37 Å. Regarding potential energy variation, length normalization process reduces its value in both 1k20 and 1k23 proteins. However, the proteins' potential energy value is increased by 3% on 1k20 protein and by 1% on 1k23 protein on the planes normalization process. This effect is the result of projecting the atoms onto the calculated middle plane, producing the bonds distances change. Finally, sequential normalization process reduces this effect by first applying peptide plane normalization and next length normalization. As stated before, every distance constraint displaces the subsequent atoms of the protein chain, thus not altering already normalized peptide planes. In every normalization process, Ramachandran plot values indicate that the normalized protein structures have always maintained their biological meaning. After analysing these results, from these three normalization processes the sequential one has been chosen. This normalization process fits the experimental data with the proposed model of the protein structure. Additionally, the existence of normalized peptide planes allows to exclude the peptide bond angle ω as a degree of freedom.

Simulation procedure for dynamic dihedral angle increments adjustment
The main target of the methodology described in this paper is to simulate the protein motion between two known positions. To that aim, we have simulated the conformational changes suffered by three different proteins, (i) Type-C Inorganic Pyrophosphatases; (ii) Troponin C; and (iii) Calmodulin, ranging from the displacement of an entire domain to the reorientation of few secondary structure elements. The proposed model of the protein structure permits considering only the dihedral angles as proteins degrees of freedom to produce the motion. This configuration resembles the protein with a very long serial robot. Thus, the motion of the protein can be defined as a sequence of incremental steps applied on these dihedral angle values, from the initial to the final conformations. Data structures of the proteins under study are taken from the Protein Data Bank (PDB), which are used as input data for the procedure. The simulations were carried out using a software developed by our research group called GIMPRO [38]. Several options are implemented in the software as it is stated subsequently. The software is able to read and save protein data with .pdb file format. It also produces text files with the evolution of both studied protein's potential energy and rmsd evolution. Finally the software creates a .pdb file containing the obtained simulation. This file is compatible with other visualization programs like PYMOL in order to produce higher quality renderizations of the protein movement.
The process uses dihedral angles increment data, φ i and ψ i , to complete the simulation. This data is obtained by calculating dihedral angle values of the final conformation φ To assess the quality of the obtained structures, root mean square deviation (rmsd), Ramachandran plots [39] and potential energy will be evaluated during the simulation process. Regarding the root mean square deviation, backbone atoms are considered for its calculus. To evaluate proteins' potential energy, AMBER potential force field, with the parameters proposed by Cornell [34] has been chosen. The use of the three indicators ensures the global similarity between the structures (rmsd), the nonexistence of steric clashes (potential energy) and the biological sense of the obtained structures (Ramachandran plots). Intermediate data structures for rmsd comparison are obtained from the Morph server [40].
In Figure 3 the flux diagram of the simulation procedure is shown. The first step is to calculate protein's initial conformation potential energy E 0 . To obtain the next protein's conformation, the procedure starts rotating the dihedral angles of the protein in sequential order, from the first amino acid to the last one. One important characteristic of the process is that it stores the increment produced by each rotation on the protein's potential energy as E k i , where k is the actual step number. Once the process has finished the actual step's proteins' potential energy is obtained (E k ).
To avoid steric clashes, the process checks if E k has exceeded an admissible threshold. This energy threshold is defined by k and the protein's potential energy value E 0 . In every iteration, E k value must be below E 0 + E 0 · k . To calculate k the following formula is used: where determines the maximum change percentage in the protein's potential energy. As defined before, p is the total number of steps of the simulation. If E k has exceeded the imposed threshold for the current step, the procedure detects the dihedral angle that has generated the higher energy increment. Once detected, the applied rotation is rolled back. The procedure then applies a new rotation with a reduced increment value ( ψ i /2, φ i /2). Again, protein's potential energy is calculated and checked against the admissible threshold. If the threshold is exceded, the new reduced rotation is undone and the dihedral angle is blocked for the current step. This process is repeated for each dihedral angle of the protein chain. During the simulation, the number of times each degree of freedom is blocked is saved. When this number reaches an m value (user defined), the procedure considers that the dihedral angle cannot rotate in its assigned direction, changing the rotation direction of this dihedral angle ( ψ i = − ψ i , φ i = − φ i ) for the next n steps (user defined). Each dihedral angle is independently studied by the procedure, each one with its own record of m and n parameters.
As the procedure has the capacity to change both dihedral angle increment values and rotation direction, once per step, if a degree of freedom has been blocked, p is incremented by 1. This is done with the purpose of better adjusting the energy tolerance of each step which is calculated as a function of p (see Eq. (6)).

Procedure for side chain orientation
In the strategy on charge of orienting the side chains, the main objective is to achieve a low computational cost. Therefore, in this first stage, only the rotational degree of freedom of Cα i − Cβ i bonds will be considered (see Figure 4).
Another target is to reduce the potential energy value of the protein. This objective may lead to an energy minimization algorithm that could produce an unacceptable increment on the computational cost. To avoid this problem, as it will be explained subsequently the proposed procedure is based on the guidelines shown in Figure 5.
The procedure starts calculating the optimal rotation direction for each side chain. To do so, each side chain is rotated in both possible directions calculating the produced energy increments E i and E i in the process. The selected rotation direction is the one that has produced the higher potential energy decrement, or if both rotation directions have increased the potential energy value, the direction that has produced the lesser energy increment is selected. Accordingly the rotation direction for each side chain is independently defined.
Once the rotation directions are defined, the procedure proceeds to rotate 2°each side chain in its optimal direction. The value has been defined to ensure that no displacement on the edges of long side chains produces steric clashes. After this rotation, it is checked that if the obtained protein's potential energy E k is lower than the main simulation process produced structure's one (E k ). On the contrary, the process undoes the rotations and new rotation of 1°. Again, obtained protein's potential energy E k is compared with E k . If again, E k is higher than E k , the process undoes the applied rotations and rotates only those side chains that have produced a decrement on the protein's potential energy. After the last rotation the values of E k and E k are checked again, and in the case that E k is higher than E k last applied rotations are undone, leaving the structure in the initial stage before starting the side chains orientation procedure.
It could be considered unnecessary to validate the last check between E k and E k since only those side chains that have produced a decrement on protein's potential energy have been rotated. However, the latter verification must be done because each side chain rotation direction has been independently calculated, without considering the influence of neighboring side chains rotations. A decrement on the potential energy due to the rotation of a single side chain does not ensure that when every other side chain is rotated that decrement is maintained.

Procedure for secondary structure detection by dihedral angle parameters evaluation
The procedure presented in this paper to detect secondary structures uses the dihedral angle values, which are obtained in any simulation program. A previous step to carry out this strategy is to perform a classification of each amino acid attending to their dihedral angle values as follows: • Candidate: an amino acid is considered as a candidate when its dihedral angles are inside a zone of the Ramachandran plot belonging to a secondary structure. • Stable: an amino acid is considered stable when the procedure has checked that it belongs to a secondary structure.

Figure 7
Simulation process diagram including side chain orientation module and secondary structure detection module.
• Unstable: an amino acid is considered unstable when it cannot be classified as candidate or stable.
To define an amino acid as a candidate, a tolerance has been incorporated with respect to the theoretical values of the dihedral angles. This tolerance has been adjusted to obtain the best possible results, obtaining a final value of 30°for each type of secondary structure.
Once the process has classified each amino acid as candidate, stable or unstable, it is possible to start searching for secondary structures. In an initial step, the first candidate amino acid of the protein chain is found and selected. After that, it is checked if the two subsequent amino acids are candidates of the same type of secondary structure. In this case, the three amino acids are classified as stable amino acids. The process continues checking if the next amino acids belongs to the same type of secondary structures, until an unstable amino acid or an amino acid of another type of secondary structure is found. This process is repeated until every amino acid of the protein has been checked. In Figure 6 a diagram of the secondary structure detection process is shown.
The use of a high tolerance resulted in unsatisfactory results, specifically on coil parts of the protein structure. As stated, one of the conditions for an amino acid to be set as a stable amino acid is to be part of a chain of three consecutive amino acids of the same type of secondary structure.This condition reduces the detection of secondary structures on coils of the protein structure, allowing to maintain the tolerance value.
To test the results seven proteins have been analyzed. The proteins under study have allowed us to validate the procedure with different protein sizes and different secondary structure distributions. The obtained results have been compared with each proteins' structural data, available on the Protein Data Bank (PDB). The results are shown on Table 2.  It can be conclude that the strategy requires a very low computational cost, needing only 8ms to complete the secondary structure detection on the biggest protein, 3sza. Thus, its implementation on the simulation procedure does not increase the overall computational cost.
Once both procedures have been explained, it is essential to assess how to implement them on the simulation process. The aim of developing both processes, side chain orientation and secondary structure detection procedures, is to keep them independent from the main simulation process. This way, if any of them fails to achieve its objective, i.e., detecting no secondary structure or not reducing the protein potential energy value, the main simulation process can continue. For this reason, the proposed sequence of the strategies is shown in Figure 7.
As it can be seen in Figure 7. the secondary structure detection procedure is executed prior to the main simulation process. This allows to extract the degrees of freedom already located on secondary structures, therefore, reducing the number of degrees of freedom from the first step. On the other hand, side chains orientation procedure is carried out after the main simulation process. In the simulation process it is required to obtain a valid structure prior to reducing its energy making use of the side chains orientation procedure. If the latter procedure successes on reducing the protein's potential energy value, then the main simulation process has a higher amount of available energy for the next step.

Results
Each of the simulation strategies will be applied to three different proteins. The first protein, type C inorganic Pyrophosphatase (family II) from Streptococcus gordonii protein, pdb entry 1k20 (see Figure 8). With the proposed modelization, the protein has 4732 atoms with 604 degrees of freedom. The next protein is the Troponin C protein, pdb entry 1zac (see Figure 9). Again, applying the  proposed modelization the protein has 1347 atoms and 176 degrees of freedom. Finally, the last protein is the Calmodulin protein, pdb entry 3cln (see Figure 10) with 2201 atoms and 284 degrees of freedom. To test the aforementioned strategies the following type of combinations are proposed: • Type 1: General simulation procedure and the side chain orientation strategy will be applied. • Type 2: The three simulation strategies are applied.
• Type 3: The same as Type 2 but with one exception: the degrees of freedom of side chains located on secondary structures will be blocked.
To select the parameters, the results obtained in previous simulations made with the main simulation process [22,23] have been considered. In particular, the simulation parameters have been set as follows: p = 100, ε = 10%, m = 3, n = 1. Simulations are run in a PC under Windows XP, with a pentium core 2 duo 2.13 GHz making use of a single core.
In Table 3 the results for 1k20 protein are shown. As it can be observed we obtain a minimal rmsd value of 5.97 Å with the Type 2 simulation strategy. The Ramachandran plot values indicate that the biological meaning of the protein has been ensured in all simulation strategies. Energy values also indicate the non-existence of steric clashes on the protein structure. Regarding the computational cost of the process, it needs 43 minutes on both Type 1 and Type 2 simulation strategies and 31 minutes on the Type 3 simulation strategy to obtain each intermediate structure of the movement. It can be concluded that the computationally most expensive procedure is the side chain orientation one. This fact is corroborated with the results of Type 3 simulation. In this simulation strategy, the degrees of freedom of side chains located on secondary structures are extracted from the main simulation. In Table 3 it can be seen that the computational time is reduced by 38% for each intermediate structure. In Figure 11 several snapshots of the simulation are shown.
Results for 1zac protein are shown in Table 4. In this protein minimum rmsd error is obtained on Type 3 simulation strategy with 2.52 Å. Again Ramachandran plot values confirm the biological meaning of intermediate structures. Energy values also indicate that steric clashes have been avoided during the simulation procedure. Lastly, regarding the computational cost of the simulation for this protein, the procedure has needed about 2 minutes to obtain each intermediate structure of the motion, completing the simulations in about 80 minutes.
Finally, the results for 3cln protein are shown in Table 5. The simulation procedure obtains a rmsd error of 6.34 Å on Type 1 simulation strategy. Again the Ramachandran plot value ensures the biological meaning of the obtained intermediate structures and energy values indicates that no steric clashes have been produced. As it can be seen, neither Type 2 nor Type 3 simulation strategies are able to achieve a valid simulation. In this particular protein morph, secondary structures degrees of freedom (which have been extracted from the main simulation) are needed to form the central α-helix. This fact causes the failure of the aforementioned simulation strategies.
The superpositions between obtained final structures (in red) and data structures (in green) are presented in Figure 12.

Discussion
The molecular mechanisms underlying the activity of many proteins involve conformational transitions by hinge-bending, which involves the movement of relatively rigid parts of preserved geometry about flexible joints. The detection of both, the rigid domains and the ginger regions, has been largely studied during the last decades [41,42]. The graphical representation of these dynamic events is not only essential to understand how  the processes take place, but it helps to unravel mechanistic aspects that are difficult to visualize by less intuitive indirect approaches. One of the main limitations on the simulation of these conformational transitions is the huge computational cost of actual simulation strategies. Actual approaches struggle to obtain morphs that fulfill both biologic and kinematic requisites. The procedure presented in this paper satisfies both requirements with a good cost-error relation in the studied proteins.
All the procedures presented in the paper make use of the same protein model. In this model bonds are considered as rigid links and the degrees of freedom have been limited to rotations around dihedral angles and each side chain's first bond rotation. These hypothesis produces a computationally very efficient model of the protein structure with the sufficient mobility to simulate conformational changes.
Current work is focused on the simulation of conformational changes on dimers which posses similar hingebending mechanisms. The low computational cost of the presented procedures may be enough to deal with these types of molecules.

Conclusions
A recurring question in the analysis of molecular mechanisms underlying the regulation of a protein deals with the possible structural routes that let it evolve between two different conformations. The identification of these pathways sets the structural basis for the rational design of molecules that act as modulators (activators or inhibitors) of the proteins activity, becoming an essential tool for the development of new drugs in the pharmaceutical industry.
Few alternatives arise at the time of simulating protein motions. Low computational cost methodologies offer fast results but usually with kinematically senseless trajectories with impossible backbone movements between consecutive positions of the simulation. These methods are based on restrained interpolations of atoms coordinates [40], and rely on intermediate energy minimization processes to solve the steric clashes produced during the simulation. On the other hand, simulation procedures based on molecular dynamics need huge computational resources to complete successful simulations in an acceptable spam of time. Software packages like GROMACS [43,44], NAMD [45], AMBER [46] etc. do not only require shared computing architectures, but due to their complexity its use is limited.
The procedure proposed in this paper offers a fast and reliable method to obtain the motion of the protein.
The procedure runs on a single processor and is fond for further improvement by implementing simple distributed computing algorithms. This procedure maintains the kinematic continuity of the movement and ensures the biological sense of the obtained structures.
The presented procedure has been implemented in a new bioinformatic package with the aim of facilitating the comprehension of the processes by which biological machines perform their function. The simulation http://www.biomedcentral.com/1471-2105/15/184 strategies described herein help the user to understand the behavior of these mechanisms. The described procedures require, at least partially, the availability of the initial and final conformations adopted by the biological machine under analysis. In this regard, the validation indicators implemented in the proposed simulation processes, help to overcome the lack of knowledge in protein structures by providing a modeling tool to reconstruct the fold of a target protein from homologous molecules in other organisms. Moreover, it may also help in deciphering the molecular mechanisms underlying metabolic processes, signaling pathways or transport events, as well as in mapping specific "conformational routes" that characterize the dynamic behavior of a promiscuous protein motif (i.e cystathionine beta synthase (CBS) domains), that undergoes different structural changes upon binding distinct types of ligands (see [47][48][49]). It should not be neglected the capacity of our software to improve structural search models in molecular replacement methods during the elucidation of novel crystal structures by X-ray diffraction techniques.