MAVENs: Motion analysis and visualization of elastic networks and structural ensembles
© Zimmermann et al; licensee BioMed Central Ltd. 2011
Received: 28 February 2011
Accepted: 28 June 2011
Published: 28 June 2011
The ability to generate, visualize, and analyze motions of biomolecules has made a significant impact upon modern biology. Molecular Dynamics has gained substantial use, but remains computationally demanding and difficult to set up for many biologists. Elastic network models (ENMs) are an alternative and have been shown to generate the dominant equilibrium motions of biomolecules quickly and efficiently. These dominant motions have been shown to be functionally relevant and also to indicate the likely direction of conformational changes. Most structures have a small number of dominant motions. Comparing computed motions to the structure's conformational ensemble, derived from a collection of static structures or frames from an MD trajectory, is an important way to understand functional motions as well as to evaluate the models. Modes of motion computed from ENMs can be visualized to gain functional and mechanistic understanding, and can be used to compute useful quantities such as average positional fluctuations, internal distance changes, collectiveness of motions, and directional correlations within the structure.
Our new software, MAVEN, aims to bring ENMs and their analysis to a broader audience by integrating methods for their generation and analysis into a user-friendly environment that automates many of the steps. Models can be constructed from raw PDB files or density maps, using all available atomic coordinates or by employing various coarse-graining procedures. Visualization can be performed either with our software or exported to molecular viewers. Mixed resolution models allow one to study atomic effects on the system while retaining much of the computational speed of the coarse-grained ENMs. Analysis options are available to further aid the user in understanding the computed motions and their importance for the function of the biomolecule.
MAVEN has been developed to simplify ENM generation, allow for diverse models to be used, and facilitate useful analyses, all on the same platform. This represents an integrated approach that incorporates all four levels of the modeling process - generation, evaluation, analysis, visualization - and also brings to bear multiple ENM types. The intention is to provide a versatile modular suite of programs to a broader audience. MAVEN is available for download at http://maven.sourceforge.net.
One of the first dynamic computations of a protein was published by Levitt and Warshel in 1975: the folding of a coarse-grained peptide chain. The first publication widely recognized as a Molecular Dynamics (MD) simulation came two years later and presented a simulation of the 58-residue bovine pancreatic trypsin inhibitor. Today's most advanced simulations, such as the fully atomic simulation of the ribosome (2.6 million atoms for a period of 10^6 CPU hours), represent significant improvements in computational technology and advance our understanding of the behavior of molecular systems.
There is growing evidence supporting the collectiveness of the motions of biomolecules [4–8]. Atomic MD models in detail the intrinsic randomness of the short time-scale motions. While true to the underlying atomic theories, this level of randomness may be distracting when analyzing motions of biomolecules on the longer time scale. To overcome such randomness, methods have been developed to determine the dominant motions within the trajectory - Principal Component Analysis (PCA) based on average covariances across the trajectory or Essential Dynamics. The results of PCA have been shown to agree well with anisotropic network model (ANM) modes [10, 11]. The rigor of molecular dynamics simulations makes them ideal for investigating specific events, but less applicable for extracting mechanisms, and overly detailed for general purposes.
Normal Mode Analysis (NMA) using Elastic Network Models (ENMs) shifts the focus from simulating the motion of all atoms based on a detailed empirical force field to the harmonic motions of a set of springs and masses representing the starting structure. An ENM calculation is also an analytic solution yielding a basis set of orthogonal independent motions, rather than a simulation over time. The various ENM types differ in how the masses (computed from atomic coordinates or density contours) and springs (their harmonic interactions) are assigned or modeled; the exact chemistry of the object is not usually considered. Despite their lack of detailed chemistry, ENMs have proven effective. Yang et al. as well as Bakan and Bahar find that the motions computed using ENMs correspond well to the principal components of MD trajectories as well as to the spatial variance observed when multiple crystallographically determined structures of the same protein are superimposed or taken from an NMR ensemble. Sen et al. emphasized the cooperativity of protein motions and how atomic detail is not necessary, while Yang et al. show the ability of elastic models to handle proteins across a broad range of sizes. From Jernigan et al. we learn that functionally important motions of biomolecules are often governed by packing density, which is the basis for ENMs. Lu and Ma show that shape also plays an important role. Early studies by Gō and Scheraga computationally defined the difference between local and collective motions, while Gō, Noguti, and Nishikawa demonstrated how the low frequency harmonic vibrations in proteins can be computed. From these and other studies, it is becoming increasingly evident that molecular structures exhibit collective motions and that these motions can be sampled well by using ENMs. Methods that include distance dependent springs [18–20] or torsional angles [21–23] show improvements over the uniform springs and are also implemented in our software.
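As an illustration of the spring-and-mass picture described above, the following is a minimal sketch of how the Hessian of a uniform-spring, cutoff-based anisotropic network model might be assembled from a list of node coordinates. It is not MAVEN's implementation: the function name `anm_hessian`, the default cutoff of 13 Å, and the unit spring constant `gamma` are assumptions chosen only for illustration.

```python
def anm_hessian(coords, cutoff=13.0, gamma=1.0):
    """Assemble the 3N x 3N Hessian of a uniform-spring, cutoff-based ENM.

    coords : list of (x, y, z) node positions (assumed to be in Angstroms)
    cutoff : illustrative interaction cutoff; node pairs farther apart
             than this are not connected by a spring (assumed default)
    gamma  : uniform spring constant (assumed default)
    """
    n = len(coords)
    hess = [[0.0] * (3 * n) for _ in range(3 * n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = [coords[j][k] - coords[i][k] for k in range(3)]
            dist2 = sum(c * c for c in d)
            if dist2 == 0.0 or dist2 > cutoff * cutoff:
                continue  # coincident or unconnected pair
            for a in range(3):
                for b in range(3):
                    # Off-diagonal 3x3 block: -gamma * (d d^T) / |d|^2
                    block = -gamma * d[a] * d[b] / dist2
                    hess[3 * i + a][3 * j + b] += block
                    hess[3 * j + b][3 * i + a] += block
                    # Diagonal blocks are minus the sum of the off-diagonal
                    # blocks, which keeps rigid translations at zero energy.
                    hess[3 * i + a][3 * i + b] -= block
                    hess[3 * j + a][3 * j + b] -= block
    return hess
```

The eigenvectors of this matrix are the normal modes and the eigenvalues are their squared frequencies; a distance dependent variant would replace the constant `gamma` with a function of the pair distance.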
ENMs are capable of computing the important motions of biomolecules on time scales beyond the usual reach of atomic MD, and do so efficiently. Generation for small and medium sized proteins takes only seconds or minutes on a standard desktop computer, while typical MD studies require days or weeks of computer time on high performance systems or clusters. The largest molecular assemblages may require more time, but they can be further coarse-grained without loss of the major motions, once again making the computation of the dominant motions tractable. It is also possible to solve numerically for only a subset of the normal modes, which is more efficient. The low frequency modes contribute much more to the total motion than do the high frequency modes. Thus, many of the high frequency modes can be ignored without loss of important information. Detailed analyses can be achieved with elastic models by employing mixed-resolution models where most of the structure is coarse-grained but regions of special interest remain at atomic detail. Even with the proven efficiencies GPU computing brings to MD, the simplicity and effectiveness of NMA using ENMs will ensure their continued use. We aim here to make them more accessible.
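The claim that low frequency modes dominate the total motion can be made concrete with a short sketch. In a harmonic model at equilibrium, the mean-square fluctuation contributed by a mode is proportional to the inverse of its eigenvalue, so the cumulative share captured by the first few modes can be computed directly from the eigenvalue spectrum. The function name and the zero-eigenvalue tolerance below are assumptions made for illustration.

```python
def cumulative_contribution(eigenvalues, zero_tol=1e-8):
    """Cumulative fraction of total mean-square fluctuation by mode.

    Each non-zero mode is weighted by 1/eigenvalue. Eigenvalues at or
    below zero_tol (an assumed tolerance) are treated as the rigid-body
    modes and skipped.
    """
    nonzero = sorted(v for v in eigenvalues if v > zero_tol)
    weights = [1.0 / v for v in nonzero]
    total = sum(weights)
    fractions = []
    running = 0.0
    for w in weights:
        running += w
        fractions.append(running / total)
    return fractions
```

Given a spectrum, `cumulative_contribution(spectrum)[k - 1]` is the share of the total fluctuation captured by the `k` lowest frequency modes, which is one way to decide how many modes need to be computed.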
Elastic Network Model
Many other model types have been developed, including methods in which bond or dihedral angles are explicitly taken into account [21–23]; these are implemented in MAVEN by incorporating the freely available Spring Tensor Model developed by Lin and Song. We also implement the nearest neighbor method, which utilizes a coarse-grained model (usually one point per residue) but assigns contacts between residues based on an atomic model of the system. To facilitate detailed analysis, one may employ mixed resolution modeling [29, 30]. In this scheme most of the structure is coarse-grained, but any region of interest is included in greater detail (see Figure 1D). One may then choose different cutoff values for the two regions and use the geometric mean of the cutoff values as the cutoff for connections between them.
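The mixed resolution cutoff rule described above can be sketched as follows. This is a hedged illustration rather than MAVEN's code: the function names, the level labels, and the cutoff values used in the usage note are hypothetical.

```python
import math

def pair_cutoff(level_i, level_j, cutoffs):
    """Cutoff for a node pair: the region's own cutoff when both nodes
    share a resolution level, otherwise the geometric mean of the two."""
    if level_i == level_j:
        return cutoffs[level_i]
    return math.sqrt(cutoffs[level_i] * cutoffs[level_j])

def mixed_resolution_contacts(coords, levels, cutoffs):
    """List the (i, j) node pairs connected by a spring.

    coords  : list of (x, y, z) node positions
    levels  : resolution label per node (labels are hypothetical)
    cutoffs : dict mapping each label to its cutoff distance
    """
    contacts = []
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            limit = pair_cutoff(levels[i], levels[j], cutoffs)
            if math.dist(coords[i], coords[j]) <= limit:
                contacts.append((i, j))
    return contacts
```

For example, with `cutoffs = {"coarse": 12.0, "atomic": 3.0}` a coarse node and an atomic node are connected when they lie within 6.0 of each other, while pairs inside each region use that region's own cutoff.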
Comparing ENMs to Ensembles of Structures
Perhaps the most important step in ENM modeling is choosing which points will represent the system. Originally, ENMs were proposed for use with all atom coordinates, but when it was learned that nearly the same motions are obtained with coarse-grained structures [12, 24], this became the more common approach. Within MAVEN, one may retain all atoms, choose standard representations like Cα atoms, pick specific atom types, or generate centroid points from residues, side-chains, or bases (for examples see Figure 1). A set of points can be further coarse-grained using spherical coarse-graining. This task is accomplished by selecting an initial point (or set of points) that will be retained. All points within a given radius of the retained point are removed from the model. The closest point that was not removed is then added to the retained set. This process continues until no more points can be removed. The result is a spatially more uniform distribution of points than one would likely have after selecting points linearly along the sequence. Finally, MAVEN allows the user to employ low resolution density maps (Figure 1E), a data source that has rarely been used with these methods. Since packing density and shape are the properties most critical to ENMs, model points picked along the desired density contour should provide a reasonable coarse-graining. Doruker and Jernigan have shown that similar motions are extracted from proteins and from the protein's molecular volume filled with lattice points, further showing the potential usefulness of density maps in ENM modeling. This method extends ENM analysis to the dynamics of very large structures for which no atomic coordinates are available.
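The spherical coarse-graining procedure described above can be sketched in a few lines. The text does not say which reference is used to pick the "closest point that was not removed"; this sketch assumes it is the most recently retained point, and it starts from a single initial point. The function name and signature are likewise illustrative.

```python
import math

def spherical_coarse_grain(coords, radius, start=0):
    """Return indices of the points retained by spherical coarse-graining.

    Assumption: the next retained point is the surviving point closest
    to the most recently retained one (ties broken by lower index).
    """
    retained = [start]
    candidates = set(range(len(coords))) - {start}
    current = start
    while True:
        # Remove every remaining point inside the sphere around the
        # point that was just retained.
        candidates = {k for k in candidates
                      if math.dist(coords[k], coords[current]) > radius}
        if not candidates:
            break
        # Retain the closest surviving point and repeat from there.
        current = min(candidates,
                      key=lambda k: (math.dist(coords[k], coords[current]), k))
        candidates.remove(current)
        retained.append(current)
    return retained
```

By construction every pair of retained points is separated by more than `radius`, which is what produces the more uniform spatial distribution noted in the text.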
Because of the usefulness of the ENM method, web and standalone applications have been constructed [23, 34–38]. Web servers have the advantage of near universal accessibility, but often lack flexibility and extensibility. Existing standalone applications tend to only implement one ENM type and often force the user to use one representation, for instance alpha carbons only, thereby preventing the use of nucleotides, sugars, or small molecules. In developing MAVEN we seek to incorporate many ENM methods including support for dihedral angle and mixed resolution modeling, as well as to facilitate model generation and shape based coarse-graining (see Methods).
The first major feature of this platform is the ability to construct many types of ENMs, whereas other available servers and applications are restricted to one or two types. These types include the standard cutoff based models, distance dependent springs, nearest neighbor, Spring Tensor, and mixed resolution. The nearest neighbor method generates a coarse-grained model, but uses an atomic model for determining connectivity. The Spring Tensor model expands the energy function of ENMs to account for bond and torsion angle changes. Mixed coarse-graining represents a compromise: part of the system remains in high detail with the remainder coarse-grained. With mixed resolution, one is able to analyze molecular effects on motions such as chemical modifications, mutations, drug binding, proline isomerization, or post-translational modifications, while retaining nearly the computational efficiency of coarse-grained NMA. A second feature of this application is the ability to handle large systems through sparse matrix methods and the ability to calculate only a set of the lowest frequency modes. Since the contribution of each mode to the total motion decreases quickly with increasing frequency, calculating only the lowest frequency modes captures the majority of the dynamics while requiring considerably fewer computer resources. A further benefit of MAVEN is that it is set up to accept protein, RNA, DNA, and small molecule coordinates. Starting from an unprocessed PDB file, one can generate a standard alpha carbon model; use our atom selector to save a subset of atom types for use in any ENM; pick points from electron density contours; generate united atoms representing the centroid of a set of atoms; or compose or edit an initial model using other software (such as a molecular viewer) and use MAVEN for ENM generation and analysis. See Figure 1 for examples of these model types.
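Two of the model-building options mentioned above, united atoms placed at a centroid and nearest neighbor connectivity taken from the atomic model, can be sketched together. The contact criterion used here (two residues are connected if any pair of their atoms lies within `atom_cutoff`) and the default of 4.5 Å are assumptions for illustration; the text does not specify the rule MAVEN applies. All function names are hypothetical.

```python
import math
from collections import defaultdict

def group_by_residue(atoms):
    """Group atom coordinates by residue id; atoms is a list of
    (residue_id, (x, y, z)) tuples."""
    groups = defaultdict(list)
    for res_id, xyz in atoms:
        groups[res_id].append(xyz)
    return dict(groups)

def residue_centroids(atoms):
    """One united-atom node per residue, placed at the mean position
    of that residue's atoms."""
    return {
        res: tuple(sum(p[k] for p in pts) / len(pts) for k in range(3))
        for res, pts in group_by_residue(atoms).items()
    }

def atom_based_contacts(atoms, atom_cutoff=4.5):
    """Residue pairs judged to be in contact from the atomic model.

    Assumed rule: a pair is connected if any two of their atoms are
    within atom_cutoff of each other.
    """
    groups = group_by_residue(atoms)
    residues = sorted(groups)
    contacts = []
    for idx, ri in enumerate(residues):
        for rj in residues[idx + 1:]:
            if any(math.dist(p, q) <= atom_cutoff
                   for p in groups[ri] for q in groups[rj]):
                contacts.append((ri, rj))
    return contacts
```

The nodes of the resulting network are the centroids, while the springs follow the atom-level contacts, so two residues whose centroids are far apart can still be connected when their side chains touch.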
Multiple analysis features are presently included. Selected methods are shown in Figure 2. These include the ability to analyze Principal Components constructed from multiple static structures, an NMR ensemble, or frames from an MD trajectory and compare them to the normal modes. Multiple studies have shown that the variance seen in ensembles of structures derived from MD trajectories, NMR, or X-ray crystallography can be reproduced with ENMs. This represents an important method for ENM model validation and further exploration of functional motions. MAVEN also has the ability to compare two ENMs of different types or having different parameter choices, and to analyze the effect of the mode-motions on subsets of the structure, comparing within or across subsets. For a full list of our analysis features and examples of their use, please consult our user's guide (Additional File 1). Future additions are likely to include automated methods for batch model generation and comparison, spectral analysis, or Markov Propagation Model simulations, which probe paths of information transfer within the structure and were recently cast into the ENM framework.
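A common way to compare a normal mode with a principal component is the overlap, the absolute value of the dot product of the two normalized vectors, together with a cumulative overlap over a set of modes. The sketch below shows these standard measures; it is not claimed to be the exact comparison MAVEN performs, and the function names are illustrative. The cumulative measure assumes the modes supplied are mutually orthogonal.

```python
import math

def normalize(vec):
    """Scale a flat displacement vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def overlap(mode, pc):
    """Absolute cosine between a normal mode and a principal component,
    both given as flat 3N-long displacement vectors. 1 means identical
    direction, 0 means orthogonal."""
    a, b = normalize(mode), normalize(pc)
    return abs(sum(x * y for x, y in zip(a, b)))

def cumulative_overlap(modes, pc):
    """How well a set of mutually orthogonal modes spans one principal
    component: the square root of the summed squared overlaps."""
    return math.sqrt(sum(overlap(m, pc) ** 2 for m in modes))
```

A cumulative overlap near 1 over the first few low frequency modes indicates that those modes capture most of the variation seen in the structural ensemble.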
Analysis and visualization of the resulting data can be performed within MAVEN. Alternatively, PyMOL, VMD, and other viewers specialized for visualization of molecular systems can be used. For this reason, animations of the modes are saved in PDB file format (each frame is a separate MODEL) so that any molecular viewer can be used to visualize them. The MAVEN interface is configured so that generating the animation files, loading them into a molecular viewer (PyMOL has been our preferred viewer), and setting up an appropriate initial view are all handled for the user.
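The multi-MODEL animation format mentioned above can be illustrated with a short sketch that displaces each node sinusoidally along a mode and writes every frame as its own MODEL/ENDMDL block. This is an assumption-laden illustration, not MAVEN's exporter: the function name, the default amplitude and frame count, and the placeholder labels (every node written as a CA atom of residue ALA in chain A) are all hypothetical.

```python
import math

def mode_animation_pdb(coords, mode, amplitude=2.0, n_frames=10):
    """Return a multi-MODEL PDB string animating one mode.

    coords : list of (x, y, z) node positions
    mode   : list of (dx, dy, dz) displacements, one per node
    Frame f places each node at position + amplitude*sin(2*pi*f/n)*mode.
    Atom, residue, and chain labels are placeholders.
    """
    lines = []
    for frame in range(n_frames):
        scale = amplitude * math.sin(2.0 * math.pi * frame / n_frames)
        lines.append("MODEL     %4d" % (frame + 1))
        pairs = zip(coords, mode)
        for i, ((x, y, z), (dx, dy, dz)) in enumerate(pairs, start=1):
            # Fixed-column ATOM record: coordinates occupy columns 31-54.
            lines.append(
                "ATOM  %5d  CA  ALA A%4d    %8.3f%8.3f%8.3f%6.2f%6.2f"
                % (i, i, x + scale * dx, y + scale * dy, z + scale * dz,
                   1.0, 0.0)
            )
        lines.append("ENDMDL")
    lines.append("END")
    return "\n".join(lines) + "\n"
```

Writing the returned string to a `.pdb` file gives a trajectory that viewers such as PyMOL or VMD typically load as a sequence of states or frames.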
MAVEN implements multiple types of ENMs for atomic, coarse-grained, and mixed coarse-grained representations and assists the user in generating these, permitting selection by atom or residue type, spherical coarse-graining, and united atom modeling (combining multiple atoms into one placed at their mean position). By implementing these and other methods for ENM model generation, MAVEN allows for diverse and detailed hypothesis testing. One may use sparse methods for fast mode generation, making large systems more tractable. Analysis of internal motions, directional correlations within the structure, comparing the mode shapes to the variance within a structural ensemble, and comparing anisotropies of motion are presently included. MAVEN, source code, and all optional components are freely available to assist the scientific community with dynamic studies of biomolecules.
Availability and requirements
Project name: MAVEN
Project home page: http://maven.sourceforge.net
Operating system(s): Windows, Mac, and Linux
Programming language: MATLAB, Perl, and C++
Other requirements: MATLAB Compiler Runtime (MCR; freely available on our web site)
License: Lesser General Public License (LGPL)
We gratefully acknowledge the support of NIH grants R01GM072014, R01GM073095, and R01GM081680.
- Levitt M, Warshel A: Computer simulation of protein folding. Nature 1975, 253: 694–698. doi:10.1038/253694a0
- McCammon JA, Gelin BR, Karplus M: Dynamics of folded proteins. Nature 1977, 267: 585–590. doi:10.1038/267585a0
- Sanbonmatsu KY, Joseph S, Tung CS: Simulating movement of tRNA into the ribosome during decoding. Proc Natl Acad Sci USA 2005, 102: 15854–15859. doi:10.1073/pnas.0503456102
- Chou KC, Maggiora GM, Mao B: Quasi-continuum models of twist-like and accordion-like low-frequency motions in DNA. Biophys J 1989, 56: 295–305. doi:10.1016/S0006-3495(89)82676-1
- Tolman JR, Flanagan JM, Kennedy MA, Prestegard JH: NMR evidence for slow collective motions in cyanometmyoglobin. Nat Struct Biol 1997, 4: 292–297. doi:10.1038/nsb0497-292
- Sinkala Z: Soliton/exciton transport in proteins. J Theor Biol 2006, 241: 919–927.
- Lewandowski JR, Sein J, Blackledge M, Emsley L: Anisotropic collective motion contributes to nuclear spin relaxation in crystalline proteins. J Am Chem Soc 2010, 132: 1246–1248. doi:10.1021/ja907067j
- Bouvignies G, Bernado P, Meier S, Cho K, Grzesiek S, Bruschweiler R, Blackledge M: Identification of slow correlated motions in proteins using residual dipolar and hydrogen-bond scalar couplings. Proc Natl Acad Sci USA 2005, 102: 13885–13890. doi:10.1073/pnas.0505129102
- Hayward S, de Groot BL: Normal Modes and Essential Dynamics. In Molecular Modeling of Proteins. Volume 443. 1st edition. Springer; 2008:89–106. doi:10.1007/978-1-59745-177-2_5
- Yang L, Song G, Carriquiry A, Jernigan RL: Close correspondence between the motions from principal component analysis of multiple HIV-1 protease structures and elastic network modes. Structure 2008, 16: 321–330. doi:10.1016/j.str.2007.12.011
- Bakan A, Bahar I: Computational generation inhibitor-bound conformers of p38 map kinase and comparison with experiments. Pac Symp Biocomput 2011, 181–192.
- Sen TZ, Feng Y, Garcia JV, Kloczkowski A, Jernigan RL: The Extent of Cooperativity of Protein Motions Observed with Elastic Network Models Is Similar for Atomic and Coarser-Grained Models. J Chem Theory Comput 2006, 2: 696–704. doi:10.1021/ct600060d
- Yang L, Song G, Jernigan RL: How well can we understand large-scale protein motions using normal modes of elastic network models? Biophys J 2007, 93: 920–929. doi:10.1529/biophysj.106.095927
- Jernigan RL, Kloczkowski A: Packing regularities in biological structures relate to their dynamics. Methods Mol Biol 2007, 350: 251–276.
- Lu M, Ma J: The role of shape in determining molecular motions. Biophys J 2005, 89: 2395–2401. doi:10.1529/biophysj.105.065904
- Go N, Scheraga HA: Analysis of Contribution of Internal Vibrations to Statistical Weights of Equilibrium Conformations of Macromolecules. Journal of Chemical Physics 1969, 51: 4751. doi:10.1063/1.1671863
- Go N, Noguti T, Nishikawa T: Dynamics of a small globular protein in terms of low-frequency vibrational modes. Proc Natl Acad Sci USA 1983, 80: 3696–3700. doi:10.1073/pnas.80.12.3696
- Hinsen K, Petrescu AJ, Dellerue S, Bellissent-Funel MC, Kneller GR: Harmonicity in slow protein dynamics. Chemical Physics 2000, 261: 25–37. doi:10.1016/S0301-0104(00)00222-6
- Riccardi D, Cui Q, Phillips GN Jr: Application of elastic network models to proteins in the crystalline state. Biophys J 2009, 96: 464–475. doi:10.1016/j.bpj.2008.10.010
- Yang L, Song G, Jernigan RL: Protein elastic network models and the ranges of cooperativity. Proc Natl Acad Sci USA 2009, 106: 12347–12352. doi:10.1073/pnas.0902159106
- Lin TL, Song G: Generalized spring tensor models for protein fluctuation dynamics and conformation changes. BMC Struct Biol 2010, 10(Suppl 1):S3. doi:10.1186/1472-6807-10-S1-S3
- Mendez R, Bastolla U: Torsional network model: normal modes in torsion angle space better correlate with conformation changes in proteins. Phys Rev Lett 2010, 104: 228103.
- Stember JN, Wriggers W: Bend-twist-stretch model for coarse elastic network simulation of biomolecular motion. J Chem Phys 2009, 131: 074112. doi:10.1063/1.3167410
- Doruker P, Jernigan RL, Bahar I: Dynamics of large proteins through hierarchical levels of coarse-grained structures. J Comput Chem 2002, 23: 119–127. doi:10.1002/jcc.1160
- Bahar I, Atilgan AR, Erman B: Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Fold Des 1997, 2: 173–181. doi:10.1016/S1359-0278(97)00024-2
- Haliloglu T, Bahar I, Erman B: Gaussian dynamics of folded proteins. Physical Review Letters 1997, 79: 3090–3093. doi:10.1103/PhysRevLett.79.3090
- Tirion MM: Large amplitude elastic motions in proteins from a single-parameter, atomic analysis. Physical Review Letters 1996, 77: 1905–1908. doi:10.1103/PhysRevLett.77.1905
- Atilgan AR, Durell SR, Jernigan RL, Demirel MC, Keskin O, Bahar I: Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys J 2001, 80: 505–515. doi:10.1016/S0006-3495(01)76033-X
- Kurkcuoglu O, Jernigan RL, Doruker P: Mixed levels of coarse-graining of large proteins using elastic network model succeeds in extracting the slowest motions. Polymer 2004, 45: 649–657. doi:10.1016/j.polymer.2003.10.071
- Kurkcuoglu O, Turgut OT, Cansu S, Jernigan RL, Doruker P: Focused functional dynamics of supramolecules by use of a mixed-resolution elastic network model. Biophys J 2009, 97: 1178–1187. doi:10.1016/j.bpj.2009.06.009
- Tama F, Sanejouand YH: Conformational change of proteins arising from normal mode calculations. Protein Eng 2001, 14: 1–6. doi:10.1093/protein/14.1.1
- Leo-Macias A, Lopez-Romero P, Lupyan D, Zerbino D, Ortiz AR: An analysis of core deformations in protein superfamilies. Biophys J 2005, 88: 1291–1299. doi:10.1529/biophysj.104.052449
- Doruker P, Jernigan RL: Functional motions can be extracted from on-lattice construction of protein structures. Proteins 2003, 53: 174–181. doi:10.1002/prot.10486
- Eyal E, Yang LW, Bahar I: Anisotropic network model: systematic evaluation and a new web interface. Bioinformatics 2006, 22: 2619–2627. doi:10.1093/bioinformatics/btl448
- Suhre K, Sanejouand YH: ElNemo: a normal mode web server for protein movement analysis and the generation of templates for molecular replacement. Nucleic Acids Res 2004, 32: W610-W614. doi:10.1093/nar/gkh368
- Zheng W, Doniach S: A comparative study of motor-protein motions by using a simple elastic-network model. Proc Natl Acad Sci USA 2003, 100: 13253–13258. doi:10.1073/pnas.2235686100
- Lindahl E, Azuara C, Koehl P, Delarue M: NOMAD-Ref: visualization, deformation and refinement of macromolecular structures based on all-atom normal mode analysis. Nucleic Acids Res 2006, 34: W52-W56. doi:10.1093/nar/gkl082
- Franklin J, Koehl P, Doniach S, Delarue M: MinActionPath: maximum likelihood trajectory for large-scale structural transitions in a coarse-grained locally harmonic energy landscape. Nucleic Acids Res 2007, 35: W477-W482. doi:10.1093/nar/gkm342
- Chennubhotla C, Bahar I: Signal propagation in proteins and relation to equilibrium fluctuations. PLoS Comput Biol 2007, 3: 1716–1726.
- The PyMOL Molecular Graphics System, Version 1.3. Schrödinger, LLC; 2011.
- Humphrey W, Dalke A, Schulten K: VMD: visual molecular dynamics. J Mol Graph 1996, 14: 33–38. doi:10.1016/0263-7855(96)00018-5
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.