"Free Energy Profile of RNA Hairpins: A Molecular Dynamics Simulation Study".
Nan-Jie Deng 1, @ , and Piotr Cieplak 2
1 School of Life Science, University of Science
and Technology of China Hefei, Anhui, China
2 Burnham Institute for Medical Research, La Jolla,
California
@ Correspondence: nanjie.deng@gmail.com
Submitted March 25, 2009, and accepted for publication October 26,
2009.
Editor: Kathleen B. Hall.
RNA hairpin loops are one of the most abundant secondary structure elements and participate in RNA folding and protein-RNA recognition. To characterize the free energy surface of RNA hairpin folding at an atomic level, we calculated the potential of mean force (PMF) as a function of the end-to-end distance, by using umbrella sampling simulations in explicit solvent. Two RNA hairpins containing tetraloop cUUCGg and cUUUUg are studied with AMBER ff99 and CHARMM27 force fields. Experimentally, the UUCG hairpin is known to be significantly more stable than UUUU. In this study, the calculations using AMBER force field give a qualitatively correct description for the folding of two RNA hairpins, as the calculated PMF confirms the global stability of the folded structures and the resulting relative folding free energy is in quantitative agreement with the experimental result. The hairpin stabilities are also correctly differentiated by the more rapid molecular mechanics-Poisson Boltzmann-surface area approach, but the relative free energy estimated from this method is overestimated. The free energy profile shows that the native state basin and the unfolded state plateau are separated by a wide shoulder region, which samples a variety of native-like structures with frayed terminal basepair. The calculated PMF lacks major barriers that are expected near the transition regions, and this is attributed to the limitation of the 1-D reaction coordinate. The PMF results are compared with other studies of small RNA hairpins using kinetics method and coarse grained models. The two RNA hairpins described by CHARMM27 are significantly more deformable than those represented by AMBER. Compared with the AMBER results, the CHARMM27 calculated Gfold for the UUUU tetraloop is in better agreement with the experimental results. However, the CHARMM27 calculation does not confirm the global stability of the experimental UUCG structure; instead, the extended conformations are predicted to be thermodynamically stable in solution. This finding is further supported by separate unrestrained CHARMM27 simulations, in which the UUCG hairpin unfolds spontaneously within 10 ns. The instability of the UUCG hairpin originates from the loop region, and propagates to the stem. The results of this study provide a molecular picture of RNA hairpin unfolding and reveal problems in the force field descriptions for the conformational energy of certain RNA hairpin.
RNA folding is believed to proceed in a hierarchical manner, as the initial rapid formation of the secondary structures is followed by slow development of tertiary structures (1). The latter involves the assembly and limited structural rearrangements of preformed secondary structure elements. The relative stabilities and the underlying energy surfaces of the secondary structures are of fundamental importance to the thermodynamics and kinetics of RNA folding (2).
Hairpin loops are a common secondary structural motif in RNA. It consists of several loop-forming nucleotides, flanked by a Watson-Crick basepaired helical stem. Hairpin loop has been suggested to act as initiation points in RNA folding (3). It is also found at recognition sites in the protein-RNA and RNA tertiary interactions (4). The stability and flexibility of RNA hairpin have been proposed as determining factors of its function. For example, the GNRA tetraloop (N: A, C, G or U; R: A or G) occurs more frequently in tertiary recognition sites than the UUCG tetraloop. One explanation for this is that the UUCG tetraloop is more rigid than the GNRA loop, and therefore it is less tolerant of the conformational changes important for binding (4,5).
Because of its importance in RNA structure and function, hairpin loop has been the subject of numerous experimental and theoretical studies aimed at understanding the atomic interactions involved in RNA folding. With a reasonable computational cost, all-atom molecular mechanics force fields provide currently the most detailed description of nucleic acids in ionic solutions (6,7,8). Computational studies using molecular dynamics (MD) simulation give important insights into the possible hairpin folding pathway (9,10,11), the effect of base substitutions (12) and the influence of loop-closing basepair on hairpin stability (13), and the role of water in hairpin folding (11). The results of these earlier theoretical investigations were summarized in Deng and Cieplak (14). In a recent study using replica exchange molecular dynamics in explicit solvent, Garcia and Paschek (15) observed the reversible folding of an 8-nt RNA hairpin UUCG starting from extended conformations. The authors also obtained the pressure temperature (P-T) free energy diagram for the hairpin loop formation and found that the RNA hairpin folding is destabilized by the increase of hydrostatic pressure. However, the calculated folding free energy of 0.65 kcal/mol at 310 K is different from the corresponding experimental value of ~1.0 kcal/mol. Villa et al. (16) studied the folding mechanisms of two 14-nt hairpin loops uCACGg and cUUCGg using replica exchange molecular dynamics. Their simulations confirmed the structural similarity of the two hairpins, and reproduced the relative melting temperature, but the absolute values of Tm are found to be 20% too high. The study showed that the folding of the uCACGg hairpin is more cooperative than that of the cUUCGg hairpin.
In this study, we are interested in characterizing the free energy profile of RNA hairpin and determining the free energy of folding. Theoretical calculation of free energy is central to our understanding of the conformational transitions between unfolded and compact states in biomolecules. The free energy of folding may be estimated using the molecular mechanics-Poisson Boltzmann-surface area (MM-PB/SA) method (17). This approach has been adopted in studying the thermal equilibrium of RNA secondary structures such as duplex, hairpin loop, and single strand (14), and the calculation correctly described the temperature dependence of the conformational equilibrium. Whereas MM-PB/SA allows rapid estimation of the fee energy, the conformational entropy associated with large structural changes is not adequately taken into account by the method. Moreover, the small folding free energy is obtained from the difference between two end point free energies, which are of large magnitudes. In this study, we compare the folding free energies calculated using MM-PB/SA and those obtained using the potential of mean force (PMF). The PMF is calculated by umbrella sampling simulations (18) in explicit solvent, with the end-to-end distance chosen as the order parameter. In contrast to the MM-PB/SA approach, here the conformational entropy is not estimated separately. Also, in the pathway approach, the free energy of folding (of the order of kcal/mol) is not calculated from the difference between two large numbers.
In recent years, the kinetics and thermodynamics of RNA hairpin folding have been the focus of a number of statistical mechanical calculations using coarse-grained models (19,20,21). In a joint experimental and theoretical study of several 8-nt RNA hairpins (21), Ma et al. showed that the thermodynamics and the temperature-dependent kinetics data can be explained by using a four-state model, derived from a sequence-specific 2-D lattice representation of RNA hairpin. Even for the short 8-nt RNA, the free energy landscape described by the four-state model is rugged, with barrier heights ranging from 2 to 9 kBT. It is of interest to compare our atomistic simulation results with the statistical mechanical calculations using low-resolution representations of RNA.
To study how free energy surface depends on hairpin sequence and
thermal stability, we calculate the PMF of two RNA tetraloop sequences
cUUCGg and cUUUUg. The UUCG hairpin is one of the best characterized
RNA tetraloop and is considerably more stable than UUUU (2).
We analyze the results from umbrella sampling to estimate folding free
energy, to identify major transitions during hairpin folding,
and compare the results with other theoretical and experimental studies
of RNA hairpin. The folding free energies are also calculated using
the more rapid MM-PB/SA approach, and the free energy contributions breakdown
in this method gives additional insights into the energetics of hairpin
folding. To examine the influence of different force fields on the
calculated free energies, we compare the results given by the two
widely used nucleic acids force fields AMBER ff99 (22)
and CHARMM27 (23).
Materials and Methods:
RNA hairpins
The conformational equilibrium of two RNA tetraloops 5-GCC(UUCG)GGC-3 and 5-CGC(UUUU)GCG-3 are studied in this work (Fig. 1). The native structure of UUCG was taken from the hairpin segment (residues 615) in the crystal structure PDB 1F7Y. The coordinates of the UUUU hairpin were built by using the hairpin segment of the RNA structure PDB 1MME as the template. To compare with the experimental results at the same temperature, all the simulations were carried out at 37C. The melting temperature for a similar 10-nt UUCG tetraloop is 69.0C (13), whereas the Tm for a 12-nt UUUU loop is between 59.2C and 60.4C (2).
Figure 1: Native structures of (a) 5'-GCC(UUCG)GGC-3' and (b) 5'-CGC(UUUU)GCG-3' hairpin loops.
Choosing a suitable reaction coordinate is the first step in a free energy profile study. Experimentally, solution NMR studies of RNA hairpins (24) indicate that RNA tetraloops adopt hairpin conformation under native conditions and become single stranded under denatured conditions. The end-to-end distance therefore provides a measure for the progress of hairpin folding and is used as the order parameter in the umbrella sampling calculation of PMF. Details of the umbrella sampling and MD simulation protocol are provided in the Supporting Material .
To ascertain the statistical significance of the calculated PMF,
three independent umbrella sampling simulations were carried out for each
of the UUCG and UUUU hairpins using the AMBER force field. Different initial
velocities were used in these simulations. A total of ~2.3 s explicit
solvent simulations have been carried out for calculating the PMF of
two hairpins using two force fields.
Results and Discussions:
To investigate the folding energetics of small RNA hairpins, we computed the PMF of end-to-end distance for the RNA hairpins UUCG and UUUU, using AMBER ff99 and CHARMM27 force fields. In the following sections, we first discuss the results obtained using AMBER force field. We then analyze the result given by CHARMM27 force field.
Before moving to the results section, here we discuss the caveats of the calculations. First, the umbrella sampling simulations are started from the folded structures; hence there may be some bias toward the native state in the calculated PMF. Second, the sampling may be insufficient for the characterization of free energy surface. Although the use of umbrella potentials in multiple windows facilitates uphill movement in the free energy surface along the reaction coordinate, the sampling in other degrees of freedom remains hindered by energy or entropy barriers in those directions. These limitations in the umbrella sampling will affect the accuracy of the PMF and may have contributed to the overestimation in the absolute folding free energy as discussed below.
Free energies calculated using AMBER ff99
Figure 2 and Figure 3 shows the PMF and the representative snapshots along the reaction coordinate obtained with AMBER ff99. To examine the statistical significance of the results, three independent sets of PMF were computed with different velocity distributions for both UUCG and UUUU. The variations between different runs are 2.5 kcal/mol for UUCG and 1.2 kcal/mol for UUUU. Although the deviations between the different sets of PMF are substantial, especially in the case of UUCG, the main features of the PMF are preserved in the different simulations. As discussed below, these features include the location and depth of the native minimum, the lack of barriers in the shoulder region, the heights of the unfolded state plateau, and the relative stability of the two hairpins.
Figure 2. Potential of mean force (PMF) as a function of the end-to-end distance for UUCG, obtained with different initial velocities using AMBER ff99.
Solid: set 1; dotted: set 2; dashed: set 3. The structures shown along with the PMF are snapshots of structures sampled by the umbrella sampling simulation. Note that because the hairpin becomes more disordered at large end-to-end distance, a significantly larger ensemble was sampled than is suggested by these snapshots at large r. The sampled conformations are better represented by these snapshots at short end-to-end extensions.
Figure 3. PMF of UUUU obtained with different initial velocities using AMBER ff99.
Solid: set 1; dotted: set 2; dashed: set 3. The snapshots of representative structures sampled by the umbrella sampling simulation are also displayed.
1. The global free energy minimum at 10.8 Å is associated with the fully folded conformations. Thus, the overall thermodynamic stability of the experimental structure is predicted by the calculated PMF. The native energy well ends at r ~ 12.4 Å, where the terminal G-C basepair frays. Based on the PMF profiles, the free energy cost of breaking the terminal basepair is estimated to be between 3.7 kcal/mol to 5.3 kcal/mol. To verify that global stability of the folded structure is not caused by the umbrella biasing potential, we carried out an unrestrained simulation with AMBER ff99 for 100 ns starting from the crystal structure. In this simulation, the average RMS deviation from the initial structure is ~ 1.46 Å (Fig. S9 , Fig. S10 , and Fig. S12 ). The helical stem is stable throughout the trajectory and the key structural features in the loop region (i.e., U5 and C6 have C2-endo ribose puckering; G7 in syn conformation) are reproduced. Thus, the result from this extended simulation supports the use of AMBER ff99 in modeling RNA hairpin.
2. The shoulder region spans from 12.4 to 27.6 Å in the end-to-end distance space. In this region the RNA molecule adopts various native-like structures, in which the terminal stem basepair is frayed, but the UUCG loop and the other two stem basepairs remain largely intact. Although the basepairing between the two terminal nucleotides is broken, their bases continue to interact with the rest of the hairpin unit through stacking and hydrogen bonding (see Fig. 2 and Fig. S1 for the snapshots of the sampled structures). These nonbonded interactions are weakened and eventually broken by the progressive increase of the end-to-end distance. The loss of these interactions contributes to the gradual energy increase within the shoulder region, which is partially offset by the conformational entropy gains due to the liberation of the two end nucleotides from stacking and hydrogen bonding. Another important factor is the solvation free energy change, which contains two opposite contributions: the nonpolar component favors compact conformations due to the hydrophobicity of the nucleotide bases, whereas the electrostatic component prefers open conformations that are better solvated by the polar solvent. The net effect of the above contributions is a 5.2 kcal/mol increase of free energy in the shoulder region. As seen from Fig. 2, none of the three free energy profiles contain major barriers. The height of the small barrier near r = 22.3 Å in the PMF set 1 is of the order of 1 kBT, which can be easily surmounted by thermal fluctuation. This points to the lack of kinetic traps in the native like shoulder region.
3. Further extension in the end-to-end distance beyond 27.6 Å induces the unfolding of the remaining hairpin structure, which includes the middle and closing stem basepairs and the UUCG loop. The fraying of these structures results from a small increase (3 Å) in the chain extension, causing a relatively steep rise in the free energy ( 4.0 kcal/mol). The reverse process is strongly downhill in free energy, indicating that the formation of the tetraloop and its nearest two stem basepairs is cooperative. Intermediate states such as hairpin loop closed by a single basepair are therefore unstable. This result is in agreement with the experimental observation that stable hairpin formation requires at least two basepairs in the stem (25). Beyond the narrow transition region (27.6 Å < r < 30.6 Å), the RNA molecule exists predominantly in the single stranded form, where all the native interactions are lost.
The PMF and the sampled structures of the UUUU hairpin are shown in Fig. 3 (see also Fig. S2 ). To facilitate comparisons with the UUCG result shown in Fig. 2, the same vertical scale is used in Fig. 3 for plotting the PMF. Broadly speaking, the free energy profiles also contain three regions: a native state basin (r < 11.6 Å), a shoulder region (11.6 Å < r < 27.8 Å) and an unfolded region (r > 30 Å). On the other hand, the boundary between the native basin and the shoulder region is not defined as clearly as in UUCG's case (see Figure 2 and Figure 3). As with UUCG, the folded structure of UUUU is also found at the global free energy minimum (r ~ 10.5 Å). A second free energy minimum was found at 8.8 Å in the PMF set 3. Because this second minimum is located close to the global free energy minimum at 10.5 Å and because it was only observed in one data set, we speculate that it might eventually merge with the global minimum with a longer simulation time. Within the intermediate region, the conformational fluctuations are dominated by the large movement of the two terminal nucleotides, whereas the rest of the hairpin structure experiences limited structural changes. The transition between the unfolded states and the native-like conformations occurs cooperatively within a narrow range between 27.8 Å and 30.1 Å. The free energy profiles of the UUUU hairpin are essentially barrierless.
The PMF results (Figure 2 and Figure 3) as a function of the end-to-end distance are comparable to a hairpin unfolding pathway in which the unfolding is started at the terminal basepair and then propagates to the loop region. Because different pathways exist and compete with each other, the PMF results do not represent the full picture of reversible unfolding and folding. In general, the unfolding pathway obtained from umbrella sampling will be dependent on the choice of reaction coordinate. Because the hairpin was pulled at the two ends in our simulation, the umbrella sampling is, to some extent, related to the single molecule end-to-end pulling experiments at slow pulling rate. Therefore the results (Figure 2 and Figure 3) may be interpreted as a force extension induced unfolding that proceeds in two unit steps, i.e., the breaking of the terminal basepair, followed by the unfolding of the loop and its two nearest basepairs. The main differences between the PMF of the two hairpins are: 1), the free energy cost of deformation in UUCG is higher than that in UUUU; and 2), the intermediate region of UUCG is better defined than that of UUUU, as the transitions in the former are sharper than those in the latter. These observations are consistent with the fact that the UUCG hairpin is more stable than the UUUU hairpin (2).
Next we compare the calculated PMF with related studies of RNA hairpin folding. Ma et al. (21) studied the kinetics of small RNA hairpins using temperature jump experiments. They found that the folding kinetics are sensitive to the temperature as well as the loop and stem sequence. At temperatures below Tm, a UUCG hairpin shows double-exponential folding kinetics, whereas a UUUU hairpin shows single exponential relaxation. These experimental observations are to some extent compatible with the calculated PMF in this study, which are also determined at T < Tm. Whereas the PMF of UUCG resembles that of a three-state system (Fig. 2), containing the native basin, the intermediate shoulder region and the unfolded ensemble, the PMF of UUUU is intermediate between a two-state and a three-state model, as the boundary between the intermediate states and the native minimum is less clear-cut. In our calculations (Figure 2 and Figure 3), the hairpin folding are essentially downhill in free energy, as no major free energy barriers are apparent in the transition regions. The reason for the smooth free energy profiles may have to do with the nature of the reaction coordinate: with a 1-D reaction coordinate, different conformations having the same end-to-end distance are combined together and appear at the same point on the reaction coordinate; hence the roughness of the conformational free energy surface may be hidden. In principle, the use of higher dimensional reaction coordinates could provide better resolution to show the energy barriers in the transition regions, but the computing cost associated with the multidimensional umbrella sampling can be prohibitive. Related to these observations, we note that in a recent replica exchange MD on a 14-nt UUCG hairpin (15), free energy barriers were indeed observed in a projected 2-D reaction coordinates surface (only at temperatures T > 400 K).
In the study of Ma et al. (21), the authors found that the temperature-dependent kinetics data can be explained by a 2-D lattice model. The distinct microstates in the 2-D lattice model are further simplified to generate a four-state model that comprises a native state (denoted as N), a state having a frayed terminal stem basepair and a native-like loop (E), a unfolded state (U) and a state containing a native stem and a nonnative loop (state S) (Fig. S8 ). Comparing the four-state model with our atomistic simulation results, we note that the state N, E, and U correspond approximately to the native basin, the intermediate shoulder and the unfolded state plateau shown in Fig. 2. However, structures that resemble the state S is not observed in our umbrella sampling simulations. The reason for this could be the following: our simulations are carried out at low temperature, which favors compact structures with low energy and low entropy. Under such conditions the state S is unstable because it contains a disordered loop.
Using the information from the PMF, we now estimate the free energy
of hairpin formation by assuming the two-state model. The calculated
folding free energy depends on how the reaction coordinate space is partitioned
between the folded and unfolded states. We use the mid-point
of the transition between the shoulder and plateau region to define
folded and unfolded states (Figure 2 and Figure
3). This location coincides with the disruption of the central stem
basepair. There is an experimental basis to this criterion for folded
and unfolded states because in both ultraviolet absorbance and solution
NMR, the signature for the melting transition arises directly from
the formation/disruption of H-bonding in basepairing. It therefore
allows a meaningful comparison with experimental free energy to
assess the quality of the force field and extent of sampling. The Gfold
is estimated as:
(Equation 1:)
where K is the equilibrium constant and p(ri)
is the probability distribution of the end-to-end distance ri.
The results of the calculation are summarized in Table 1.
The free energies calculated with AMBER ff99 are -13.5 +/- 2.0 kcal/mol
(UUCG) and -10.0 +/- 0.7 kcal/mol (UUUU). The experimentally
determined thermodynamic data for two closely related RNA hairpin sequences
are -3.7 +/- 0.2 kcal/mol (UUCG) and -1.5 +/- 0.1 kcal/mol
(UUUU) (2,13). Therefore, the
calculation significantly overestimates the absolute folding
free energies for both hairpins. However, the calculated relative
free energy DDGfold of -3.5 kcal/mol
between the two hairpins is in good agreement with the experimental value
of -2.2 kcal/mol (2). We note that, if the native state
basins in the PMF (Figure 2 and Figure
3) are chosen to define the folded and unfolded states, then
the calculated absolute free energies DGfold
would be significantly less overestimated, whereas the relative free energy
DDGfold would be essentially unaffected.
Table 1: Folding free energies of RNA hairpin UUCG and
UUUU calculated with AMBER ff99.
DGfold is the average
result calculated using three independent PMF. The error was estimated
from calculations using the first and second half of the trajectory of
each umbrella sampling window.
Experimental result is for the 5'-CGC(UUCG)GCG-3' tetraloop
(13).
Estimated from the experimental results of DDGfold
between two 12-nt UUCG and UUUU tetraloops (2).
What is the underlying cause for the overestimation of absolute folding free energies calculated by the PMF approach (Table 1)? Because DGfold = DHfold - TDSfold, there are two possibilities: 1), the intramolecular interaction given by the force field may be too attractive, which will give a overly negative enthalpy of folding DHfold; or 2), the limited sampling in our simulations may cause the configurations of the unfolded states to be inadequately explored and underweighted, and this will lead to a too small TDSfold. At this point, it is not yet clear which of the two factors is most responsible for the overestimated absolute folding free energy.
The folding free energies were also estimated using the MM-PB/SA method. With this approach, two unrestrained 6-ns MD simulations in explicit TIP3P water were carried out, one starting from the folded structure and the other from single stranded form. During these simulations, the RNA molecules remained in their respective initial conformational ensembles. The main contribution to the folding free energy comes from the difference between ensemble averaged effective energies (the sum of intramolecular energy and solvation free energy, with the latter determined by solving the Poisson-Boltzmann equation plus a solvent accessible surface area term) of the two trajectories. The vibrational entropy contribution estimated by the normal mode analysis is also included. The detailed breakdown of contributions is given in Table S1 and Table S2 . The DGfold predicted by the MM-PB/SA are 12.6 kcal/mol and 2.8 kcal/mol for UUCG and UUUU, respectively (Table 1). Thus, the calculations also correctly differentiate the thermodynamic stabilities of the two hairpins, while overestimating the relative stability. We noted that the electrostatic solvation free energy component is sensitive to the atomic radii used in the Poisson-Boltzmann calculation. The results given in Table 1 are obtained with the van der Waals radii from the force field. Using the PARSE radii set (26) destabilizes both hairpins by 9 kcal/mol (data not shown), which would predict a positive folding free energy for the UUUU hairpin. Because the force field van der Waals radii yields the correct sign for folding free energy, it is more appropriate for characterizing the electrostatic solvation effect in these RNA hairpins.
Besides estimating the free energy difference, the MM-PB/SA calculations also provide insights into the energetics of hairpin folding. As seen from Table S1 and Table S2 , the folding is driven by the solute van der Waals interactions, and to a lesser extent, by the nonpolar solvation term. Both the solute strain energy and vibrational entropy are unfavorable to the hairpin formation. The reduction of the vibrational entropy on folding indicates that the folded molecule resides in narrow and steep energy wells compared with the unfolded molecule. The contribution from the overall electrostatics varies with molecular system: the overall electrostatics contribution DG(total_elec) stabilizes the native conformation in UUCG, but disfavors the folding of UUUU. It should be noted that the entropy associated with jumping between energy wells are not accounted for in MM-PB/SA. This is because the only solute entropy term considered by the MM-PB/SA calculation is the vibrational entropy contribution, which is calculated by assuming that the solute molecule moves within a single harmonic energy well, which is clearly incorrect. The inadequate account for entropy in this method could be the main reason for the overestimation of the relative folding free energy between the two hairpins.
PMF calculated using CHARMM27
The PMF and the associated structural snapshots obtained with the CHARMM27 force field are presented in Figure 4 and Figure 5. For a better comparison with the AMBER results, the PMF are shown with the same vertical scale as those of Figure 2 and Figure 3. The structural snapshots sampled during the umbrella sampling simulations are also shown with more detail in Fig. S3 and Fig. S4 . These results indicate that the RNA molecules described by CHARMM27 are more flexible than those modeled by AMBER, as the free energy cost of deformation calculated by CHARMM27 is significantly smaller. As seen from Figure 4 and Figure 5, the free energies are within 4.5 kcal/mol from the ground state level, whereas in Figure 2 and Figure 3 the free energies of the unfolded states are 10 kcal/mol higher than those of the folded states. This suggests that the two force fields differ in the balance of intrasolute interaction and solute-solvent interaction in the RNA molecules studied here.
Figure 4: PMF of UUCG calculated using CHARMM27 and the snapshots of representative structures sampled by the umbrella sampling simulations.
Figure: 5 PMF of UUUU calculated using CHARMM27 and the snapshots of representative structures sampled by the umbrella sampling simulation.
Figure 6: Time series of the heavy atom RMS difference relative to the native structure of UUCG.
The three unrestrained MD simulations were started from the experimental structure and lasted for 7 ns and 10 ns. Solid lines: two CHARMM27 trajectories (7 ns and 10 ns); dotted line: an AMBER ff99 trajectory (10 ns).
Figure 7: CHARMM27 simulation time series of RMS difference relative to the crystal structure of UUCG.
(a) 7 ns trajectory; (b) 10 ns trajectory.
Solid line: loop region; dotted line: stem.
In this study, the potential of mean force of RNA hairpins UUCG and UUUU has been determined in explicit ionic solvent using the end-to-end distance as the order parameter. The energetics of the hairpin folding has also been studied using the MM-PB/SA approach. We have also investigated the force field dependency of the free energy calculation by comparing the results obtained from AMBER ff99 and CHARMM27.
The AMBER results described correctly the physical interactions in the two hairpins because both the native state stability and the relative free energy are reproduced. The PMF profiles show that the native state basin is separated from the unfolded state plateau by a wide shoulder region, which samples a variety of native-like structures with frayed terminal basepair. The presence of the unfolded state plateau and the intermediate shoulder region can be viewed as entropic barriers that need to be overcome by diffusional search (biased toward the native state) in a hairpin folding pathway in which the formation of the loop region occurs before the formation of the stem. There are differences in the details of the PMF between the two hairpins: 1), the free energy cost of unfolding for UUCG is higher than that for UUUU; and 2), the boundary between the native states and the intermediate shoulder region in the UUCG PMF is clearly defined, whereas the same boundary in the UUUU PMF is less clear-cut. These properties of the calculated PMF are consistent with the thermodynamic data on the relative stability of the two hairpins and some aspects of the kinetic experiments. For example, at low temperatures, the free energy profile of UUCG is more compatible with a three-state system, whereas that of UUUU resembles a two-state system. However, the calculated PMF lacks major barriers that could be expected in transition regions, and we think that this is related to the use of 1-D reaction coordinate, which does not have the resolution to fully describe the roughness of the underlying free energy surface. Additional insights into the energetics of hairpin folding are gained by MM-PB/SA calculations through its breakdown of energy contributions. Here, information provided by MM-PB/SA is mainly enthalpic because it is very limited in correctly estimating the conformational entropy change associated with large structural transitions. MM-PB/SA gives correct ranking order of stability, but its estimation of relative free energy is not as good as that given by PMF.
Although the relative folding free energy is quantitatively reproduced by our PMF calculation using the AMBER force field, the absolute folding free energies are overestimated for both hairpins. This error is likely to originate from the sampling limitation, but it may also come from the AMBER force field. The limitations in sampling can cause the conformational entropy of the unfolded states to be underestimated, which leads to the overstabilization of the folded state. Explorations aimed at enhancing the sampling are being investigated currently.
The results of CHARMM27 are qualitatively different from those
obtained with AMBER, in terms of both the free energy profile and
the conformational property. The RNA hairpins modeled by CHARMM27
are significantly less rigid, which probably led to more complete
sampling of the conformational space and the more accurate estimate of
the folding free energy for UUUU. Although the structural stability of
the UUUU hairpin is maintained in CHARMM27, the global stability
of the UUCG hairpin is not reproduced by the PMF calculated. Separate simulations
using CHARMM27 show that the force field related instability of the UUCG
hairpin originates from the loop region, and propagates to the
stem region after several nanoseconds. Consequently, the problem may
not be as pronounced in shorter simulations and/or with longer stem regions.
Acknowledgments:
We thank Prof. Yunyu Shi for her support of this work; Prof. C. L. Brooks III for very helpful discussions; Prof. Haiyan Liu for providing the computing facility; and Mr. Jian Zhan for help maintaining the Linux clusters on which the calculations are carried out.
http://www.cell.com/biophysj/supplemental/S0006-3495(09)01682-8
Document S1. Umbrella sampling, MD protocol, three tables,
and 16 figures (PDF 1076 kb)
1. Sheehy JP, Davis AR, and Znosko BM,
"Thermodynamic
characterization of naturally occurring RNA tetraloops".
2. Schudoma C, May P, Nikiforova V, and Walther D,
"Sequence–structure
relationships in RNA loops: establishing the basis for loop homology modeling".
3. Nicodemi M, and Prisco A,
"Thermodynamic
Pathways to Genome Spatial Organization in the Cell Nucleus".
1. Each cell retains all of its embryonic genes for a lifetime.
2. Controls for embryonic genes are often absent in adults.
3. Uncontrolled embryonic genes can replicate wildly.
4. Replicating genes participate in intra-cellular competition.
5. The basis for gene competition is selective transcription.
6. MicroRNAs can reprogram embryomic transcription.
7. Gene reprogramming can produce normal phenotypes.
8. Normal phenotypes can by-pass chromosomal lesions.
9. MicroRNA therapy may need to be permanent.
10. Transplantation of microRNAs could be preferred.
1. Pathways within cell genomes involve a flow of information.
2. Information can flow by direct contact or by third parties.
3. Direct contact within whole genomes is difficult to regulate.
4. DNA-DNA direct contects are influenced by agents.
5. Nuclear agents include hydrophilic ionic and hydrophobic conforming ligands.
6. Third parties within genomes involve RNAs and proteins.
7. RNAs and proteins are easy to regulate or reverse.
8. Information can be shared, lost, or transformed.
9. System information can be hidden during system isolation.
10. Local information can be permanently lost during system entropy.
Links to Current
Research in Euchromatin:
Links to
Euchromatin Activator RNA Reviews:
Links to
Euchromatin Activator RNA Research:
Links to Ultrastructural
Probes of DNase I-Sensitive Sites:
Links to
RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma
Immuno-Pathology:
Links to Activated
T-Lymphocyte Immunotherapy:
Links to Medical
Systems Biology:
Links to Selective
Gene Transcription:
Links to RNA-Induced
Epigenetics:
Links to RNA-Induced
Embryogenesis:
Links to RNA and
Biological Causality:
Links to Reprogramming
and Neoplasia:
A Brief History of Activator RNA:
"Ultrastructural
Probes of Active DNA Sites, and the RNA Activators of DNA".
(PowerPoint Presentation).
Top of Page - Euchromatin
Network - Euchromatin
Research - Research
in Quantitative Radiology
For Further Information and Feedback:
Jeannette A. Hovsepian, M.D.
E-mail: frensasc@ix.netcom.com
Phone: +1 650 367 6483