Restraints

Special potentials are used for imposing restraints on the motion of the system, either to avoid disastrous deviations, or to include knowledge from experimental data. In either case they are not really part of the force field and the reliability of the parameters is not important. The potential forms, as implemented in GROMACS, are mentioned just for the sake of completeness. Restraints and constraints refer to quite different algorithms in GROMACS.

Position restraints

These are used to restrain particles to fixed reference positions Ri. They can be used during equilibration in order to avoid drastic rearrangements of critical parts (e.g. to restrain motion in a protein that is subjected to large solvent forces when the solvent is not yet equilibrated). Another application is the restraining of particles in a shell around a region that is simulated in detail, while the shell is only approximated because it lacks proper interaction from missing particles outside the shell. Restraining will then maintain the integrity of the inner part. For spherical shells, it is a wise procedure to make the force constant depend on the radius, increasing from zero at the inner boundary to a large value at the outer boundary. This feature has not, however, been implemented in GROMACS.

The following form is used:

(206)Vpr(ri)=12kpr|riRi|2

The potential is plotted in Fig. 29.

../../_images/f-pr.png

Fig. 29 Position restraint potential.

The potential form can be rewritten without loss of generality as:

(207)Vpr(ri)=12[kprx(xiXi)2 x^+kpry(yiYi)2 y^+kprz(ziZi)2 z^]

Now the forces are:

(208)Fix=kprx (xiXi)Fiy=kpry (yiYi)Fiz=kprz (ziZi)

Using three different force constants the position restraints can be turned on or off in each spatial dimension; this means that atoms can be harmonically restrained to a plane or a line. Position restraints are applied to a special fixed list of atoms. Such a list is usually generated by the pdb2gmx program.

Note that position restraints make the potential dependent on absolute coordinates in space. Therefore, in general the pressure (and virial) is not well defined, as the pressure is the derivative of the free-energy of the system with respect to the volume. When the reference coordinates are scaled along with the system, which can be selected with the mdp option refcoord-scaling=all, the pressure and virial are well defined.

Flat-bottomed position restraints

Flat-bottomed position restraints can be used to restrain particles to part of the simulation volume. No force acts on the restrained particle within the flat-bottomed region of the potential, however a harmonic force acts to move the particle to the flat-bottomed region if it is outside it. It is possible to apply normal and flat-bottomed position restraints on the same particle (however, only with the same reference position Ri). The following general potential is used (Figure 30 A):

(209)Vfb(ri)=12kfb[dg(ri;Ri)rfb]2H[dg(ri;Ri)rfb],

where Ri is the reference position, rfb is the distance from the center with a flat potential, kfb the force constant, and H is the Heaviside step function. The distance dg(ri;Ri) from the reference position depends on the geometry g of the flat-bottomed potential.

../../_images/fbposres.png

Fig. 30 Flat-bottomed position restraint potential. (A) Not inverted, (B) inverted.

The following geometries for the flat-bottomed potential are supported:
Sphere (g=1): The particle is kept in a sphere of given radius. The force acts towards the center of the sphere. The following distance calculation is used:
(210)dg(ri;Ri)=|riRi|
Cylinder (g=6,7,8): The particle is kept in a cylinder of given radius parallel to the x (g=6), y (g=7), or z-axis (g=8). For backwards compatibility, setting g=2 is mapped to g=8 in the code so that old tpr files and topologies work. The force from the flat-bottomed potential acts towards the axis of the cylinder. The component of the force parallel to the cylinder axis is zero. For a cylinder aligned along the z-axis:
(211)dg(ri;Ri)=(xiXi)2+(yiYi)2
Layer (g=3,4,5): The particle is kept in a layer defined by the thickness and the normal of the layer. The layer normal can be parallel to the x, y, or z-axis. The force acts parallel to the layer normal.
(212)dg(ri;Ri)=|xiXi|,ordg(ri;Ri)=|yiYi|,ordg(ri;Ri)=|ziZi|.

It is possible to apply multiple independent flat-bottomed position restraints of different geometry on one particle. For example, applying a cylinder and a layer in z keeps a particle within a disk. Applying three layers in x, y, and z keeps the particle within a cuboid.

In addition, it is possible to invert the restrained region with the unrestrained region, leading to a potential that acts to keep the particle outside of the volume defined by Ri, g, and rfb. That feature is switched on by defining a negative rfb in the topology. The following potential is used (Figure 30 B):

(213)Vfbinv(ri)=12kfb[dg(ri;Ri)|rfb|]2H[(dg(ri;Ri)|rfb|)].

Angle restraints

These are used to restrain the angle between two pairs of particles or between one pair of particles and the z-axis. The functional form is similar to that of a proper dihedral. For two pairs of atoms:

(214)Var(ri,rj,rk,rl)=kar(1cos(n(θθ0))),    where  θ=arccos(rjrirjrirlrkrlrk)

For one pair of atoms and the z-axis:

(215)Var(ri,rj)=kar(1cos(n(θθ0))),    where  θ=arccos(rjrirjri(001))

A multiplicity (n) of 2 is useful when you do not want to distinguish between parallel and anti-parallel vectors. The equilibrium angle θ should be between 0 and 180 degrees for multiplicity 1 and between 0 and 90 degrees for multiplicity 2.

Dihedral restraints

These are used to restrain the dihedral angle ϕ defined by four particles as in an improper dihedral (sec. Improper dihedrals) but with a slightly modified potential. Using:

(216)ϕ=(ϕϕ0) MOD 2π

where ϕ0 is the reference angle, the potential is defined as:

(217)Vdihr(ϕ) = {12kdihr(ϕΔϕ)2forϕ>Δϕ0forϕΔϕ

where Δϕ is a user defined angle and kdihr is the force constant. Note that in the input in topology files, angles are given in degrees and force constants in kJ/mol/rad2.

Distance restraints

Distance restraints add a penalty to the potential when the distance between specified pairs of atoms exceeds a threshold value. They are normally used to impose experimental restraints from, for instance, experiments in nuclear magnetic resonance (NMR), on the motion of the system. Thus, MD can be used for structure refinement using NMR data. In GROMACS there are three ways to impose restraints on pairs of atoms:

  • Simple harmonic restraints: use [ bonds ] type 6 (see sec. Exclusions).

  • Piecewise linear/harmonic restraints: [ bonds ] type 10.

  • Complex NMR distance restraints, optionally with pair, time and/or ensemble averaging.

The last two options will be detailed now.

The potential form for distance restraints is quadratic below a specified lower bound and between two specified upper bounds, and linear beyond the largest bound (see Fig. 31).

(218)Vdr(rij) = {12kdr(rijr0)2forrij<r00forr0rij<r112kdr(rijr1)2forr1rij<r212kdr(r2r1)(2rijr2r1)forr2rij
../../_images/f-dr.png

Fig. 31 Distance Restraint potential.

The forces are

(219)Fi = {kdr(rijr0)rijrijforrij<r00forr0rij<r1kdr(rijr1)rijrijforr1rij<r2kdr(r2r1)rijrijforr2rij

For restraints not derived from NMR data, this functionality will usually suffice and a section of [ bonds ] type 10 can be used to apply individual restraints between pairs of atoms, see Topology file. For applying restraints derived from NMR measurements, more complex functionality might be required, which is provided through the [ distance_restraints ] section and is described below.

Time averaging

Distance restraints based on instantaneous distances can potentially reduce the fluctuations in a molecule significantly. This problem can be overcome by restraining to a time averaged distance 91. The forces with time averaging are:

(220)Fi = {kdra(r¯ijr0)rijrijforr¯ij<r00forr0r¯ij<r1kdra(r¯ijr1)rijrijforr1r¯ij<r2kdra(r2r1)rijrijforr2r¯ij

where r¯ij is given by an exponential running average with decay time τ:

(221)r¯ij = <rij3>1/3

The force constant kdra is switched on slowly to compensate for the lack of history at the beginning of the simulation:

(222)kdra=kdr(1exp(tτ))

Because of the time averaging, we can no longer speak of a distance restraint potential.

This way an atom can satisfy two incompatible distance restraints on average by moving between two positions. An example would be an amino acid side-chain that is rotating around its χ dihedral angle, thereby coming close to various other groups. Such a mobile side chain can give rise to multiple NOEs that can not be fulfilled by a single structure.

The computation of the time averaged distance in the mdrun program is done in the following fashion:

(223)r3ij(0)=rij(0)3r3ij(t)=r3ij(tΔt) exp(Δtτ)+rij(t)3[1exp(Δtτ)]

When a pair is within the bounds, it can still feel a force because the time averaged distance can still be beyond a bound. To prevent the protons from being pulled too close together, a mixed approach can be used. In this approach, the penalty is zero when the instantaneous distance is within the bounds, otherwise the violation is the square root of the product of the instantaneous violation and the time averaged violation:

(224)Fi = {kdra(rijr0)(r¯ijr0)rijrijforrij<r0andr¯ij<r0kdramin((rijr1)(r¯ijr1),r2r1)rijrijforrij>r1andr¯ij>r10otherwise

Averaging over multiple pairs

Sometimes it is unclear from experimental data which atom pair gives rise to a single NOE, in other occasions it can be obvious that more than one pair contributes due to the symmetry of the system, e.g. a methyl group with three protons. For such a group, it is not possible to distinguish between the protons, therefore they should all be taken into account when calculating the distance between this methyl group and another proton (or group of protons). Due to the physical nature of magnetic resonance, the intensity of the NOE signal is inversely proportional to the sixth power of the inter-atomic distance. Thus, when combining atom pairs, a fixed list of N restraints may be taken together, where the apparent “distance” is given by:

(225)rN(t)=[n=1Nr¯n(t)6]1/6

where we use rij or (221) for the r¯n. The rN of the instantaneous and time-averaged distances can be combined to do a mixed restraining, as indicated above. As more pairs of protons contribute to the same NOE signal, the intensity will increase, and the summed “distance” will be shorter than any of its components due to the reciprocal summation.

There are two options for distributing the forces over the atom pairs. In the conservative option, the force is defined as the derivative of the restraint potential with respect to the coordinates. This results in a conservative potential when time averaging is not used. The force distribution over the pairs is proportional to r6. This means that a close pair feels a much larger force than a distant pair, which might lead to a molecule that is “too rigid.” The other option is an equal force distribution. In this case each pair feels 1/N of the derivative of the restraint potential with respect to rN. The advantage of this method is that more conformations might be sampled, but the non-conservative nature of the forces can lead to local heating of the protons.

It is also possible to use ensemble averaging using multiple (protein) molecules. In this case the bounds should be lowered as in:

(226)r1 = r1M1/6r2 = r2M1/6

where M is the number of molecules. The GROMACS preprocessor grompp can do this automatically when the appropriate option is given. The resulting “distance” is then used to calculate the scalar force according to:

(227)Fi = { 0rN<r1kdr(rNr1)rijrijr1rN<r2kdr(r2r1)rijrijrNr2

where i and j denote the atoms of all the pairs that contribute to the NOE signal.

Using distance restraints

A list of distance restrains based on NOE data can be added to a molecule definition in your topology file, like in the following example:

[ distance_restraints ]
; ai   aj   type   index   type'      low     up1     up2     fac
10     16      1       0       1      0.0     0.3     0.4     1.0
10     28      1       1       1      0.0     0.3     0.4     1.0
10     46      1       1       1      0.0     0.3     0.4     1.0
16     22      1       2       1      0.0     0.3     0.4     2.5
16     34      1       3       1      0.0     0.5     0.6     1.0

In this example a number of features can be found. In columns ai and aj you find the atom numbers of the particles to be restrained. The type column should always be 1. As explained in  Distance restraints, multiple distances can contribute to a single NOE signal. In the topology this can be set using the index column. In our example, the restraints 10-28 and 10-46 both have index 1, therefore they are treated simultaneously. An extra requirement for treating restraints together is that the restraints must be on successive lines, without any other intervening restraint. The type’ column will usually be 1, but can be set to 2 to obtain a distance restraint that will never be time- and ensemble-averaged; this can be useful for restraining hydrogen bonds. The columns low, up1, and up2 hold the values of r0, r1, and r2 from  (218). In some cases it can be useful to have different force constants for some restraints; this is controlled by the column fac. The force constant in the parameter file is multiplied by the value in the column fac for each restraint. Information for each restraint is stored in the energy file and can be processed and plotted with gmx nmr.

Orientation restraints

This section describes how orientations between vectors, as measured in certain NMR experiments, can be calculated and restrained in MD simulations. The presented refinement methodology and a comparison of results with and without time and ensemble averaging have been published 92.

Theory

In an NMR experiment, orientations of vectors can be measured when a molecule does not tumble completely isotropically in the solvent. Two examples of such orientation measurements are residual dipolar couplings (between two nuclei) or chemical shift anisotropies. An observable for a vector ri can be written as follows:

(228)δi=23tr(SDi)

where S is the dimensionless order tensor of the molecule. The tensor Di is given by:

(229)Di=ciriα(3xx13xy3xz3xy3yy13yz3xz3yz3zz1)
(230)with:x=ri,xri,y=ri,yri,z=ri,zri

For a dipolar coupling ri is the vector connecting the two nuclei, α=3 and the constant ci is given by:

(231)ci=μ04πγ1iγ2i4π

where γ1i and γ2i are the gyromagnetic ratios of the two nuclei.

The order tensor is symmetric and has trace zero. Using a rotation matrix T it can be transformed into the following form:

(232)TTST=s(12(1η)00012(1+η)0001)

where 1s1 and 0η1. s is called the order parameter and η the asymmetry of the order tensor S. When the molecule tumbles isotropically in the solvent, s is zero, and no orientational effects can be observed because all δi are zero.

Calculating orientations in a simulation

For reasons which are explained below, the D matrices are calculated which respect to a reference orientation of the molecule. The orientation is defined by a rotation matrix R, which is needed to least-squares fit the current coordinates of a selected set of atoms onto a reference conformation. The reference conformation is the starting conformation of the simulation. In case of ensemble averaging, which will be treated later, the structure is taken from the first subsystem. The calculated Dic matrix is given by:

(233)Dic(t)=R(t)Di(t)RT(t)

The calculated orientation for vector i is given by:

(234)δic(t)=23tr(S(t)Dic(t))

The order tensor S(t) is usually unknown. A reasonable choice for the order tensor is the tensor which minimizes the (weighted) mean square difference between the calculated and the observed orientations:

(235)MSD(t)=(i=1Nwi)1i=1Nwi(δic(t)δiexp)2

To properly combine different types of measurements, the unit of wi should be such that all terms are dimensionless. This means the unit of wi is the unit of δi to the power 2. Note that scaling all wi with a constant factor does not influence the order tensor.

Time averaging

Since the tensors Di fluctuate rapidly in time, much faster than can be observed in an experiment, they should be averaged over time in the simulation. However, in a simulation the time and the number of copies of a molecule are limited. Usually one can not obtain a converged average of the Di tensors over all orientations of the molecule. If one assumes that the average orientations of the ri vectors within the molecule converge much faster than the tumbling time of the molecule, the tensor can be averaged in an axis system that rotates with the molecule, as expressed by (233)). The time-averaged tensors are calculated using an exponentially decaying memory function:

(236)Dia(t)=u=t0tDic(u)exp(tuτ)duu=t0texp(tuτ)du

Assuming that the order tensor S fluctuates slower than the Di, the time-averaged orientation can be calculated as:

(237)δia(t)=23tr(S(t)Dia(t))

where the order tensor S(t) is calculated using expression (235) with δic(t) replaced by δia(t).

Restraining

The simulated structure can be restrained by applying a force proportional to the difference between the calculated and the experimental orientations. When no time averaging is applied, a proper potential can be defined as:

(238)V=12ki=1Nwi(δic(t)δiexp)2

where the unit of k is the unit of energy. Thus the effective force constant for restraint i is kwi. The forces are given by minus the gradient of V. The force Fi working on vector ri is:

(239)Fi(t)=dVdri=kwi(δic(t)δiexp)dδi(t)dri=kwi(δic(t)δiexp)2cir2+α(2RTSRri2+αr2tr(RTSRririT)ri)

Ensemble averaging

Ensemble averaging can be applied by simulating a system of M subsystems that each contain an identical set of orientation restraints. The systems only interact via the orientation restraint potential which is defined as:

(240)V=M12ki=1Nwiδic(t)δiexp2

The force on vector ri,m in subsystem m is given by:

(241)Fi,m(t)=dVdri,m=kwiδic(t)δiexpdδi,mc(t)dri,m

Time averaging

When using time averaging it is not possible to define a potential. We can still define a quantity that gives a rough idea of the energy stored in the restraints:

(242)V=M12kai=1Nwiδia(t)δiexp2

The force constant ka is switched on slowly to compensate for the lack of history at times close to t0. It is exactly proportional to the amount of average that has been accumulated:

(243)ka=k1τu=t0texp(tuτ)du

What really matters is the definition of the force. It is chosen to be proportional to the square root of the product of the time-averaged and the instantaneous deviation. Using only the time-averaged deviation induces large oscillations. The force is given by:

(244)Fi,m(t)={0forab0kawia|a|abdδi,mc(t)dri,mforab>0
(245)a=δia(t)δiexpb=δic(t)δiexp

Using orientation restraints

Orientation restraints can be added to a molecule definition in the topology file in the section [ orientation_restraints ]. Here we give an example section containing five N-H residual dipolar coupling restraints:

[ orientation_restraints ]
; ai   aj  type  exp.  label  alpha    const.     obs.   weight
;                                Hz      nm^3       Hz    Hz^-2
  31   32     1     1      3      3     6.083    -6.73      1.0
  43   44     1     1      4      3     6.083    -7.87      1.0
  55   56     1     1      5      3     6.083    -7.13      1.0
  65   66     1     1      6      3     6.083    -2.57      1.0
  73   74     1     1      7      3     6.083    -2.10      1.0

The unit of the observable is Hz, but one can choose any other unit. In columns ai and aj you find the atom numbers of the particles to be restrained. The type column should always be 1. The exp. column denotes the experiment number, starting at 1. For each experiment a separate order tensor S is optimized. The label should be a unique number larger than zero for each restraint. The alpha column contains the power α that is used in (229)) to calculate the orientation. The const. column contains the constant ci used in the same equation. The constant should have the unit of the observable times nmα. The column obs. contains the observable, in any unit you like. The last column contains the weights wi; the unit should be the inverse of the square of the unit of the observable.

Some parameters for orientation restraints can be specified in the grompp mdp file, for a study of the effect of different force constants and averaging times and ensemble averaging see 92. Information for each restraint is stored in the energy file and can be processed and plotted with gmx nmr.