Introduction to Auto-Docking The ability to predict the conformation of a ligand in a protein or nucleic acid binding site is of central importance in pharmaceutical design. Detailed knowledge of the molecular recognition forces is necessary for designing new ligands. Knowing the binding site conformation helps to show the important interactions that stabilize the complex. Automated procedures are very helpful in exploring a wide variety of binding modes in complex systems. The advent of rapid parallel binding assays through combinatorial chemistry and high throughput screening also motivates a similar need for efficient, rapid computational screening procedures. The ability to computationally predict possible tight-binding ligands lowers the time and energy necessary to plan a major combinatorial study. One approach to auto-docking is to use molecular mechanics techniques to minimize the energy of a ligand in a binding pocket starting from a series of random positions and orientations within the binding site. Finding the low energy binding conformations will then help to highlight the interactions that are responsible for tight binding. For example, the hydrogen bonds in the low energy complexes can be displayed. Electrostatic surfaces of the binding pocket and the ligand can also be displayed to observe how the distribution of charge in the ligand and binding pocket complement each other. Several methods have been developed to efficiently sample possible binding conformations. The steps in docking are outlined below. The complete step-by-step instructions follow: Docking Summary: 1. Select the ligand. 2. Build the Docking box to delineate the region in the binding pocket to place the ligand. 3. The program then does the calculations in two steps: Step 1: Generate random starting orientations. Step 2: Use random small moves to find a low energy conformation. 4. Extract the lowest energy conformer (or a few low energy conformers). 5. Minimize the resulting models using normal molecular mechanics. Placement Methodology The Placement phase of Dock generates poses from ligand conformations. The available methods are: Method Description Alpha Triangle Poses are generated by superposition of ligand atom triplets and triplets of receptor site points. The receptor site points are alpha sphere centers which represent locations of tight packing. At each iteration a random conformation is selected, a random triplet of ligand atoms and a random triplet of alpha sphere centers are used to determine the pose. Alpha PMI Poses are generated by aligning ligand conformations' principal moments of inertia to a randomly generated subset of alpha spheres in the receptor site. This method is most suited to tight pockets, is quite fast and the search space is greatly truncated. Flex* Flex* is a third party docking program, which is implemented in MOE as if it were a type of placement. A separate license from BioSolveIT is required to run Flex*. Configuration and operation of Flex* is described later. Flex* uses an incremental fragment growth strategy to achieve its function. Pharmacophore Information from the pharmacophore is directly used to place the ligand in the binding site. This is the preferred method when a pharmacophore is present, but will not work if there is no pharmacophore. Proxy Triangle This method was developed for tackling medium to large sized ligands which have a very large number of conformers. For small ligands, the Triangle Matcher method will be called. For medium and large ligands, conformers are pre-superposed prior to being placed into the binding site, thus saving computational time. For large ligands, the score for internal ranking will use atom representatives instead of all the ligand atoms. More time is allowed for larger ligands. Poses are generated by aligning ligand triplets of atoms on Triangle Matcher triplets of alpha spheres in a more systematic way than in the (default) Alpha Triangle method. None This placement method is a special pass-through technique: the initial coordinates are emitted unmodified. Choose this method when you wish to use Dock's refinement and scoring tools to postprocess already docked poses, either from an earlier Dock run or from other sources. Note! While the placement stage produces identical results under identical set up conditions, it is normal for the output to vary from run to run if there is a refinement stage. This is because of the irreproducible nature of the energy minimization process. Scoring For all scoring functions, lower scores indicate more favourable poses. The unit for all scoring functions is kcal / mol. ASE Scoring. This score is proportional to the sum of the Gaussians R1R2exp(-0.5d2) over all ligand atom - receptor atom pairs and ligand atom - alpha sphere pairs. R1 and R2 are the radii of the atoms in Å, or is -1.85 for alpha spheres. d is the distance between the pair in Å. The proportionality constant is default to be 0.035 kcal / mol but can be changed upon pressing the Configure... button. Affinity dG Scoring. This function estimates the enthalpic contribution to the free energy of binding using a linear function: G = Chb fhb + Cion fion + Cmlig fmlig + Chh fhh + Chp fhp + Caa faa where the f terms fractionally count atomic contacts of specific types and the C's are coefficients that weight the term contributions to the affinity estimate. The individual terms are: Term hb Description Interactions between hydrogen bond donor-acceptor pairs. An optimistic view is taken; for example, two hydroxyl groups are assumed to interact in the most favorable way. ion Ionic interactions. A Coulomb-like term is used to evaluate the interactions between charged groups. This can contribute to or detract from binding affinity. mlig Metal ligation. Interactions between Nitrogens/Sulfurs and transition metals are assumed to be metal ligation interactions. hh Hydrophobic interactions, for example, between alkane carbons; these interactions are generally favorable. hp Interactions between hydrophobic and polar atoms; these interactions are generally unfavorable. aa An interaction between any two atoms. This interaction is weak and generally favorable. The coefficients may be adjusted by pressing the Configure... button on the Dock panel. Alpha HB Scoring. This score is a linear combination of two terms. The first term measures the geometric fit of the ligand to the binding site. The second term measures hydrogen bonding effects. Both terms are summed over all ligand atoms. The weights of the two terms may be adjusted through a panel upon pressing the Configure... button. The first term has an attractive part and a repulsive part. The attractive part is summed over atoms in the ligand. Each ligand atom that is within 3 Å of an alpha sphere center contributes A exp(-0.5d2) to this term, where d is the distance from the ligand atom to the nearest alpha sphere center, and A assumes the value of -0.6845. The repulsive part is summed over all pairs of atomic overlap between the ligand and the receptor. For each pair of overlap, the contribution is between 0 and 1 depending on the severity of the overlap. The second term measures hydrogen bond effects. For non-sp3 donors and acceptors, hydrogen bonding sites are projected from the atom. If the site is occupied by a favorable atom, there is a score of -2 (negative means favorable). Otherwise if it is occupied by some other ligand atom, there is a score of +1. For sp3 donors and acceptors, all favorable atoms within 3.5 Å contribute a score of -1 while all other atoms contribute +1. Metals in the receptor are treated as acceptors but with a three-fold effect. London dG Scoring (default). The London dG scoring function estimates the free energy of binding of the ligand from a given pose. The functional form is a sum of terms: where c represents the average gain/loss of rotational and translational entropy; Eflex is the energy due to the loss of flexibility of the ligand (calculated from ligand topology only); fHB measures geometric imperfections of hydrogen bonds and takes a value in [0,1]; cHB is the energy of an ideal hydrogen bond; fM measures geometric imperfections of metal ligations and takes a value in [0,1]; cM is the energy of an ideal metal ligation; and Di is the desolvation energy of atom i. The difference in desolvation energies is calculated according to the formula where A and B are the protein and/or ligand volumes with atom i belonging to volume B; Ri is the solvation radius of atom i (taken as the OPLS-AA van der Waals sigma parameter plus 0.5 Angstrom); and ci is the desolvation coefficient of atom i. The coefficients {c, cHB, cM, ci} were fitted from ~400 x-ray crystal structures of protein-ligand complexes with available experimental pKi data. Atoms are categorized into ~12 atom types for the assignment of the ci coefficients. The triple integrals are approximated using Generalized Born integral formulas. Refinement The Forcefield refinement scheme is more accurate than the GridMin scheme but it also takes more time. Forcefield Refinement. Energy minimization of the system is carried out using the conventional molecular mechanics setup. The current prevailing molecular mechanics forcefield will be used. However, some of the set up options, like the cutoff settings, will be overridden during the refinement process. The charges of all atoms will be assigned using the current forcefield. During the course of the refinement, solvation effects are treated by using the reaction field functional form for the electrostatic energy term. By default, the final energy is evaluated using the Generalized Born solvation model (GB/VI) but this can be changed (see below). In order to speed up the calculation, residues (or atoms, depending on the setting) over a cutoff distance (Pocket Cutoff on the Configure panel, upon pressing the Configure... button) away from all the pre-refined poses are ignored, both during the refinement and in the final energy evaluation. However, if after the refinement any poses moved to within 80% of the cutoff distance of any ignored atoms, the refinement stage will be repeated with an increased cutoff distance. In the presence of a pharmacophore defined using the Unified Annotation Scheme, restraints will be set up between the features on the ligand and the corresponding pharmacophore query points. If the distance between the feature and the query point exceeds a certain threshold (see below) by an amount x, there will be a penalty in the potential of the form kx2. The threshold is the radius associated with the pharmacophore query point minus the Radius Offset parameter on the Configure panel, or zero if this results in a negative value. The force constant k is also adjustable via the Configure panel. The following are the default settings: Receptor residues over 6 Å (the Pocket Cutoff parameter) away from all of the prerelaxed poses are ignored. All receptor atoms will be held fixed during the energy minimization. The final energy will be calculated using the Generalized Born solvation model (GB/VI , see MOE Potential Energy Model). For the pharmacophore restraints, the force constant k is set to 10.0 kcal / (mol Å2) and the Radius Offset is set to 0.4 &Aring. These defaults may be altered through a panel upon pressing the Configure... button. If GB/VI scoring is disabled, the final energy will simply be calculated in the same way as during the course of minimization instead of using the Generalized Born solvation model. If a flexible receptor is desired, the sidechains of the receptor can be tethered. However, backbone atoms are always held fixed. GridMin Refinement. In order to speed up the energy minimization, a grid is used for electrostatic calculations during the minimization process. The distance dependent dielectric model is used. In contrast, van der Waals interactions are treated explicitly. However, the final electrostatic energy is calculated using the explicit Coulomb form instead of the grid. If an energy-minimized pose lies within 2 Å of any receptor atom or lies outside the site, a huge energy will be assigned to the pose. In the presence of a pharmacophore defined using the Unified Annotation Scheme, restraints will be set up between the features on the ligand and the corresponding pharmacophore query points. If the distance between the feature and the query point exceeds a certain threshold (see below) by an amount x, there will be a penalty in the potential of the form kx2. The threshold is the radius associated with the pharmacophore query point minus the Radius Offset parameter on the Configure panel, or zero if this results in a negative value. The force constant k is also adjustable via the Configure panel. The following are the default settings: Receptor atoms within 4 Å of any pre-relaxed pose will be included for the van der Waals energy calculations. Receptor residues within 5.5 Å of any pre-relaxed pose will be included for the electrostatic calculations. The van der Waals interaction formula will assume the standard 6-12 form, as opposed to a softer 4-8 form. For the pharmacophore restraints, the force constant k is set to 10.0 kcal / (mol Å2) and the Radius Offset is set to 0.4 &Aring. These defaults may be altered through a panel upon pressing the Configure... button. Duplication Removal In the Dock application, duplication removal is not performed using the conventional rmsd cutoff approach. Poses are considered duplicated if they have the same hydrogen bonding pattern and hydrophobic interaction pattern with the receptor. A pair of ligand-receptor atoms of the appropriate types are considered to have a hydrogen bond interaction if they are less than 3.5 Å apart. A hydrophobic atom on the ligand is considered to have a hydrophobic interaction with a receptor residue if that atom is under 4.5 Å away from a hydrophobic atom of that residue. Poses are considered duplicated if the same set of ligand-receptor atom pairs are involved in hydrogen bond interactions and the same set of ligand atom receptor residue pairs are involved in hydrophobic interactions.