Tutorial flexible receptor

This page was last updated on October 30th, 2019 at 11:03 pm

Docking a flexible ligand into a receptor with flexible side chains

In this tutorial we illustrate how to re-dock a known ligand into its native receptor. We will use the cyclic dependent kinase protein 2 CDK2 (pdbid:4EK3) and one of its ligands (pdbid:4EK4).

In this tutorial you will learn:

to compute affinity maps for a pocket with flexible receptor side chains
to run ADFR to re-dock the ligand into the receptor with 3 flexible side chains

Generate target file

Generate the target file containing the affinity maps.

flexible residues can be specified on the command line using the -f/–flexRes option with a selection string e.g. “A:ILE10,VAL32;B:SER48“.

Copy to Clipboard

Details: a target file for pocket of PDB id 1EK3 binding the crystallographic ligand can be computed using:

Copy to Clipboard

However, agfr will fail to generate the target file in this case because the docking box defined with the default padding of 4 Angstroms around the ligand is too small to cover the flexible side chains

To correct this we will increase the padding to 8.0. For reducing calculation time for the tutorial we also tell adfr to only compute maps for atom types present in the ligand ( -m ligand), and use the AutoSite 1.0 algorithm -as.

Copy to Clipboard

-r : specifies the receptor file

-l : specifies the ligand file

-f : indicates the receptor side chains to be made flexible. These atoms will not contribute to the calculation of the grids. To describe residues in different chains use a semi-colon between residues from different chains”;” e.g. “A:PHE80 ; B:LYS89”.

-m : specifies that maps should only be calculated for atom types present in the ligand

-o : specifies the name of the target file to create

The command produces the following output:

#################################################################
# If you used AGFR in your work, please cite:                   #
#                                                               #
# P.A. Ravindranath S. Forli, D.S. Goodsell, A.J. Olson and     #
# M.F. Sanner                                                   #
# AutoDockFR: Advances in Protein-Ligand Docking with           #
# Explicitly Specified Binding Site Flexibility                 #
# PLoS Comput Biol 11(12): e1004586                             #
# DOI:10.1371/journal.pcbi.1004586                              #
#                                                               #
# P. Ananad Ravindranath and M.F. Sanner                        #
# AutoSite: an automated approach for pseudoligands prediction  #
# - From ligand binding sites identification to predicting key  #
# ligand atoms                                                  #
# Bioinformatics (2016)                                         #
# DOI:10.1093/bioinformatics/btw367                             #
#                                                               #
# Please see http://adfr.scripps.edu for more information.      #
#################################################################

Computing grids on fiji a Darwin-17.7.0-x86_64-i386-64bit computer
Date Thu Apr 18 11:43:07 2019

loading receptor: data/4EK3_rec.pdbqt
loading ligand: data/4EK4_lig.pdbqt

set box using ligand
    Box center:    23.332    28.922    29.598
    Box length:    25.500    24.750    20.250
    Box size  :        68        66        54
    padding   :     8.000
    spacing   :     0.375

identifying pockets using AutoSite ....
    found 9 pocket(s)

pocket|  energy | # of |Rad. of | energy |   bns    | score  
    number|         |points|gyration|per vol.|buriedness|v*b^2/rg
    ------+---------+------+--------+--------+----------+---------
        1   -227.36   455    5.82     -0.50      0.91      65.11
        2    -63.93    83    2.67     -0.77      0.99      30.35
        3    -32.06    59    2.31     -0.54      0.61       9.40
        4    -30.03    47    2.62     -0.64      0.69       8.58
        5    -11.83    25    1.58     -0.47      0.70       7.68
        6     -9.12    21    1.51     -0.43      0.71       7.06
        7     -9.12    16    1.43     -0.57      0.64       4.56
        8     -8.36    12    1.25     -0.70      0.62       3.74
        9     -4.24    10    1.19     -0.42      0.58       2.87
    merging clusters ...
done. got 728 fill Points, in 1.26 (sec)

setting map types using: ligand to ['A' 'Br' 'C' 'HD' 'N' 'NA' 'OA']

computing maps for center=(23.332 28.922 29.598) size=(25.500 24.750 20.250) dims=(68 66 54) ...
    711 points inside the box

maps computed in 6.95 (sec)
the following 14 flexible receptor atoms did not contribute to the grid calculation:
  A:ILE10:CA,CB,CG1,CG2,CD1,
  A:LYS33:NZ,H21,H32,H43,CA,CB,CG,CD,CE,
Adding gradient to maps ...
processing maps ... done 9.00791692734
writing maps ... done 1.12659287453
done adding gradient to maps 10.34 (sec)
making target file 4EK3_rec_FR_10_33.trg ...done.
    done. 19.17 (sec)

Target file meta-data

Display the meta-data from a target file

Copy to Clipboard

Details:

Docking the ligand

Perform the docking

Copy to Clipboard

Details: ADFR will dock the ligand into receptor while treating the 2 side chains A:ILE10,LYS33 as flexible. adfr detects the number of cores available and by default will use them all to perform 8 independent searches (–nbRuns 8) each using up to 20’000 evaluations of the scoring function (–maxEvals 2000). By default adfr performs 50 searches, each allotted 2.5 million evaluations. Typically, more complex docking problems require more searches to be performed to increase the chances to find the best possible docked pose (i.e. global minimum of the scoring function). Here we set these parameters to lower values to perform a quick run that is sufficient to illustrate the docking principles.

Running this command generates the following 3 files.

4EK4_random_flexRec_summary.dlg : Docking log file. captures most of the messages printed to stdout and lists additional clustering information
4EK4_random_flexRec_out.pdbqt : Multi-model pose file, listing the solutions
4EK4_random_flexRec.dro : Docking Result Object file, containing the input, output and meta-data for this docking

NOTES:

The output files are named using the ligand name followed by the job name (–jobName if specified)
ADFR’s search procedure is stochastic, meaning that docking the same ligand into the same target twice can produce different results if different random number generator seeds are used. However, the energy landscape for this receptor and ligand is the same in both runs. If both docking runs find the global minimum of this energy landscape, the solutions produced by both runs will be the same, independently of the paths taken by the search to get there. On the other hand, searches that get trapped in a local minima, yield docking poses that differ from each other. Specifying the seeds used by the random number generator (–seed) allows reproducing a docking calculation, for a given version of the code.

The output of the command is listed below:

#################################################################
# If you used ADFR in your work, please cite:                   #
#                                                               #
# P.A. Ravindranath S. Forli, D.S. Goodsell, A.J. Olson and     #
# M.F. Sanner                                                   #
#                                                               #
# AutoDockFR: Advances in Protein-Ligand Docking with           #
# Explicitly Specified Binding Site Flexibility                 #
# PLoS Comput Biol 11(12): e1004586                             #
#                                                               #
# DOI:10.1371/journal.pcbi.1004586                              #
#                                                               #
# Please see http://adfr.scripps.edu for more information.      #
#################################################################

Docking on fiji a Darwin-17.7.0-x86_64-i386-64bit computer
Date Thu Apr 18 11:50:17 2019
reading ligand /Users/sanner/test1.1/data/4EK4_random.pdbqt
Detected 4 cores, using 4 cores
Unpacking maps /Users/sanner/test1.1/4EK3_rec_FR_10_33.trg
Performing search (8 GA evolutions with 20000 maxEvals each) ...
0%   10   20   30   40   50   60   70   80   90   100%
|----|----|----|----|----|----|----|----|----|----|
***************************************************
Termination status
    0/   8  0.0% runs failed
    8/   8 100.0% runs exhausted their evaluations
    0/   8  0.0% runs stopped converged 1 or 2 clusters
    0/   8  0.0% runs stopped after no improvement in clusters
    0/   8  0.0% runs stopped because GA ran out of choices
    0/   8  0.0% runs stopped because GA population converged

Refining results ...
done.

Docking performed in 8.19 seconds, i.e. 0 hours 00 minutes 08.185095 seconds

*************** first GA command ***************************
"/Users/sanner/test1.1/mgltools2_x86_64Darwin_1.2/bin/pythonsh" "/Users/sanner/test1.1/ADFR/bin/runOneGA.py" -F "/var/folders/gm/frz8gxj57hzbyyzwtt5bq8m40000gn/T/tmpORGHr_/4EK3_rec_FR_10_33" -M rigidReceptor -R "/var/folders/gm/frz8gxj57hzbyyzwtt5bq8m40000gn/T/tmpORGHr_/4EK3_rec_FR_10_33/4EK3_rec.pdbqt" -X "A:ILE10,LYS33" -T "/var/folders/gm/frz8gxj57hzbyyzwtt5bq8m40000gn/T/tmpORGHr_/4EK3_rec_FR_10_33/translationPoints.npy" "-l" "data/4EK4_random.pdbqt" "--jobName" "flexRes" "--maxEvals" "20000" "-O" -S 1 -j 1 -o "4EK4_random_flexRes/flexRes0001.dlg"

packaging docking results in to 4EK4_random_flexRes.droin  0.10 (s.)

Understanding the output

Here we describe line by line the messages output during the docking procedure.

Hostname and platform architecture on which the program is running
Date and time of execution
ligand docked
number of detected and used cores.
target files used

NOTES:

Number of cores. By default ADFR will use all cores available to parallelize the search threads comprised in a run. Use the “-c” command line option to limit the number of used cores.

By default, ADFR performs 50 independent searches, i.e. 50 evolutions of a population of 100 individuals using a Genetic Algorithm (GA). In this example we intentionally reduced this number to 8 very short runs.

Lines 1-4 display a progress bar indicating the percentage of these runs that completed.

The lines below provide statistics over the termination status of these searches. ADFR implements several termination criteria in its search method. In this example all search terminated because they reached their maximum number of evaluations. The default number of evaluations is 2.5 millions and is usually never reached because of other termination criteria such as convergence of the population, meaning that there is no more diversity in the population and the chances to discover new solutions has become small, or the population still has diversity (i.e. it contains multiples competitive solutions) but none of these solution has improves over a user-defined number of generations (default 5).

Typically, you want searches to end because the population converged or there was no improvement. A result like the one shown here is a clear indication that this docking problem needs more evaluations per search (i.e. increased –maxEvals).

The next section lists the results:

In this docking run, the 8 searches lead to 4 distinct solutions, listed in the result table above. The solutions are sorted by descending predicted affinity. The top ranking solution was identified by 4 of the 8 searches (clust. size column) and the pose with the best affinity was found by search number 3 (best run column). The second best solution was found 1 time and has an RMSD of 2.8 Angstroms with the top ranking solution (clust. rmsd column). If a reference ligand pose had been specified (-r/reference option), The ref. rmsd column would list the RMSD between the docked pose and the reference structure instead of listing -1.

NOTE: RMSD values are calculated using all isomorphisms between the 2 molecules, thus matching symmetry related atoms and providing a more accurate measure that used in AutoDock4, Vina, and previous versions of AutoDockFR.

Docking a flexible ligand into a receptor with flexible side chains

Generate target file

Target file meta-data

Docking the ligand

Understanding the output

Generate target file

Generate the target file containing the affinity maps.

The command produces the following output:

Target file meta-data

Display the meta-data from a target file

Docking the ligand

Perform the docking

Understanding the output