tutorial covalent

Docking a covalently bound ligand

In this tutorial we illustrate how to re-dock the native covalent ligand of PDB id 3c9w. Both, the receptor (3cw9.pdbqt) and ligand (3c9w_ligandWithSideChain_random.pdbqt) are available in the data file associate with this tutorial.

For covalent docking, the receptor and covalent ligand need to share 3 atoms as sown below. 2 atoms (green) for the covalent bond and 1 atom (yellow) is an anchor atom on the receptor side. These atoms are used to transform the ligand into place with respect to the receptor to create the covalent bond.

NOTE: for covalent docking no translational points are needed as the ligand is positioned in the docking box using the covalent bond.

In this tutorial you will learn:

to generate a target file for covalent docking
to run ADFR to dock the ligand
to understand the output of an ADFR docking run

Generate target file

Generate the target file containing the affinity maps.

Copy to Clipboard

Details:

In this example we specify the box placement and size manually to illustrate this capability. Note that this is the only -b/-boxMode option for which padding is ignored.
The PDB serial numbers of the 2 receptor atoms forming the covalent bond are specified using the -c/–covalentBond option. The numbers are the serial codes appearing in the pdbqt file.
The third atom defining the covalent attachment is used to compute the torsion angle of the covalent bond. It is specified using the -t/–covalentBondTorsionAtom option.
The -x/–covalentResidues option allows to limit the traversal of the receptor to a list or residues. This is needed sometimes as covalent residues can create bonds with the receptor other than the covalent attachment and this is the case with the native ligand of 3cw9. When agfr identifies the sub-tree beyond the covalent bond to find which atoms to cut out of the receptor for calculating affinity maps, it would include a large part of the receptor because of the spurious bond the ligand makes with the receptor and cut out in excess of 1600 atoms. The -x option prevents this from happening.
the target file is called 3c9w_cov_cmdline.trg (-o option)

The output of this commands is shown below:

#################################################################
# If you used AGFR in your work, please cite:                   #
#                                                               #
# P.A. Ravindranath S. Forli, D.S. Goodsell, A.J. Olson and     #
# M.F. Sanner                                                   #
# AutoDockFR: Advances in Protein-Ligand Docking with           #
# Explicitly Specified Binding Site Flexibility                 #
# PLoS Comput Biol 11(12): e1004586                             #
# DOI:10.1371/journal.pcbi.1004586                              #
#                                                               #
# P. Ananad Ravindranath and M.F. Sanner                        #
# AutoSite: an automated approach for pseudoligands prediction  #
# - From ligand binding sites identification to predicting key  #
# ligand atoms                                                  #
# Bioinformatics (2016)                                         #
# DOI:10.1093/bioinformatics/btw367                             #
#                                                               #
# Please see http://adfr.scripps.edu for more information.      #
#################################################################

Computing grids on fiji a Darwin-17.7.0-x86_64-i386-64bit computer
Date Thu Apr 18 12:31:04 2019

loading receptor: data/3c9w.pdbqt
set box using user
    Box center:    28.565     6.329     6.985
    Box length:    22.500    22.500    22.500
    Box size  :        60        60        60
    padding   :     0.000
    spacing   :     0.375

setting map types using: all to ['HS', 'Mg', 'HD', 'NA', 'Fe', 'Br', 'NS', 'A', 'C', 'Mn', 'G', 'F', 'I', 'H', 'J', 'N', 'Q', 'P', 'S', 'GA', 'Z', 'Zn', 'Cl', 'Ca', 'OA', 'SA', 'OS']

computing maps for center=(28.565 6.329 6.985) size=(22.500 22.500 22.500) dims=(60 60 60) ...
    maps computed in 9.86 (sec)
the following 33 covalent ligand atoms did not contribute to the grid calculation:
  A:CYS164:CA,CB,SG,O23,C7,O8,C9,C25,C10,C11,C12,C2,C1,O20,H20,C6,C5,O21,C22,C4,C3,C18,O31,C17,C16,C15,O29,H29,C14,O28,H28,C13,O24,
Adding gradient to maps ...
processing maps ... done 26.2598040104
writing maps ... done 4.58263802528
done adding gradient to maps 33.02 (sec)
making target file 3c9w_cov_cmdline.trg ...done.
    done. 45.70 (sec)

Inspect target file

Display information about a target file

Copy to Clipboard

Details: the target file meta data is read and displayed.

The command produces the following output:

docking target file
  date       : Thu Apr 18 13:41:22 2019
  node       : fiji
  AGFR       : v1.2

receptor   : 3c9w.pdbqt
    FlexRec  : None
    covBond  : [3c9w:_A:CYS164:N (1591)] 3c9w:_A:CYS164:CA (1593) -- 3c9w:_A:CYS164:CB (1596)
    coords   : [23.521, 5.081, 12.263] [24.700, 4.867, 11.312] [23.696, 4.237, 13.432]
    covRes   : A:CYS164
    covFile  : covalenLig.pdbqt
    ignAtms  : [1592, 1595, 1596, 3349, 3350, 3351, 3352, 3353, 3354, 3355, 3356, 3357, 3358, 3359, 3360, 3361, 3362, 3363, 3364, 3365, 3366, 3367, 3368, 3369, 3370, 3371, 3372, 3373, 3374, 3375, 3376, 3377, 3378]

box        :  mode ['user', '28.565', '6.329', '6.985', '22.5', '22.5', '22.5'], padding 4.00
    center   : 28.565 6.329 6.985
    length   : 22.500 22.500 22.500
    size     : 0060 0060 0060
    spacing  : 0.375

maps       : 
    types    :  C  GA  Fe   A   e   d   S   Q   F  SA   G   P  Mg  Cl  Zn  OA  NS  Ca  Br  Mn  OS  
               NA   J   I  HS  HD   H   Z   N   W 
    W map    : weight 0.60 entropy -0.20
    gradients: Yes,  kept largest negative cluster

pocketMode : ['user', '28.565', '6.329', '6.985', '22.5', '22.5', '22.5']
    #fillpts : 0 points

Docking ligand

Dock the randomized ligand using the generated target file

Copy to Clipboard

Details: Here we re-dock the native ligand, that has been randomized (i.e. its conformation as well as it positions and orientation in the crystal structure have been randomly modified). adfr detects the number of cores available and by default will use them all to perform 8 independent searches (–nbRuns 8) each using up to 100’000 evaluations of the scoring function (–maxEvals 100’00). By default adfr performs 50 searches, each allotted 2.5 million evaluations. Typically, more complex docking problems require more searches to be performed to increase the chances to find the best possible docked pose (i.e. global minimum of the scoring function). Here we set these parameters to lower values to perform a quick run that is sufficient to illustrate the docking principles.

This calculation generates the following files:

3c9w_ligandWithSideChain_random_covalent_summary.dlg   # the docking log file that captures most of what is displayed on the terminal

3c9w_ligandWithSideChain_random_covalent_out.pdbqt     # the docking pose file containing the docking solutions

3c9w_ligandWithSideChain_random_covalent.dro           # the docking object file that contains input, output, and meta-data for this docking run

NOTES:

The output files are named using the ligand name followed by the job name (–jobName if specified)
ADFR’s search procedure is stochastic, meaning that docking the same ligand into the same target twice can produce different results if different random number generator seeds are used. However, the energy landscape for this receptor and ligand is the same in both runs. If both docking runs find the global minimum of this energy landscape, the solutions produced by both runs will be the same, independently of the paths taken by the search to get there. On the other hand, searches that get trapped in a local minima, yield docking poses that differ from each other. Specifying the seeds used by the random number generator (–seed) allows reproducing a docking calculation, for a given version of the code.

The output of the command is listed below:

#################################################################
# If you used ADFR in your work, please cite:                   #
#                                                               #
# P.A. Ravindranath S. Forli, D.S. Goodsell, A.J. Olson and     #
# M.F. Sanner                                                   #
#                                                               #
# AutoDockFR: Advances in Protein-Ligand Docking with           #
# Explicitly Specified Binding Site Flexibility                 #
# PLoS Comput Biol 11(12): e1004586                             #
#                                                               #
# DOI:10.1371/journal.pcbi.1004586                              #
#                                                               #
# Please see http://adfr.scripps.edu for more information.      #
#################################################################

Docking on fiji a Darwin-17.7.0-x86_64-i386-64bit computer
Date Thu Apr 18 13:46:27 2019
reading ligand /Users/sanner/test1.1/data/3c9w_ligandWithSideChain_random.pdbqt
Detected 4 cores, using 4 cores
Unpacking maps /Users/sanner/test1.1/3c9w_cov_cmdline.trg
Performing search (8 GA evolutions with 100000 maxEvals each) ...
0%   10   20   30   40   50   60   70   80   90   100%
|----|----|----|----|----|----|----|----|----|----|
***************************************************
Termination status
    0/   8  0.0% runs failed
    8/   8 100.0% runs exhausted their evaluations
    0/   8  0.0% runs stopped converged 1 or 2 clusters
    0/   8  0.0% runs stopped after no improvement in clusters
    0/   8  0.0% runs stopped because GA ran out of choices
    0/   8  0.0% runs stopped because GA population converged

Refining results ...
done.

Docking performed in 13.17 seconds, i.e. 0 hours 00 minutes 13.165729 seconds

*************** first GA command ***************************
"/Users/sanner/test1.1/mgltools2_x86_64Darwin_1.2/bin/pythonsh" "/Users/sanner/test1.1/ADFR/bin/runOneGA.py" -F "/var/folders/gm/frz8gxj57hzbyyzwtt5bq8m40000gn/T/tmpaAcjqR/3c9w_cov_cmdline" -M rigidReceptor -R "/var/folders/gm/frz8gxj57hzbyyzwtt5bq8m40000gn/T/tmpaAcjqR/3c9w_cov_cmdline/3c9w.pdbqt" "-l" "data/3c9w_ligandWithSideChain_random.pdbqt" "--jobName" "covalent" "-C" "1" "2" "3" "--maxEvals" "100000" "-O" -V "1591" "1593" "1596" -S 1 -j 1 -o "3c9w_ligandWithSideChain_random_covalent/covalent0001.dlg"

Understanding the output

Here we describe line by line the messages output during the docking procedure.

Hostname and platform architecture on which the program is running
Date and time of execution
ligand docked
number of detected and used cores.
target files used

NOTES:

Number of cores. By default ADFR will use all cores available to parallelize the search threads comprised in a run. Use the “-c” command line option to limit the number of used cores.

By default, ADFR performs 50 independent searches, i.e. 50 evolutions of a population of 100 individuals using a Genetic Algorithm (GA). In this example we intentionally reduced this number to 8 very short runs.

Lines 1-4 display a progress bar indicating the percentage of these runs that completed.

The lines below provide statistics over the termination status of these searches. ADFR implements several termination criteria in its search method. In this example all search terminated because they reached their maximum number of evaluations. The default number of evaluations is 2.5 millions and is usually never reached because of other termination criteria such as convergence of the population, meaning that there is no more diversity in the population and the chances to discover new solutions has become small, or the population still has diversity (i.e. it contains multiples competitive solutions) but none of these solution has improves over a user-defined number of generations (default 5).

Typically, you want searches to end because the population converged or there was no improvement. A result like the one shown here is a clear indication that this docking problem needs more evaluations per search (i.e. increased –maxEvals).

The next section lists the results:

In this docking run, the 8 searches lead to the same solution as indicated by the cluster size 8 on the first and only solution (clust. size column).

Docking meta-data

Display information about a docking result

Copy to Clipboard

Details: the meta data about this docking run is displayed

docking result file
  date       : Thu Apr 18 13:46:27 2019
  node       : fiji
  platform   : Darwin-17.7.0-x86_64-i386-64bit
  ncores     : 4

receptor   : 3c9w_cov_cmdline.trg
   docking target file
     date       : Thu Apr 18 13:41:22 2019
     node       : fiji
     AGFR       : v1.2

receptor   : 3c9w.pdbqt
       FlexRec  : None
       covBond  : [3c9w:_A:CYS164:N (1591)] 3c9w:_A:CYS164:CA (1593) -- 3c9w:_A:CYS164:CB (1596)
       coords   : [23.521, 5.081, 12.263] [24.700, 4.867, 11.312] [23.696, 4.237, 13.432]
       covRes   : A:CYS164
       covFile  : covalenLig.pdbqt
       ignAtms  : [1592, 1595, 1596, 3349, 3350, 3351, 3352, 3353, 3354, 3355, 3356, 3357, 3358, 3359, 3360, 3361, 3362, 3363, 3364, 3365, 3366, 3367, 3368, 3369, 3370, 3371, 3372, 3373, 3374, 3375, 3376, 3377, 3378]

box        :  mode ['user', '28.565', '6.329', '6.985', '22.5', '22.5', '22.5'], padding 4.00
       center   : 28.565 6.329 6.985
       length   : 22.500 22.500 22.500
       size     : 0060 0060 0060
       spacing  : 0.375

maps       : 
       types    :  C  GA  Fe   A   e   d   S   Q   F  SA   G   P  Mg  Cl  Zn  OA  NS  Ca  Br  Mn  OS  
                  NA   J   I  HS  HD   H   Z   N   W 
       W map    : weight 0.60 entropy -0.20
       gradients: Yes,  kept largest negative cluster

pocketMode : ['user', '28.565', '6.329', '6.985', '22.5', '22.5', '22.5']
       #fillpts : 0 points

ligand     : 3c9w_ligandWithSideChain_random.pdbqt
     nbAtoms : 34
     types   : A:6 C:16 OA:8 HD:3 SA:1 
  lig ref    : None
  summary    : 3c9w_ligandWithSideChain_random_covalent_summary.dlg
  GA params  : 
    maxEvals : 100000
    nbRuns   : 50
    jobName  : covalent
    seed     : 1
    covLig   : [1, 2, 3]

Docking a covalently bound ligand

Generate target file

Inspect target file

Docking ligand

Understanding the output

Docking meta-data

Generate target file

Generate the target file containing the affinity maps.

Inspect target file

Display information about a target file

Docking ligand

Dock the randomized ligand using the generated target file

Understanding the output

Docking meta-data

Display information about a docking result