Conversation
VGPReys
left a comment
There was a problem hiding this comment.
I am only half way.
Will continue later.
Thank you for the detailed review! UPD: all commenets incorporated, ready for the second part:) |
|
Covalently bound to one, and interact with another for the transfer to happen
… On 16 Mar 2026, at 23:26, Victor Reys ***@***.***> wrote:
Interact ?
or covalently bound ?
|
|
We don’t distinguish anymore between expert and guru. So elevated is in principle the correct wording.
|
|
@amjjbonvin litttle question. I found two saved server runs for the E2A-HPR system described in this tutorial. One result page linked inside the tutorial (https://wenmr.science.uu.nl/haddock2.4/result/4242424242/195967-E2A-HPR) and another one linked in “Example” tab of HADDOCK server (https://wenmr.science.uu.nl/haddock2.4/result/4242424242/0-E2A-HPR) Wonder if we need to keep both or if it’s fine to keep only one of these result pages permanently available? |
|
If they are the same, then the second one is a simpler link
|
|
|
||
| <a class="prompt prompt-question">Is there a phosphate group present in this structure?</a> | ||
|
|
||
| *Hint* : you can select phosphate atoms with the following command and check how many atoms are in this selection: |
There was a problem hiding this comment.
Not found of the check
| *Hint* : you can select phosphate atoms with the following command and check how many atoms are in this selection: | |
| *Hint* : you can select phosphate atoms with the following command and read from the log how many atoms are in this selection: |
| Another way to save the structure as a PDB file is via the command: | ||
| <a class="prompt prompt-pymol">save e2a_1F3G.pdb, 1F3G</a> | ||
|
|
||
| The file will be written to the current working directory: if PyMOL was launched from a terminal, it will be saved in the directory from which PyMOL was started; if PyMOL was opened manually (e.g., via the graphical interface), it is typically saved in your home directory. |
There was a problem hiding this comment.
What about a list ?
| The file will be written to the current working directory: if PyMOL was launched from a terminal, it will be saved in the directory from which PyMOL was started; if PyMOL was opened manually (e.g., via the graphical interface), it is typically saved in your home directory. | |
| The file will be written to the current working directory: | |
| - if PyMOL was launched from a terminal, it will be saved in the directory from which PyMOL was started | |
| - if PyMOL was opened manually (e.g., via the graphical interface), it is typically saved in your home directory. |
| Otherwise, the number of possible combinations between the input molecules can quickly escalate (i.e. become very large). | ||
| For example, if both partners contain 10 conformers, this results in 100 possible starting combinations. If 1000 rigid-body models are generated (see [HADDOCK general concepts](#haddock-general-concepts) above), each combination would then be sampled only 10 times! | ||
|
|
||
| In case if limiting number of input conformers is an unreasonable choice, it is possible to increase the number of models generated in the rigid-body docking stage (it0). |
There was a problem hiding this comment.
| In case if limiting number of input conformers is an unreasonable choice, it is possible to increase the number of models generated in the rigid-body docking stage (it0). | |
| If limiting the number of input conformers is an unreasonable choice, it is possible to increase the number of models generated in the rigid-body docking stage (it0). |
| In case if limiting number of input conformers is an unreasonable choice, it is possible to increase the number of models generated in the rigid-body docking stage (it0). | ||
| However, this requires elevated permissions level on the HADDOCK 2.4 server, which you can request via "[User Dashboard](https://wenmr.science.uu.nl/dashboard){:target="_blank"}". | ||
|
|
||
| Now let's display all models of this NMR ensemble simultaneously in ribbon representation. |
There was a problem hiding this comment.
Before the word cartoon was used, I guess it is better to keep it. Also in line with PyMOL naming
| Now let's display all models of this NMR ensemble simultaneously in ribbon representation. | |
| Now let's display all models of this NMR ensemble simultaneously as cartoon representation. |
There was a problem hiding this comment.
I think we are rather showing the models as ribbon and not cartoon - easier to see the differences between the conformations
There was a problem hiding this comment.
The command below explicitly uses the ribbon representation which is basically a CA trace of the backbone.
| You should now be seing the 30 conformers present in this NMR structure. To illustrate the potential benefit of using an ensemble of conformations as starting point for docking let's look at the side-chains of the active residues: | ||
|
|
||
| You should now be seeing the 30 conformers present in this NMR structure. | ||
| It may appear that conformations are fairly conserved across all 30 models, but let us look at the side chains of the active residues: |
There was a problem hiding this comment.
| It may appear that conformations are fairly conserved across all 30 models, but let us look at the side chains of the active residues: | |
| At first, it may appear that conformations are fairly conserved across all 30 models, but let us look at the side chains of the active residues: |
| As final step, save the molecule as a new PDB file which we will call: *hpr-ensemble.pdb* | ||
| For this in the PyMOL menu select: | ||
| You should now be able to observe the range of conformational space sampled by these surface side chains. | ||
| Some residues clearly adopt a wide variety of conformations, and one of these might lead to much better docking results. |
There was a problem hiding this comment.
| Some residues clearly adopt a wide variety of conformations, and one of these might lead to much better docking results. | |
| Some residues clearly adopt a wide variety of conformations, and one of these might resemble the bound conformation, hence leading to much better docking results. |
| <a class="prompt prompt-info">Edit the PDB file (*e2a_1F3G.pdb*) in your favorite editor</a> | ||
| <a class="prompt prompt-info">Change the name of histidine 90 to NEP </a> | ||
| <a class="prompt prompt-info">Save the file (as simple text file) under a new name, e.g. *e2aP_1F3G.pdb*</a> | ||
| To use a modified amino acid in HADDOCK, it is sufficient to edit the PDB file and change the residue name of the corresponding residue. There is no need to add or delete atoms - HADDOCK will take care of this automatically. |
There was a problem hiding this comment.
| To use a modified amino acid in HADDOCK, it is sufficient to edit the PDB file and change the residue name of the corresponding residue. There is no need to add or delete atoms - HADDOCK will take care of this automatically. | |
| To use a modified amino acid in HADDOCK, it is sufficient to edit the PDB file and change the residue name of the corresponding residue. There is no need to add or delete atoms - HADDOCK will take care of building the missing atoms automatically while generating the topology of the system. |
|
|
||
| **Note:** The same procedure can be used to introduce a mutation in an input protein structure. | ||
| To introduce this modification: | ||
| <a class="prompt prompt-info">Open the PDB file *e2a_1F3G.pdb* in your favorite text editor</a> |
There was a problem hiding this comment.
Do we want to specify editors not to use ?
Or typical editors to use
| **Note:** The same procedure can be used to introduce a mutation in an input protein structure. | ||
| To introduce this modification: | ||
| <a class="prompt prompt-info">Open the PDB file *e2a_1F3G.pdb* in your favorite text editor</a> | ||
| <a class="prompt prompt-info">Find histidine with residue sequence number equal to 90</a> |
There was a problem hiding this comment.
| <a class="prompt prompt-info">Find histidine with residue sequence number equal to 90</a> | |
| <a class="prompt prompt-info">Find the histidine with residue number 90</a> |
| To introduce this modification: | ||
| <a class="prompt prompt-info">Open the PDB file *e2a_1F3G.pdb* in your favorite text editor</a> | ||
| <a class="prompt prompt-info">Find histidine with residue sequence number equal to 90</a> | ||
| Remember that residue sequence number is the second integer value in the line starting with "ATOM". Check [this link](https://www.cgl.ucsf.edu/chimera/docs/UsersGuide/tutorials/pdbintro.html){:target="_blank"} for more info. |
There was a problem hiding this comment.
proposing a simpler link
| Remember that residue sequence number is the second integer value in the line starting with "ATOM". Check [this link](https://www.cgl.ucsf.edu/chimera/docs/UsersGuide/tutorials/pdbintro.html){:target="_blank"} for more info. | |
| Remember that residue sequence number is the second integer value in the coordinate lines (starting with "ATOM" or "HETATM"). Check [this link](https://cupnet.net/pdb-format/){:target="_blank"} for more info. |
| hide lines<br> | ||
| </a> | ||
|
|
||
| Since this is an NMR structure it does not contain any water molecules and we don't need to remove them. |
There was a problem hiding this comment.
Why remove this? It is relevant since we do remove the water for the first molecule
| In case if limiting number of input conformers is an unreasonable choice, it is possible to increase the number of models generated in the rigid-body docking stage (it0). | ||
| However, this requires elevated permissions level on the HADDOCK 2.4 server, which you can request via "[User Dashboard](https://wenmr.science.uu.nl/dashboard){:target="_blank"}". | ||
|
|
||
| Now let's display all models of this NMR ensemble simultaneously in ribbon representation. |
There was a problem hiding this comment.
The command below explicitly uses the ribbon representation which is basically a CA trace of the backbone.
|
Be aware that there is a copy of this tutorial we use for teaching purposes in the 2nd year bachelor course. Might need to be updated too, but keeping the questions. Check https://www.bonvinlab.org/education/NMRMolmod/ |
| #### HADDOCK submission: Input data | ||
|
|
||
| For this we will make us of the [HADDOCK 2.4 interface](https://wenmr.science.uu.nl/haddock2.4/submit/1){:target="_blank"} of the HADDOCK web server. | ||
| Locate "[Submit a new job](https://wenmr.science.uu.nl/haddock2.4/submit/1){:target="_blank"}" button. Note that you are now in "Input data" tab. |
There was a problem hiding this comment.
| Locate "[Submit a new job](https://wenmr.science.uu.nl/haddock2.4/submit/1){:target="_blank"}" button. Note that you are now in "Input data" tab. | |
| Locate and click on the "[Submit a new job](https://wenmr.science.uu.nl/haddock2.4/submit/1){:target="_blank"}" button. Note that you are now in "Input data" tab. |
| * **Step1:** In the field "Job name", define a name for your docking run, e.g. *E2A-HPR*. | ||
|
|
||
| * **Step2:** Select the number of molecules to dock, in this case the default *2*. | ||
| * **Step2:** In the field "Number of molecules", select the number of molecules to dock, in this case 2. |
There was a problem hiding this comment.
| * **Step2:** In the field "Number of molecules", select the number of molecules to dock, in this case 2. | |
| * **Step2:** In the field "Number of molecules", select the number of molecules to dock, in this case 2, which is the default value. |
|
|
||
| **Note:** Leave all other options to their default values. | ||
| Leave all other options to their default values. | ||
| **_Note_** that you can fold and unfold the "Molecule 1 - input" section by clicking on the ▼ icon. This works for any section and subsection of HADDOCK server. |
There was a problem hiding this comment.
| **_Note_** that you can fold and unfold the "Molecule 1 - input" section by clicking on the ▼ icon. This works for any section and subsection of HADDOCK server. | |
| **_Note_** that you can fold and unfold the "Molecule 1 - input" section by clicking on the ▼ icon. This works for any section and subsection on the HADDOCK2.4 server. |
| ##### Definition of restraints | ||
|
|
||
| #### Definition of restraints | ||
| In this tab, we will define distance restraints by specify active residues for each molecule. |
There was a problem hiding this comment.
| In this tab, we will define distance restraints by specify active residues for each molecule. | |
| In this tab, we will define distance restraints by specifying active residues for each molecule. |
| The page will automatically refresh and the results will appear upon completions (which can take between 1/2 hour to several hours depending on the size of your system and the load of the server). You will be notified by email once your job has successfully completed. | ||
| This run will take between 30 minutes to several hours - depending on the load of the server. You will be notified by email once your job has been completed. The results will remain accessible for a week. | ||
|
|
||
| You do not have to keep this page open, all resent jobs can be accessed via the "[Workspace](https://wenmr.science.uu.nl/haddock2.4/workspace){:target="_blank"}" button in the navigation bar. |
There was a problem hiding this comment.
| You do not have to keep this page open, all resent jobs can be accessed via the "[Workspace](https://wenmr.science.uu.nl/haddock2.4/workspace){:target="_blank"}" button in the navigation bar. | |
| You do not have to keep this page open, all recent jobs can be accessed via the "[Workspace](https://wenmr.science.uu.nl/haddock2.4/workspace){:target="_blank"}" button in the navigation bar. |
|
|
||
| Once your run has completed you will be presented with a result page showing the cluster statistics and some graphical representation of the data (and if registered, you will also be notified by email). Such an example output page can be found [here](https://wenmr.science.uu.nl/haddock2.4/run/4242424242/195967-E2A-HPR){:target="_blank"} in case you don't want to wait for the results of your docking run. | ||
| Once your run has completed (you will also be notified by email about it) you will be presented with a result page showing the cluster statistics and graphical representation of the run. | ||
| An example output page for E2A-HPR docking can be found [here](https://wenmr.science.uu.nl/haddock2.4/run/4242424242/195967-E2A-HPR){:target="_blank"} - just in case you don't want to wait for the results of your docking run. |
There was a problem hiding this comment.
| An example output page for E2A-HPR docking can be found [here](https://wenmr.science.uu.nl/haddock2.4/run/4242424242/195967-E2A-HPR){:target="_blank"} - just in case you don't want to wait for the results of your docking run. | |
| An example output page for E2A-HPR docking can be found [here](https://wenmr.science.uu.nl/haddock2.4/run/4242424242/195967-E2A-HPR){:target="_blank"} - just in case you don't want to wait for the results of your own docking run. |
|
|
||
| <a class="prompt prompt-question">Inspect the result page</a> | ||
| <a class="prompt prompt-question">How many clusters are generated?</a> | ||
| <a class="prompt prompt-question">Inspect the result page. How many clusters have been generated?</a> |
There was a problem hiding this comment.
At that stage, do we want to make a description of the different parts available on the page ?
| HADDOCK_score = 1.0 * E_vdw + 0.2 * E_elec + 1.0 * E_desol + 0.1 * E_air, | ||
| </pre> | ||
| where Evdw is the intermolecular van der Waals energy, Eelec the intermolecular electrostatic energy, Edesol represents an empirical desolvation energy term adapted from Fernandez-Recio *et al.* J. Mol. Biol. 2004, and Eair the AIR energy. The cluster numbering reflects the size of the cluster, with cluster 1 being the most populated cluster. The various components of the HADDOCK score are also reported for each cluster on the results web page. | ||
| where *E_vdw* is the intermolecular van der Waals energy, *E_elec* is the intermolecular electrostatic energy, *E_desol* represents an empirical desolvation energy term adapted from Fernandez-Recio *et al.* J. Mol. Biol. 2004, and *E_air* is a penalty for violation of the restraints. |
There was a problem hiding this comment.
what about adding a link to the publication itself ?
| where *E_vdw* is the intermolecular van der Waals energy, *E_elec* is the intermolecular electrostatic energy, *E_desol* represents an empirical desolvation energy term adapted from Fernandez-Recio *et al.* J. Mol. Biol. 2004, and *E_air* is a penalty for violation of the restraints. | |
| where *E_vdw* is the intermolecular van der Waals energy, *E_elec* is the intermolecular electrostatic energy, *E_desol* represents an empirical desolvation energy term adapted from (Fernandez-Recio *et al.* J. Mol. Biol. 2004)[https://www.sciencedirect.com/science/article/pii/S0022283603013755], and *E_air* is a penalty for violation of the restraints. |
| ## Visualisation | ||
|
|
||
| The new HADDOCK2.4 server integrates the NGL viewer which allows you to quickly visualize a specific structure. For that click on the "eye" icon next to a structure. | ||
| HADDOCK server integrates the NGL viewer which allows you to quickly visualize a specific structure among clustered. For that click on the "eye" icon next to a structure. |
There was a problem hiding this comment.
| HADDOCK server integrates the NGL viewer which allows you to quickly visualize a specific structure among clustered. For that click on the "eye" icon next to a structure. | |
| HADDOCK server integrates the NGL viewer which allows you to quickly visualize a specific structure among clustered ones. For that click on the "eye" icon next to a structure. |
| <a class="prompt prompt-question">Based on this analysis, which cluster does satisfy best the biolocal information?</a> | ||
|
|
||
| <a class="prompt prompt-question">Is this cluster also the best ranked one?</a> | ||
| <a class="prompt prompt-question">Based on this analysis, which cluster fits biological information the mos does satisfy best the biological information? Is this cluster also the best ranked one?</a> |
There was a problem hiding this comment.
| <a class="prompt prompt-question">Based on this analysis, which cluster fits biological information the mos does satisfy best the biological information? Is this cluster also the best ranked one?</a> | |
| <a class="prompt prompt-question">Based on this analysis, which cluster best satisfies the biological information?> | |
| <a class="prompt prompt-question">Is this cluster also the best ranked one?</a> |
| <a class="prompt prompt-pymol"> | ||
| alter (chain B and 1GGR), resv -=300<br> | ||
| </a> | ||
| This shift is critical for the RMDS calculation described below! |
There was a problem hiding this comment.
| This shift is critical for the RMDS calculation described below! | |
| This shift is critical for appropriate residue mapping when performing the RMDS calculation described below! |
Issue #772