how2009b_homology_modelling_answers

advertisement
SWISS-MODEL Workspace
Create a user account for SWISS-MODEL Workspace
http://swissmodel.expasy.org/workspace/
1. SWISS-MODEL Repository
1. Retrieve a precomputed homology model for the platypus myoglobin
MYG_ORNAN from the SWISS-PROT Repository.
Q1.1 What template structure has been used in the modeling process?
Pig myoglobin (1myh.pdb)
Q1.2 What is the sequence identity percentage between the target and the
template?
Sequence identity:
84%
Q1.3 Is this sufficient for reliable homology modeling?
Yes. Target/template sequence identity above 50 % will typically yield high
quality homology models.
2. Download the model as a Swiss-PDB Viewer project file and open the file
in DeepView.
Identify the residues that are different between the target and the
template.
Hint: Select variable positions
Select > Residues Identical to Reference Layer
Select > invert selection
3. Create a visualisation that allows you to see the side chains in the variable
positions relative to the overall fold of the protein. Copy paste an image of
your visualization below.
Q1.4 Are any of the variable residues located in the hydrophobic core of the
protein?
None of the variable residues are located in the hydrophobic core. All of them are
on the surface, which typically evolves faster than the core.
2. Alignment mode & comparing models
2.1 Automated mode
Use sequence_11B from sequence assignments list as TARGET
Submit TARGET sequence to Swissmodel's automatic mode
- note sequence identity, e-values from template search
- look at the quality profiles
- download model, send to SolvX server (must check 'Yes' first chain only)
3.2. Swissmodel alignment mode with manually improved GTG alignment
The TARGET was submitted to GTG server. The best template identified from
GTG results is 1j6oA. Below is the GTG's alignment in clustal W format:
CLUSTAL W
QUERY YSDIHLHYVDFFQ----ESAGITKLL---EGVASAKIEHAVFTGISVAKEWDENEPVQPRYYAG-DDAKAYW----YSAT
1J6O-A XVDTHAH-LHFHQFDDDR------NAVISSFEENNIEFVVNV---------------G-VNL-EDSKKSLDLSKTSDR-QUERY DILIAHAYKQLPTEAKTRFHPFISGFNPNDMNAATQIKNMLDRDPGLWQGIGEVFTRHDYLTALIYGKAPRANSRALAKV
1J6O-A -------IFCSVGVH-----PH---DAKEVPEDFIEHLEKFAKDEKVVAIGETGLDFFRNISPAEVQKRVFVEQIELAGK
QUERY YRVAAEYDLPVLLHSNITSKRERSPLYLGEFEEALRANPDVRFIWSHAGTSMEIYRSQGKMAFLLPTVEQLLGKYPNLYI
1J6O-A LNLPLVV------HIRDAYSEAYEILRTES-------LPEKRGVIHAFSSDYE----------WA-K--KFIDLGFLLGI
QUERY DLSWSVLRPYLVDELGNP-DPAWLDLVSRYPTRFMLGSD
1J6O-A GG------PVTYPKNEALRE-----VVKRVGLEYIVLET
Manually refined alignment
Swissmodel in Alignment mode could not build a model because loops did not
close. Liisa has manually edited the alignment, moving inserts and gaps to
suitable positions (turns), adding C-terminal sequence to the alignment, and
doing five cycles of refinements to the alignment in order optimize the SolvX
profile. This is the alignment after five attempts (in fasta format):
>1j6oA
HHHHXVDTHAHLH-FHQFDDDRNAVISSFEENNIEFVVNV---------------GVNLEDSKKSLDLSKT-SDR
IFCSVGVHPHDAKEV------------PEDFIEHLEKFAKDEKVVAIGETGLDFFRNIS------PAEVQKRVFVEQ
IELAGKLNLPLVVHIRDAYSEAYEILRTES--------LPEKRGVIHAFS------------SDYEWAKKFIDLGFLLGI
GGPVTYP----------KNEALREVVKRVGLEYIVLET
DCPFLPPQPFRGKRNEPKYLKYVVETISQVLGVPEAKVDEATTENARRIFLEVKE
>TARGET_5
----YSDIHLHYVDFFQESAGITKLLEGVASAKIEHAVFTGISVAKEWDENEPVQPRYYAGDDAKAYWYSATDIL
IAHAYKQLPTEAKTRFHPFISGFNPNDMNAATQIKNMLDRDPGLWQGIGEVFTRHDYLTALIYGKAPRANSRALAKV
YRVAAEYDLPVLLHSNITSKRERSPLYLGEFEEALRANPDVRFIWSHAGTSMEIYRSQGKMAFLLPTVEQLLGKYPNLYI
DLSWSVLRPYLVDELGNPDPAWLDLVSRYP-TRFMLGS
DVVGTFGNLSKYMFGFEPFLDALPHDIAHRVALSNLISILPKARQKEALDMG---
Send this alignment to Swissmodel in Alignment mode. Download the model and
send to SolvX server.
Compare the models from Automated mode and manually edited alignment
submitted to Alignment mode. Look at the global quality scores and the quality
profiles.
Q2.1 - Which of these two models has better quality?
Manually edited alignment
Automated mode: solvx -29.3
Manual mode: solvx -44.5
Q2.2 - What are the SolvX scores for these two models? Which one is better?
The Manually edited alignment
Lessons: getting the alignment right is difficult. Manually editing the alignment
can improve the quality of a 3D model. It is difficult to achieve net negative, not
to mention native-like, SolvX profiles if the template is wrong.
Swissmodel uses two templates, which are structurally similar to 1j6oA.
Swissmodel's alignments are very bad though the templates are correct.
3 Phyre server
Submit sequence_11B to the Phyre modelling server at
http://www.sbg.bio.ic.ac.uk/phyre/
Analyse the resulting model using SolvX-server.
Compare the SolvX score of the model generated by Phyre to those created using
Swissmodel server.
Q3.1 - What is the solvation score of the Phyre model?
-143.9
Models generated by the Phyre server appear to have varying solvation
scores.
Q3.2 - Based on the SolvX score which server gives better results for this
sequence?
Phyre: -143.9
Swissmodel alignment mode: -50.7
Phyre is gives better results in this case
4 SWISS-MODEL Alignment mode
Sequence Alignment is the most important step in homology modeling. Incorrect
alignment will lead to wrong 3D model. Alignment between two sequences can
be obtained by pairwise alignment method (e.g. BLAST). However, in most cases
alignment can be improved using multiple sequence alignment.
Model the structure of the protease sequence (target) below:
>Murine_HV
SGIVKMVSPTSKVEPCIVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTDPDYPNLLCRVTSSDFCVMSGRMSLTVMSYQMQGCQ
LVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVTLRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQLELSTGCH
TGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAIFNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKADLVLDALASMTGVT
VEQVLAAIKRLHSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ
1. Identify structural templates from the exPDB database for your target
protein by using template search at :
SWISS-Model Workspace > Tools > Template Identification
To make the search faster check Gapped Blast Query only!
Q4.1 What is the highest scoring template?
3d23 (A, B, C or D chain)
2. The file proteases.fasta contains sequences of homologous Corona virus
proteases.
Retrieve the sequence for the highest scoring template pdb-file, and add it
to the file containing protease family sequences.
3. Align the sequences using Muscle multiple sequence alignment program.
Inspect your alignment in Jalview for any alignment problems.
4. Save your alignment in fasta format
5. Upload your alignment to:
SWISS-MODEL Workspace > Alignment Mode
Note: You may need to edit the template sequence name in order for the server
to be able to parse the alignment information correctly.
6. Download your model as a Swiss-PDB Viewer project file and open it in
DeepView
7. Color the target structure by Protein Problems
Protein Problems colour scheme:
This menu will affect the color of the backbone and of the sidechain separately
irrespective of the 'Color act on...' current setting. Residues with peptide bonds
too long will be colored in pink, while the rest of the protein is colored in grey.
This is very useful during modeling. In addition, the backbone of residues with
phi/psi angles laying in "forbidden" zones will be colored in yellow (except
GLY that are ignored) and Prolines whose backbone will be colored in red. Also,
the sidechains of residues capable of H-bonding that are not participating
in any H-bond with other residues are colored in orange. Finally, Clashes will
also be computed and displayed in pink dotted lines. Residues with clashes will
be selected.
http://spdbv.vital-it.ch/color_menu.html
Q 4.2 Which residue is flagged as lacking expected hydrogen bonds (orange)?
Trp31, Cys44, Asp48, Met49, Asn97, His134, Trp207, Cys225, Glu260, Gln277
(Orange)
Q 4.3 Which residues are flagged as having forbidden main chain bond angles?
Leu27, Asp33, Gln221 (yellow)
Leu27, Pro184 (red)
8. Look at the ANOLEA and GROMOS plots in SWISS-MODEL workspace
Q 4.3 Are there any residues with high scores (high energy) in the model?
No there are no high scoring residues with is a good thing.
9. Generate solvation profiles for your model and your template using Solvxserver. Copy paste the profiles below.
Q 4.4 What are the overall solvation scores for the target and the template?
Target: Overall Score: -68.4
Template Structure: 3D23-D , Overall Score: -72.0
Download