a Copy

advertisement
BOC334: Proteomics
Practical 2
Surname……………………………. Initials:………… Student nr..........................
TYPE YOUR ANSWERS IN THE BLOCKS
You can delete the instructions to save space
Identifying the product of an unknown gene and finding its relatives

Open EXPASY web site by typing http://www.expasy.org

Click “proteomics”

Select “translate” under “Tools”
Exercise 1
[6]
>exercise_1
ctgattagcctgagcggcgcggaagcggatctgctggcgaaaagctgggcgccggtgtttgcgaacaaagatgcgaa
cggcgataactttctgattgcgctgtttgaagcgtttccggatagcgcgaacttttttggcgattttaaaggcaaaa
gcattgcggatattcgcgcgagcccgaaactgcgcagcgtgagcagccgcattgtgaaccgcctgaacgattttgtg
ggcaacgcggcggatgcgggcaaaatggcgggcatgctggatcagtttagcaaagaacatgtgggctttggcgtggg
cagccagcagtttgaaaacgtgcgcagcatgtttccgggctttgtgagcagcgtggcggcgccgccggcgggcgcgg
atgcggcgtggggcaaactgtttggcctgattattgatgcgctgaaaaaagcgggcaaa
Copy and paste the >exercise_1 DNA sequence only into the sequence box, select
“verbose output” followed by “translate”.
Question 1: Can you identify possible reading frames that will code for proteins?
Copy and paste the results output here (change the Met to M) of the identified ORFs.
Highlight the possible ORFs.
Go to NCBI home page (http://www.ncbi.nlm.nih.gov/) and select BLAST followed by
BLASTp. Individually paste your putative protein sequence/s into the sequence box.
Run BLASTp using the default parameters against the nr database.
Question 2: Which ORF codes for a functional protein? Motivate (tip: look at the
e-values).
Type the answer here
Question 3: What protein is this? Motivate.
Type the answer here

Download six sequences of your functional protein and paste them into a file.
Prepare a FASTA format file in Word or Notepad. Use the first hit, the last one
(that is the same as your query sequence) and other hits in between with a range
of different e-values.

Go to http://www.ebi.ac.uk/Tools/msa/clustalw2/
Exercise 2
[5]
Paste all the sequences in FASTA format into the sequence box. Run ClustalW using
default parameters.
Question 4: Can you identify conserved amino acids?
Paste the results of the ClustalW here and highlight the conserved amino acids. Use
font size 9 when pasting into Word document.
Question 5: One of the residues is a histidine. Find out in the literature why this
is important for the function of the myoglobin?
Type your answer here
Finding related proteins for which structures are known
Repeat the BLAST search as before using the sequence of your functional protein but
use the PDB database.
Exercise 3
[5]
On the BLAST output screen, click on any of the links on the left hand of the screen
next to one of the results.
Question 6: Copy and paste the display on the screen.
Copy and paste here
Highlight the name of the molecule and the species from which it originated
Go to http://www.pdb.org
Type the letters corresponding to the code for the protein in the search window e.g.
1uc3
Download the pdb file.
Open Rasmol and load the pdb file.
Question 7: Prepare a picture using Rasmol which illustrates the most important
features of the molecule which you previously saw in practical 1.
Copy and paste (do “print screen”) the picture on the screen and point out the
most important features.
Copy and paste here
Highlight the name of the molecule and the species from which it originated
Exercise 4
[4]
ClustalW Alignment:
 Select any three sequences (from previous search) and download them into a
new file as before in FASTA format.

Align these three sequences with your original query sequence using ClustalW

Cut and paste the relevant alignments into this document as before.
Paste results here
Protparam:

Go
back
to
the
proteomics
and
sequence
analysis
tools
(http://www.expasy.org/tools/)

And find the www.expasy.org/tools/protparam.html page.

Individually paste two sequences into the query window and run the tool. Use
the original query sequence and any other sequence you obtained from your
searches.

Cut and paste your results into this file and save the file.
Paste results here
Download