Exercise 1: Protein of Unknown Function from Mus musculus 1. Obtain the “Sequence of a protein of unknown function from Mus musculus” in electronic form from your instructor. Mus musculus is the genus and species name for the common house mouse, a model organism studied by many researchers in the biological sciences. 2. Go to the National Center for Biotechnology Information website: http://www.ncbi.nlm.nih.gov/ Click on “BLAST”, then click on “Protein-protein BLAST (blastp)”. Copy the mouse protein sequence and paste it into the “Search” box. From the dropdown menu “Choose database,” select “refseq.” From the dropdown menu under “Options for advanced blasting,” select “Homo sapiens” so that you will be searching for human proteins similar to the mouse protein. Be sure that the “Do CDSearch” box is checked. When this boxed is checked, a search for conserved protein domains will be conducted. Leave all other settings and parameters the same and click on the BLAST! button. 3. In the next window that appears, there will be a colored, diagrammatic representation of the mouse protein sequence showing the locations of any functional/structural domains that are present in the protein. Click on the domain(s) to find out what they are. Take some time to explore the various links describing the domain(s). For instance, try clicking the [+] symbols on the left. This should give you some clues about the identity of the mouse protein. 4. In the original window showing the diagram of domains in the mouse protein, click the “Format!” button to see the BLAST results for the mouse protein. The BLAST results will appear in a new window; print this BLAST report. If you scroll down through the BLAST results, you will see many sequence alignments one after the other. Each alignment is a sequence comparison between the mouse protein and a human protein that is similar in sequence to it. The first alignment compares the mouse protein and a human protein that is the best match to the mouse protein; the second alignment compares the mouse protein and a human protein that is the second best match to the mouse protein; and so on… To infer the function of a protein about which little is known, one can compare the sequence of the “unknown protein” to other proteins of known function. If the unknown protein is very similar in sequence to a protein of known function, then there is a good chance that the unknown protein has the same function as the known protein. 2 Use the space provided to answer the following discussion questions: A. Is there a conserved domain in the mouse protein? Name the domain and discuss what you learned about it. B. Which human proteins are similar in sequence to the mouse protein? C. Speculate about the identity and function of the mouse protein. 3 Think back to what you learned about hemoglobin in class; name at least two amino acid residues that are important to the structure and/or function of hemoglobin and explain why they are important. (You may use your textbook or notes.) ______________________________________________________________________ ______________________________________________________________________ ______________________________________________________________________ Obtain the handout showing a sequence alignment for human -globin, -globin, and myoglobin from the instructor in order to do the question below. A number of residues that are important to the structure and/or function of hemoglobin are highlighted on this alignment. Compare the alignment on the handout to the first alignment on your BLAST results. Use the space provided to answer the following discussion questions: A. Does the mouse protein have all of the highlighted residues that you know to be important to the structure and/or function of human hemoglobin? Mark/highlight the locations of these residues in the mouse protein on the printout of your BLAST results. B. What can you conclude about the importance of these residues to the structure and/or function of hemoglobin in both organisms? C. What might happen if one of these residues was replaced by some other residue? D. Identify a few residues in the mouse protein that are different from those at the corresponding position in the human globins. Why do you think these residues differ, and what can you conclude about the importance of these residues to the structure and function of this protein in both organisms? 4 5 5. Repeat step 2, but this time select “nr” from the dropdown menu “Choose database.” From the dropdown menu under “Options for advanced blasting,” select “Takifugu rubripes,” the pufferfish. Click on “Format!” to see the BLAST results. Compare your BLAST results here to those from the first BLAST search for human proteins. Print the pufferfish BLAST results. Use the space provided to answer the following discussion questions: A. Is the mouse protein more similar to human -globin or to pufferfish -globin? Is this what you might have expected before doing the BLAST search? Why? B. Does pufferfish -globin have all of the residues highlighted on the handout that you know to be important to the structure and/or function of human -globin? Mark/highlight the locations of these residues in the pufferfish protein on the printout of your BLAST results. If any of these residues differ between pufferfish -globin and human -globin, are the substitutions conservative? 6 6. Repeat step 2, but this time select “nr” from the dropdown menu “Choose database.” From the dropdown menu under “Options for advanced blasting,” select “Arabidopsis thaliana.” Arabidopsis thaliana is a plant and is a commonly studied model organism. Click “Format!” to see the BLAST results. Print the Arabidopsis Blast results. Use the space provided to answer the following discussion questions: A. Is it surprising that an Arabidopsis -globin is not among the BLAST hits from this BLAST search? Why or why not? B. Name the Arabidopsis protein that is the top hit from the BLAST search. Based on its degree of similarity to the mouse protein, in terms of total protein sequence length and percent amino acid identity, do you think it is likely to have the same structure and function as the mouse protein? Explain your reasoning. When finished, turn in your three BLAST reports and answers to the discussion questions.