Shuchi (With answers as of 2010-01-12) Learning to navigate the RCSB PDB website (Homework assignment) Searching from the top-bar 1. Open the RCSB PDB website (www.pdb.org). Type “4hhb” next to the box PDB ID or Text in the top search bar and click on the Search button. You should be directed to the structure summary page for the PDB entry 4HHB. a. What is the molecule in this PDB entry? HUMAN DEOXYHAEMOGLOBIN b. What year was this PDB entry released? 1984 c. What experimental method was used to solve the structure in this entry? X-ray crystallography d. How many polymeric chains are present in this entry? 4 chains (2 alpha and 2 beta) e. Name 2 ligands that are present in this entry. Heme (HEM) and Phosphate (PO4) f. What is the most common secondary structural element seen in the polymers of this entry. (Hint: see the sequence tab or visualize the structure in Jmol) Alpha helices 2. Go back to the top search bar. Type “Hemoglobin” next to the box PDB ID or Text and click on the Search button. You should see a query results browser page. a. How many structures match this query? 504 b. What is the most common source organism? (Hint: Click on the Organism box in the Query Refinements section on the top of the Query Results Browser page) Homo sapiens (man) (204) c. Sort the results by “Release date” using the Sort by pull down menu. What is the PDB ID of the earliest released hemoglobin structure? When was it released? 1FDH, 1976 d. Create an Image Collage using the Generate Reports pull-down menu (under Custom Reports). Can you identify any 1 structure that has beta sheet structure? What is this structure (include the PDB ID)? 1FNE, histocompatibility antigen e. Did you expect to see this structure when you searched for “Hemoglobin”? Explain. No, since this is not really a complete hemoglobin structure. However, this structure has a modified peptide from hemoglobin bound and the word “Hemoglobin” appears in the file so it was picked up in the query. 3. Hemoglobin molecules have Heme groups bound to the protein. In the top search bar, select Chemical Name from the pull-down menu. Type “Heme” and click on the Search button. a. How many ligands match this query? Give 2 examples and include the 3 letter codes. 2-PHENYLHEME (2FH) Shuchi (With answers as of 2010-01-12) CIS-HEME D HYDROXYCHLORIN GAMMA-SPIROLACTONE (HDD) b. What is the 3 letter code of the most popular Heme in the PDB? (Hint: Look for the ligand with the maximum number of structures associated with it). How many entries are associated with this ligand? HEM, 2428 structures. c. Click on the list of structures with the most popular Heme ligand. Name any 2 proteins (other than Hemoglobin) that have this ligand bound to it. Mono-oxygenase and Lactoperoxidase Advanced Search: 4. Click on the Advanced Search button in the top search bar. A new page will open up with options to run customized queries. a. Choose a Query Type > Macromolecule Name (under Structure Annotation). Type “Hemoglobin” in the box that opens up for the macromolecule name. Click on the Result Count box. How many structures match this part of the query? 467 b. How does this number compare to the number of “hemoglobin” structures returned from the top search bar text-search with “Hemoglobin” (question 2a)? The above number is fewer (467 vs 504). This difference is due to the fact that the full text search includes files where “Hemoglobin” appears anywhere in the header section of the PDB file. c. Click on the + sign in the Add Search Criteria bar. This should open a box to include additional query criteria. Find the first hemoglobin structure deposited by Max Perutz. (Hint: Add Search Critera > Author Name > “Perutz” > Submit Query) 1FDH, 1967 d. From the left hand menu, look under Results > Query History. Select the query results for “Molecule Name Search : Molecule Name=hemoglobin”. Are there any structures in the result list that are not true hemoglobins? Give 2 examples. (Hint: Check out the Enzyme Classification Query Refinement options). 3AK5, Hemoglobin protease; 3F9Q plasmepsin II Sequence Search: 5. Type “1ZAA” in the top search box to open the structure summary page. a. Describe the contents of this entry in words. ZIF268 zinc finger domain bound to a double stranded DNA b. From the Advanced Search Interface select Sequence Features > Sequence search (BLAST or FASTA). Type “1ZAA” in the Structure ID box. Select chain ID “C” and click on Result Count. How many structures match this query? 189 c. Before clicking on the Submit Query button check on the Remove Similar Sequences box at 30% Identity. How many structures match this query? Explain the difference in number reported in answers 5b and 5c 55. The number in 5c is that of all sequence clusters (with sequences that have upto 30% identity) that match the zinc finger domain sequence in PDB entry 1zaa. Shuchi (With answers as of 2010-01-12) d. Can you quickly figure out what most of these proteins do? (Hint: check the GO Hits tab) Biological regulation and Signaling Other Search Options: 6. Most structures in the PDB are described in a journal article, called the primary citation. The PDB entry may also be referred in other publication. Which journal is the maximum number of primary citations published in? (Hint: try using the Search > Histograms options from the left hand menu.) J. Mol. Biol. 7. Based on SCOP and CATH classification (hint: use the Browse Database option in the left hand menu), a. Does the PDB presently have more structures that are mostly/all alpha or mostly/all beta? Mostly/all beta b. How do the number of all/mainly alpha, all/mainly beta structures compare with the alpha beta structures? Give approximate proportions. Mostly/all alpha + Mostly/all beta ~ Alpha beta.