Shuchi (With answers as of 2010-01-12)

advertisement
Shuchi
(With answers as of 2010-01-12)
Learning to navigate the RCSB PDB website (Homework assignment)
Searching from the top-bar
1. Open the RCSB PDB website (www.pdb.org). Type “4hhb” next to the box PDB ID or
Text in the top search bar and click on the Search button. You should be directed to the
structure summary page for the PDB entry 4HHB.
a. What is the molecule in this PDB entry?
HUMAN DEOXYHAEMOGLOBIN
b. What year was this PDB entry released?
1984
c. What experimental method was used to solve the structure in this entry?
X-ray crystallography
d. How many polymeric chains are present in this entry?
4 chains (2 alpha and 2 beta)
e. Name 2 ligands that are present in this entry.
Heme (HEM) and Phosphate (PO4)
f. What is the most common secondary structural element seen in the polymers of
this entry. (Hint: see the sequence tab or visualize the structure in Jmol)
Alpha helices
2. Go back to the top search bar. Type “Hemoglobin” next to the box PDB ID or Text and
click on the Search button. You should see a query results browser page.
a. How many structures match this query?
504
b. What is the most common source organism? (Hint: Click on the Organism box in
the Query Refinements section on the top of the Query Results Browser page)
Homo sapiens (man) (204)
c. Sort the results by “Release date” using the Sort by pull down menu. What is the
PDB ID of the earliest released hemoglobin structure? When was it released?
1FDH, 1976
d. Create an Image Collage using the Generate Reports pull-down menu (under
Custom Reports). Can you identify any 1 structure that has beta sheet structure?
What is this structure (include the PDB ID)?
1FNE, histocompatibility antigen
e. Did you expect to see this structure when you searched for “Hemoglobin”?
Explain.
No, since this is not really a complete hemoglobin structure. However, this structure
has a modified peptide from hemoglobin bound and the word “Hemoglobin” appears
in the file so it was picked up in the query.
3. Hemoglobin molecules have Heme groups bound to the protein. In the top search bar,
select Chemical Name from the pull-down menu. Type “Heme” and click on the Search
button.
a. How many ligands match this query? Give 2 examples and include the 3 letter
codes.
2-PHENYLHEME (2FH)
Shuchi
(With answers as of 2010-01-12)
CIS-HEME D HYDROXYCHLORIN GAMMA-SPIROLACTONE (HDD)
b. What is the 3 letter code of the most popular Heme in the PDB? (Hint: Look for
the ligand with the maximum number of structures associated with it). How many
entries are associated with this ligand?
HEM, 2428 structures.
c. Click on the list of structures with the most popular Heme ligand. Name any 2
proteins (other than Hemoglobin) that have this ligand bound to it.
Mono-oxygenase and Lactoperoxidase
Advanced Search:
4. Click on the Advanced Search button in the top search bar. A new page will open up with
options to run customized queries.
a. Choose a Query Type > Macromolecule Name (under Structure Annotation).
Type “Hemoglobin” in the box that opens up for the macromolecule name. Click
on the Result Count box. How many structures match this part of the query?
467
b. How does this number compare to the number of “hemoglobin” structures
returned from the top search bar text-search with “Hemoglobin” (question 2a)?
The above number is fewer (467 vs 504). This difference is due to the fact that the
full text search includes files where “Hemoglobin” appears anywhere in the header
section of the PDB file.
c. Click on the + sign in the Add Search Criteria bar. This should open a box to
include additional query criteria. Find the first hemoglobin structure deposited by
Max Perutz. (Hint: Add Search Critera > Author Name > “Perutz” > Submit
Query)
1FDH, 1967
d. From the left hand menu, look under Results > Query History. Select the query
results for “Molecule Name Search : Molecule Name=hemoglobin”. Are there
any structures in the result list that are not true hemoglobins? Give 2 examples.
(Hint: Check out the Enzyme Classification Query Refinement options).
3AK5, Hemoglobin protease;
3F9Q plasmepsin II
Sequence Search:
5. Type “1ZAA” in the top search box to open the structure summary page.
a. Describe the contents of this entry in words.
ZIF268 zinc finger domain bound to a double stranded DNA
b. From the Advanced Search Interface select Sequence Features > Sequence search
(BLAST or FASTA). Type “1ZAA” in the Structure ID box. Select chain ID “C”
and click on Result Count. How many structures match this query?
189
c. Before clicking on the Submit Query button check on the Remove Similar
Sequences box at 30% Identity. How many structures match this query? Explain
the difference in number reported in answers 5b and 5c
55. The number in 5c is that of all sequence clusters (with sequences that have upto
30% identity) that match the zinc finger domain sequence in PDB entry 1zaa.
Shuchi
(With answers as of 2010-01-12)
d. Can you quickly figure out what most of these proteins do? (Hint: check the GO
Hits tab)
Biological regulation and Signaling
Other Search Options:
6. Most structures in the PDB are described in a journal article, called the primary citation.
The PDB entry may also be referred in other publication. Which journal is the maximum
number of primary citations published in? (Hint: try using the Search > Histograms
options from the left hand menu.)
J. Mol. Biol.
7. Based on SCOP and CATH classification (hint: use the Browse Database option in the
left hand menu),
a. Does the PDB presently have more structures that are mostly/all alpha or
mostly/all beta?
Mostly/all beta
b. How do the number of all/mainly alpha, all/mainly beta structures compare with
the alpha beta structures? Give approximate proportions.
Mostly/all alpha + Mostly/all beta ~ Alpha beta.
Download