PDBE Small Molecules

advertisement
Bringing Structure to Biology:
Small Molecules and the PDBe
Protein Data Bank in Europe
www.pdbe.org
PDBe overview
• PDB is a core molecular database at EMBL-EBI
• PDBe is a founding partner of Worldwide Protein Data
Bank (wwPDB)
• Founder of Electron Microscopy Data Bank (EMDB)
• Mission: Bringing Structure to Biology
• Major activities:
• Deposition and annotation site for structural data on
biomacromolecules (X-ray, NMR, EM)
• Integrated resource of high-quality macromolecular structural
data and related information
• Provide tools and services for accessing, exploiting and
disseminating structural data to the wider biomedical community
Protein Data Bank in Europe
www.pdbe.org
PDB Depositions
10,000th PDBe annotated
structure - April 2011 (2yf6)
www.pdbe.org/2yf6
Protein Data Bank in Europe
www.pdbe.org
Chemical Component Dictionary
• Compounds in the PDB
• Small molecules bound to macromolecules
• Individual components of macromolecules
• wwPDB maintains dictionary
descriptions for all unique chemical
components
• Name, synonyms, formula, SMILES, …
• Atoms and bonds
• Ideal and representative coordinates
• Each new component assigned a
unique 3-letter identifier
• Release coincides with the release of the
parent PDB entry
Protein Data Bank in Europe
www.pdbe.org
Molecule search options
•
•
•
•
Compound name
Ligand 3-letter code
SMILES
Formula (exact or range)
e.g. C6-10 N4 O2 S0
• Chemical substructure
www.pdbe.org/chem
Protein Data Bank in Europe
www.pdbe.org
PDBe Home Page
Protein Data Bank in Europe
http://www.ebi.ac.uk/pdbe
www.pdbe.org
Ligands and the PDBe
Open
chemistry
sketchpad
Protein Data Bank in Europe
www.pdbe.org
Ligands and the PDBe
Protein Data Bank in Europe
www.pdbe.org
Ligands and the PDBe
Protein Data Bank in Europe
www.pdbe.org
2D Ligand Interaction Diagrams
www.pdbe.org/leview
• Interaction diagrams
for any given PDB
entry
• Interactive control of
distance criteria
• Diagram customisation
• Image export
png, jpg, eps…
S-benzyl-glutathione (GSB) Human Glyoxalase inhibitor (1guh)
Protein Data Bank in Europe
www.pdbe.org
PDBeXpress: rapid access to protein-ligand
interaction statistics
• Understand and assess binding site interactions
• Provide chemists with quick answers to common questions
without the need to construct complex search queries
• What residues interact?
• Which enzymes interact?
• What binds here?
• www.pdbe.org/express
Protein Data Bank in Europe
www.pdbe.org
What residues interact?
• PDB three-letter ligand code
• Ligand name
Protein Data Bank in Europe
www.pdbe.org
RTL - Retinol
What residues interact?
RTL - Retinol
Protein Data Bank in Europe
www.pdbe.org
Which enzymes interact?
• PDB three-letter ligand code
• Ligand name
Protein Data Bank in Europe
www.pdbe.org
MAN – Mannose
Which enzymes interact?
• PDB three-letter ligand code
• Ligand name
Protein Data Bank in Europe
www.pdbe.org
MAN – Mannose
What binds here?
• Search for ligands that interact with a given set of residues
• Can specify a partial or exact binding environment
Protein Data Bank in Europe
www.pdbe.org
What binds here?
Protein Data Bank in Europe
www.pdbe.org
PDBeMotif: powerful and flexible searching
• PDBeXpress modules driven by PDBeMotif
• PDBeMotif allows to combine protein sequence, chemical structure
and 3D data in a single search
Protein Data Bank in Europe
www.pdbe.org
PDBeMotif: powerful and flexible searching
• construct queries based on • ligands and their 3D environment
• secondary structure elements and small 3D motifs
• protein φ/ψ angle sequences - sequential representation of the
protein geometry
• results can be analysed against UniProt, CATH, PFAM or EC
Protein Data Bank in Europe
www.pdbe.org
Ligands need careful validation
• CCDC analysis of ligand geometries (using Relibase+/Mogul/EDS)
• Around 20% of recently determined structures have geometric errors
that could potentially cause a misleading interpretation of the binding
interactions
Wrong
Unusual/Strained
Correct
Liebeschuetz, J.W., Hennemann, J. The good, the bad and the twisted: A survey of ligand geometry in protein crystal structures
J. Comput. Aid. Mol. Des., 26, 169-183 (2012)
Protein Data Bank in Europe
www.pdbe.org
The solution…
• Mogul – a Knowledge-based library of molecular geometry derived
from the Cambridge Structural Database (CSD)
• Enables rapidly validation of the complete geometry of a given query
structure and identification of unusual features
Protein Data Bank in Europe
www.pdbe.org
Protein Data Bank in Europe
www.pdbe.org
MoU with CCDC
• wwPDB/CCDC Memorandum of Understanding
• wwPDB gets to use Mogul for validation of all current and future
compounds in the PDB
• wwPDB gets to incorporate and redistribute CSD coordinates for
all current and future ligand compounds in the PDB
• wwPDB gets to use Mogul and CSD coordinates to derive
dictionaries for all current and future compounds in the PDB
Protein Data Bank in Europe
www.pdbe.org
Prevention is the best cure
• Thanks to collaboration with CCDC
• We can add CSD coordinates for all existing small molecules in
the PDB (and variants, e.g. D-amino acids) that also occur in the
CSD
• We can use these coordinates and Mogul to derive refinement
dictionaries
• Grade (Global Phasing; uses Mogul and RM1)
• Will improve quality and consistency of the archive
• We can provide reasonable starting coordinates and refinement
dictionaries for all existing compounds in the PDB
Protein Data Bank in Europe
www.pdbe.org
Future of the PDB?
• At present PDB is a historic archive
• We have to accept and distribute everything
• “Archive” – i.e., what was described in the literature
• Essentially provider-centric
• We capture X-ray detector type but not ligand function…
• Organised by entry rather than molecule/complex/…
• Shifting user communities/demands
• We must serve the consumers of structural data (non-experts)
• Don’t think in terms of PDB entry codes
• Can’t tell a good from a bad model
Protein Data Bank in Europe
www.pdbe.org
PDBe Team February 2012
Protein Data Bank in Europe
www.pdbe.org
Funding
Protein Data Bank in Europe
www.pdbe.org
Thank you!
• Tutorials…
http://www.ebi.ac.uk/pdbe/resources/educationTabContent/tutorials/PDBeChem.pdf
http://www.ebi.ac.uk/pdbe-apps/quips?story=XmasFactor&auxpage=XmasChemTut
http://www.ebi.ac.uk/pdbe/docs/Tutorials/PDBeChem.html
• Contact us…
www.pdbe.org
pdbehelp@ebi.ac.uk
• Follow us…
http://www.facebook.com/proteindatabank
http://twitter.com/PDBeurope
Protein Data Bank in Europe
www.pdbe.org
Download