An Integrated Approach to ProteinProtein Docking Zhiping Weng Department of Biomedical Engineering Bioinformatics Program Boston University What is Protein Docking? Bioinformatics Biomedical Engineering Protein docking is the computational determination of protein complex structure from individual protein structures. L L R R Motivation Bioinformatics Biomedical Engineering • Biological activity depends on the specific recognition of proteins. • Understand protein interaction networks in a cell • Yield insight to thermodynamics of molecular recognition • The experimental determination of protein-protein complex structures remains difficult. Ubiquitination Bioinformatics Biomedical Engineering Experimental Tools for Studying Protein-Protein Interactions Bioinformatics Biomedical Engineering • 3-D structures of protein-protein complexes: X-ray crystallography & NMR • Binding affinity between two proteins: SPR, titration assays • Mutagenesis and its affect on binding • Yeast 2-hybrid system • Protein Chips? Computational Tools for Studying Protein-Protein Interactions Bioinformatics Biomedical Engineering • Protein docking • Binding affinity calculation • Analysis of site-specific mutation experiments • Protein design • The kinetics of protein-protein interactions Protein-Protein Interaction Thermodynamics Bioinformatics Biomedical Engineering water L L L R [ RL ] [ R][ L] R L L R L G RT ln R L R G : Binding Free Energy R L Bioinformatics Biomedical Engineering The Lowest Binding Free Energy G water L R R L L R L L R R General Derivations Two kinds of docking problems Bioinformatics Biomedical Engineering • Bound docking The complex structure is known. The receptor and the ligand in the complex are pulled apart and reassembled. • Unbound docking Individually determined protein structures are used. Challenges Bioinformatics Biomedical Engineering • • • • Large search space Imperfect understanding of thermodynamics Protein flexibility Heterogeneities in protein interactions Divide and Conquer Bioinformatics Biomedical Engineering • Initial stage of unbound docking – Assume minimum binding site information – Try to predict as many near-native structures (hits) as possible in the top 1000, for as many complexes as possible • Post-processing – Re-rank the 1000 structures in order to pick out near-native structures Energy Components An Effective Binding Free Energy Function Bioinformatics Biomedical Engineering ΔG=ΔE vdW +ΔG desol +ΔE elec +ΔG const ΔE vdW : van der Waals energy; Shape complementarity ΔG desol : Desolvation energy; Hydrophobicity ΔE elec : Electrostatic interaction energy ΔG const : Translational, rotational and vibrational free energy changes ΔG desol = N i ΔG i i N i : Number of atoms of type i ΔG i : Desolvation energy for an atom of type i Fast Fourier Transform Bioinformatics Biomedical Engineering FFT IFFT L Correlation R FFT Surface Y Interior X Binding Site Increase the speed by 107 DOCK by Kuntz et al. Evaluate Performance Bioinformatics Biomedical Engineering • Gold Standard: Crystal structure of the complex • A near-native structure (hit): RMSD of Ca after superposition < 2.5 Å N RMSD j (x j i x ic ) 2 ( yij yic ) 2 ( zij zic ) 2 i 1 N • Success rate: Given some number of predictions, percentage of complexes with at least one hit Docking Benchmark Bioinformatics Biomedical Engineering 25 55 non-redundant complexes http://zlab.bu.edu/~rong/dock/ 23 20 16 15 10 10 6 5 0 Enzyme/Inihbitor Antibody/Antigen Others Difficult Cases A Novel Shape Complementarity Function 100% Grid-based shape complementarity (GSC) GSC+Desolvation+Electrostatics Pairwise shape complementarity (PSC) PSC +Desolvation+Electrostatics 90% 80% Success Rate 70% 60% 50% 40% 30% 20% 10% 0% 1 10 100 Number of Predictions 1000 Post-Processing Using RDOCK CAPRI Results Total Target Contacts Top Predictions Our Prediction 1 52 17 (1st) 5 2 52 27 (2nd) 50 (1st) 3 62 45 (1st), 43 (2nd) 37 (3rd) 4 58 1 (1st) 0 5 64 10 (1st) 4 (2nd) 6 65 60(1st) 18 37 30(2nd,3rd), 29(4th,5th) 31(1st) 7 Target 2: Antibody/VP6 Red: Crystal Structure Blue: Prediction 50/52; 1st Target 7: T Cell Receptor / Toxin Red: Crystal Structure Blue: Prediction 31/37, 1st Target 3: Antibody/Hemagglutinin Red: Crystal Structure Blue: Prediction 37/62, 3rd Target 6: Camelide Antibody/a amylase Red: Crystal Structure Blue: Prediction 18/65 Target 1:Hpr/HPrK Red: Crystal Structure Blue: Prediction 5/52 Summary Bioinformatics Biomedical Engineering • Conformational change tolerant target functions are needed for unbound docking • We need to balance shape complementarity, desolvation, electrostatics components • If we submit 10 predictions, we have a 60% success rate. Future Work Bioinformatics Biomedical Engineering • An automatic protein-protein docking server • Large scale comparison of all docking algorithms on the benchmark • Post processing with binding site information, conformation space search, clustering and detailed free energy calculation • Make predictions!