Using Protein Structure to Study Network Pharmacology Hauptman Woodward Institute November 5, 2009 Philip E. Bourne University of California San Diego pbourne@ucsd.edu Big Questions in the Lab 1. 2. 3. 4. August 14, 2009 Can we improve how science is disseminated and comprehended? What is the ancestry of the protein structure universe and what can we learn from it? Are there alternative ways to represent proteins from which we can learn something new? What really happens when we take a drug? Motivation • The truth is we know very little about how the major drugs we take work • We know even less about what side effects they might have • Drug discovery seems to be approached in a very consistent and conventional way • The cost of bringing a drug to market is huge >$800M • The cost of failure is even higher e.g. Vioxx >$4.85Bn Motivation • The truth is we know very little about how the major drugs we take work – receptors are unknown • We know even less about what side effects they might have - receptors are unknown • Drug discovery seems to be approached in a very consistent and conventional way • The cost of bringing a drug to market is huge ~$800M – drug reuse is a big business • The cost of failure is even higher e.g. Vioxx $4.85Bn - fail early and cheaply Why Don’t we Do Better? A Couple of Observations • Gene knockouts only effect phenotype in 1020% of cases , why? – redundant functions – alternative network routes – robustness of interaction networks A.L. Hopkins Nat. Chem. Biol. 2008 4:682-690 • 35% of biologically active compounds bind to more than one target Paolini et al. Nat. Biotechnol. 2006 24:805–815 Implications • Ehrlich’s philosophy of magic bullets targeting individual chemoreceptors has not been realized • Stated another way – The notion of one drug, one target, one disease is a little naïve in a complex system How Can we Begin to Address the Problem? • Systematic screening for multiple targets • Integration of knowledge from multiple sources • Analyze the impact on the complete network 2. What is the ancestry of the protein structure universe? 4. What really happens when we take a drug? Valas, Yang & Bourne 2009 Current Opinions in Structural Biology 19:1-6 Phosphoinositide-3 Kinase (D) and Actin-Fragmin Kinase (E) PKA E. Scheeff and P.E. Bourne 2005 PLoS Comp. Biol. 1(5): e49. ChaK (“Channel Kinase”) 9 Implications • The ATP binding cassette is preserved yet the enzyme has evolved to bind a variety of different substrates • The evolutionary history of the protein kinase-like superfamily can be traced with careful analysis • So taking this a step further … The Role of Evolution What if only the binding pocket was conserved and the global structure of the protein has changed? A drug could potentially bind to distinctly different gene families The Role of Evolution If this is True What if… • We can characterize a protein-ligand binding site from a 3D structure (primary site) and search for that site on a proteome wide scale? • We could perhaps find alternative binding sites (offtargets) for existing pharmaceuticals and NCEs? • We could use it for lead optimization and possible ADME/Tox prediction • We might be able to construct a site similarity network for a given proteome to define multiple targets for dirty drugs The Role of Evolution What Do Off-targets Tell Us? • One of four things: 1. Nothing 2. A possible explanation for a side-effect of a drug 3. A possible repositioning of a drug to treat a completely different condition 4. A multi-target strategy to attack a pathogen Today I will give you examples of 2, 3 and 4 while illustrating the complexity of the problem The Role of Evolution Agenda • Computational Methodology • Side Effects - The Tamoxifen Story • Repositioning an Existing Drug - The TB Story • Salvaging $800M – The Torcetrapib Story • The Future? - The TB Drugome Need to Start with a 3D Drug-Receptor Complex - The PDB Contains Many Examples Generic Name Other Name Treatment PDBid Lipitor Atorvastatin High cholesterol 1HWK, 1HW8… Testosterone Testosterone Osteoporosis 1AFS, 1I9J .. Taxol Paclitaxel Cancer 1JFF, 2HXF, 2HXH Viagra Sildenafil citrate ED, pulmonary arterial hypertension 1TBF, 1UDT, 1XOS.. Digoxin Lanoxin Congestive heart failure 1IGJ Computational Methodology A Reverse Engineering Approach to Drug Discovery Across Gene Families Characterize ligand binding site of primary target (Geometric Potential) Identify off-targets by ligand binding site similarity (Sequence order independent profile-profile alignment) Extract known drugs or inhibitors of the primary and/or off-targets Search for similar small molecules … Dock molecules to both primary and off-targets Statistics analysis of docking score correlations Computational Methodology 43,738 Human Proteins map human proteins to drug targets with BLAST e-value < 0.001 map human proteins to PDB structures with >95% sequence identity 13,865 Human Proteins (2,002 Drug Targets) 3,158 Human Proteins (10,730 PDB Structures) map drug targets to PDB structures 1,585 PDB Structures (929 Drug Targets) cover 929/2,002 = 46.4% drug targets structurally remove redundant structures with 30% sequence identity 2,586 PDB Structures remove redundant structures with 30% sequence identity, 825 PDB Structures (druggable) What we Search Against The Human Target List Computational Methodology Characterization of the Ligand Binding Site - The Geometric Potential Conceptually similar to hydrophobicity or electrostatic potential that is dependant on both global and local environments • Initially assign Ca atom with a value that is the distance to the environmental boundary • Update the value with those of surrounding Ca atoms dependent on distances and orientation – atoms within a 10A radius define i GP P Pi cos(ai) 1.0 2.0 neighbors Di 1.0 Computational Methodology Xie and Bourne 2007 BMC Bioinformatics, 8(Suppl 4):S9 Discrimination Power of the Geometric Potential 4 binding site non-binding site 3.5 • Geometric potential can distinguish binding and non-binding sites 3 2.5 2 1.5 1 0.5 100 99 88 77 66 55 44 33 22 11 0 0 Geometric Potential Computational Methodology 0 Geometric Potential Scale Xie and Bourne 2007 BMC Bioinformatics, 8(Suppl 4):S9 Local Sequence-order Independent Alignment with Maximum-Weight Sub-Graph Algorithm Structure A Structure B LER VKDL LER VKDL • Build an associated graph from the graph representations of two structures being compared. Each of the nodes is assigned with a weight from the similarity matrix • The maximum-weight clique corresponds to the optimum alignment of the two structures Xie and Bourne 2008 PNAS, 105(14) 5441 Similarity Matrix of Alignment Chemical Similarity • Amino acid grouping: (LVIMC), (AGSTP), (FYW), and (EDNQKRH) • Amino acid chemical similarity matrix Evolutionary Correlation • Amino acid substitution matrix such as BLOSUM45 • Similarity score between two sequence profiles d f a Sb f b S a i i i i i i fa, fb are the 20 amino acid target frequencies of profile a and b, respectively Sa, Sb are the PSSM of profile a and b, respectively Computational Methodology Xie and Bourne 2008 PNAS, 105(14) 5441 Nothing in Biology {Including Drug Discovery} Makes Sense Except in the Light of Evolution Theodosius Dobzhansky (1900-1975) Lead Discovery from Fragment Assembly • Privileged molecular moieties in medicinal chemistry • Structural genomics and high throughput screening generate a large number of proteinfragment complexes • Similar sub-site detection enhances the application of fragment assembly strategies in drug discovery 1HQC: Holliday junction migration motor protein from Thermus thermophilus 1ZEF: Rio1 atypical serine protein kinase from A. fulgidus Computational Methodology Lead Optimization from Conformational Constraints • Same ligand can bind to different proteins, but with different conformations • By recognizing the conformational changes in the binding site, it is possible to improve the binding specificity with conformational constraints placed on the ligand 1ECJ: amido-phosphoribosyltransferase from E. Coli 1H3D: ATP-phosphoribosyltransferase from E. Coli Computational Methodology Scoring The Point is this Approach Can Now be Applied on a Proteome-wide Scale a) b) Blosum45 and b) McLachlan substitution matrices. • Scores for binding site matching by SOIPPA follow an extreme value distribution (EVD). Benchmark studies show that the EVD model performs at least two-orders faster and is more accurate than the non-parametric statistical method in the previous SOIPPA version Xie, Xie and Bourne 2009 Bioinformatics 25(12) 305-312 Computational Methodology Agenda • Computational Methodology • Side Effects - The Tamoxifen Story • Repositioning an Existing Drug - The TB Story • Salvaging $800M – The Torcetrapib Story • The Future? - The TB Drugome Found.. • Evolutionary linkage between: – NAD-binding Rossmann fold – S-adenosylmethionine (SAM)-binding domain of SAMdependent methyltransferases • Catechol-O-methyl transferase (COMT) is SAMdependent methyltransferase • Entacapone and tolcapone are used as COMT inhibitors in Parkinson’s disease treatment • Hypothesis: – Further investigation of NAD-binding proteins may uncover a potential new drug target for entacapone and tolcapone Repositioning an Existing Drug - The TB Story Functional Site Similarity between COMT and InhA • Entacapone and tolcapone docked onto 215 NADbinding proteins from different species • M.tuberculosis Enoyl-acyl carrier protein reductase ENR (InhA) discovered as potential new drug target • InhA is the primary target of many existing anti-TB drugs but all are very toxic • InhA catalyses the final, rate-determining step in the fatty acid elongation cycle • Alignment of the COMT and InhA binding sites revealed similarities ... Kinnings et al. 2009 PLoS Comp Biol 5(7) e1000423 Repositioning an Existing Drug - The TB Story Binding Site Similarity between COMT and InhA COMT SAM (cofactor) BIE (inhibitor) InhA NAD (cofactor) 641 (inhibitor) Repositioning an Existing Drug - The TB Story Summary of the TB Story • Entacapone and tolcapone shown to have potential for repositioning • Direct mechanism of action avoids M. tuberculosis resistance mechanisms • Possess excellent safety profiles with few side effects – already on the market • In vivo support • Assay of direct binding of entacapone and tolcapone to InhA reveals a possible lead with no chemical relationship to existing drugs Kinnings et al. 2009 PLoS Comp Biol 5(7) e1000423 Repositioning an Existing Drug - The TB Story Summary from the TB Alliance – Medicinal Chemistry • The minimal inhibitory concentration (MIC) of 260 uM is higher than usually considered • MIC is 65x the estimated plasma concentration • Have other InhA inhibitors in the pipeline Repositioning an Existing Drug - The TB Story Agenda • Computational Methodology • Side Effects - The Tamoxifen Story • Repositioning an Existing Drug - The TB Story • Salvaging $800M – The Torcetrapib Story • The Future? - The TB Drugome Selective Estrogen Receptor Modulators (SERM) • One of the largest classes of drugs • Breast cancer, osteoporosis, birth control etc. • Amine and benzine moiety PLoS Comp. Biol., 2007 3(11) e217 Side Effects - The Tamoxifen Story Adverse Effects of SERMs cardiac abnormalities thromboembolic disorders loss of calcium homeostatis ????? ocular toxicities PLoS Comp. Biol., 3(11) e217 Side Effects - The Tamoxifen Story Structure and Function of SERCA Sacroplasmic Reticulum (SR) Ca2+ ion channel ATPase • Regulating cytosolic calcium levels in cardiac and skeletal muscle • Cytosolic and transmembrane domains PLoS Comp. Biol., 3(11) e217 • Predicted SERM binding site locates in the TM, inhibiting Ca2+ uptake Side Effects - The Tamoxifen Story Binding Poses of SERMs in SERCA from Docking Studies • Salt bridge interaction between amine group and GLU • Aromatic interactions for both N-, and C-moiety 6 SERMS A-F (red) PLoS Comp. Biol., 3(11) e217 Side Effects - The Tamoxifen Story The Challenge • Design modified SERMs that bind as strongly to estrogen receptors but do not have strong binding to SERCA, yet maintain other characteristics of the activity profile PLoS Comp. Biol., 3(11) e217 Side Effects - The Tamoxifen Story Agenda • Computational Methodology • Side Effects - The Tamoxifen Story • Repositioning an Existing Drug - The TB Story • Salvaging $800M – The Torcetrapib Story • The Future? - The TB Drugome PLoS Comp Biol 2009 5(5) e1000387 The Torcetrapib Story Cholesteryl Ester Transfer Protein (CETP) CETP inhibitor X CETP LDL HDL Bad Cholesterol Good Cholesterol • collects triglycerides from very low density or low density lipoproteins (VLDL or LDL) and exchanges them for cholesteryl esters from high density lipoproteins (and vice versa) • A long tunnel with two major binding sites. Docking studies suggest that it possible that torcetrapib binds to both of them. • The torcetrapib binding site is unknown. Docking studies show that both sites can bind to torcetrapib with the docking score around -8.0. PLoS Comp Biol 2009 5(5) e1000387 The Torcetrapib Story Docking Scores eHits/Autodock Off-target PDB Ids Torcetrapib Anacetrapib JTT705 Complex ligand CETP 2OBD -11.675 / -5.72 -11.375 / -8.15 -7.563 / -6.65 -8.324 (PCW) Retinoid X receptor 1YOW 1ZDT -11.420 / -6.600 -6.74 -8.696 / -7.68 -7.35 -6.276 / -7.28 -6.95 -9.113 (POE) PPAR delta 1Y0S -10.203 / -8.22 -10.595 / -7.91 -7.581 / -8.36 -10.691(331) PPAR alpha 2P54 -11.036 / -6.67 -0.835 / -7.27 -9.599 / -7.78 -11.404(735) PPAR gamma 1ZEO -9.515 / -7.31 > 0.0 / -8.25 -7.204 / -8.11 -8.075 (C01) Vitamin D receptor 1IE8 >0.0/ -4.73 >0.0 / -6.25 -6.628 / -9.70 -8.354 (KH1) -7.35 Glucocorticoid Receptor 1NHZ 1P93 Fatty acid binding protein 2F73 2PY1 2NNQ >0.0/ -4.33 >0.0/-6.13 /-6.40 >0.0/ -7.81 >0.0/ -6.98 /-7.64 -7.191 / -8.49 /-6.33 /6.35 ??? T-Cell CD1B 1GZP -8.815 / -7.02 -13.515 / -7.15 -7.590 / -8.02 -6.519 (GM2) IL-10 receptor 1LQS / -4.59 / -6.77 GM-2 activator 2AG9 -9.345 / -6.26 -9.674 / -6.98 (3CA2+) CARDIAC TROPONIN C 1DTL /-5.83 /-6.71 /-5.79 cytochrome bc1 complex 1PP9 (PEG) /-6.97 /-9.07 /-6.64 1PP9 (HEM) /-7.21 /8.79 /-8.94 1V5H /-4.89 /-7.00 /-4.94 human cytoglobin PLoS Comp Biol 2009 5(5) e1000387 /-4.43 /-5.63 /-7.08 /-0.58 /-7.09 /-9.42 / -5.95 -8.617 / -6.17 ??? ??? (MYR) -4.16 The Torcetrapib Story JTT705 Torcetrapib Anacetrapib JTT705 VDR – RAS RXR + PPARα PPARδ FA ? FABP ? ? PPARγ + High blood pressure Anti-inflammatory function PLoS Comp Biol 2009 5(5) e1000387 JNK/IKK pathway JNK/NF-KB pathway Immune response to infection The Torcetrapib Story Agenda • Computational Methodology • Side Effects - The Tamoxifen Story • Repositioning an Existing Drug - The TB Story • Salvaging $800M – The Torcetrapib Story • The Future? - The TB Drugome Existing Drugs 3. Protein-ligand Docking Structural Proteome 2. Binding site Similarity … Protein-drug Interactome Drugome Target identification 1. Structural Determination & Modeling Genome 4.2 Network Integration 4.1 Network Reconstruction Metabolome Drug repurposing Side effect prediction New therapeutics Drug resistance mechanism Bioinformatics 2009 25(12) 305-312 The Future Existing Drugs 3. Protein-ligand Docking TB Structural Proteome … TB Protein-drug Interactome 2. Binding site Similarity Drugome/TB 1. Structural Determination & Modeling TB Genome 4.2 Network Integration 4.1 Network Reconstruction TB Metabolome Target identification Drug repurposing Side effect prediction New therapeutics for MDR and XDR-TB Drug resistance mechanism Bioinformatics 2009 25(12) 305-312 The TB Druggome Predicted protein-ligand interaction network of M.tuberculosis. Proteins that are predicted to have similar binding sites are connected. Squares represent the top 18 most connected proteins. Bioinformatics 2009 25(12) 305-312 The TB Druggome Bioinformatics 2009 25(12) 305-312 The TB Druggome Some Limitations • Structural coverage of the given proteome • False hits / poor docking scores • Literature searching • It’s a hypothesis – need experimental validation • Money Limitations Summary • We have established a protocol to look for offtargets for existing therapeutics and NCEs • Understanding these in the context of pathways would seem to be the next step towards a new understanding – cheminfomatics meets systems biology • Lots of other opportunities to examine existing drugs – DrugX and the Recovery Act Bioinformatics Final Examples.. • Donepezil for treating Alzheimer’s shows positive effects against other neurological disorders • Orlistat used to treat obesity has proven effective against certain cancer types • Ritonavir used to treat AIDS effective against TB • Nelfinavir used to treat AIDS effective against different types of cancers Lots of Opportunities Acknowledgements Lei Xie Li Xie Jian Wang Sarah Kinnings http://funsite.sdsc.edu