Modeling and Analysis Techniques in Systems Biology. CS 6221 Lecture 1 P.S. Thiagarajan Basic Info • • • • • P.S. Thiagarajan COM2 #03 – 55 ; Tel Ext. 67998 thiagu@comp.nus.edu.sg www.comp.nus.edu.sg/~thiagu Course web page: – www.comp.nus.edu.sg/~cs6221 – We will be using the IVLE system extensively. 2 Office Hours • Send mail first and fix an appointment. 3 Course Material • Selected Parts of the text book : – Systems Biology in Practice: E. Klipp, R. Herwig, A. Kowald, C. Wierling, H. Lehrach (Wiley) • Selected Survey papers, book chapters. • Lecture slides. • Research Articles. 4 Assignments • Lab Assignments –3 – tool based (Cell Illustrator, COPASI, SimBio) – Individual 5 Term Papers • Read a paper or –more likely- a bunch of papers on a topic. • Summarize in the form of a term paper. • First assignment: Common • Second assignment: – More substantial – Can be aligned to your interests 6 Talk • Give talk based on the second term paper. – 25 + 5 minutes. 7 Grading (Tentative) • Lab assignments 45% (15 + 15 + 15) • Term papers 40% (15 + 25) • Talk: 15% 8 What is the Course About? • Computational systems biology – Computational aspects of systems biology. • Systems biology: – Not just focus on individual components. • genes, mRNAs, proteins, membranes, ligands …. – But study a system of such components and their interactions. • Many different views of systems biology. 9 Why Systems Biology? • Biology has traditionally –and extremely successfully!- focused on what individual parts of a cell do . • Bio-chemistry of large and small molecules – The structure of DNA and RNA – Proteins, ligands,… 10 Why Systems Biology? • But functionality of a system is determined crucially by the interactions of the parts. • Many fundamental biological processes are dynamic. – cell growith/division/differentiation – Metabolism,…. • Many diseases are marked by malfunctioning of these processes. 11 Why Systems Biology? • Advances in experimental technology are producing vast amounts of data concerning biological processes. – Which genes get expressed “when” in controlled conditions. • One would like to understand this data in a systemic way. • Enter: computational systems biology! 12 The CSB Approach • View selected biological processes as dynamical systems. – – – – Model Simulate Analyze Predict • Many research communities study dynamical systems … 13 What do we need ? • Biology for computer scientists. – basic biological sub-systems/processes – experimental techniques. • Modeling, analysis and simulation techniques. • Biologists as collaborators! 14 Current Status • Modeling techniques. – Mathematical • differential equations, Linear algebra, probability theory, statistics, Boolean networks, Markov chains, Bayesian networks,…. – CS-specific: • Automata, Petri nets, Hybrid functional Petri nets, hybrid automata, Bayesian networks/inferencing/learning, Markov chains, Model checking…. 15 Current Status • Metabolism – Kinetics “laws” (models). – Enzyme kinetics, law of mass action, Michelis-Menten kinetics – Metabolic network models and flux analysis. 16 Current Status • Signal Transduction • Receptor-ligand interactions • Protein actors • signaling dynamics 17 Current Status • Other biological processes • • • • biological oscillations protein folding kinetics cell cycle Gene expression, regulation 18 Current Status • Modeling tools • Cell Illustrator, COPASI, SimBio, ….. 19 What shall we do? • Selected basic topics. – To illustrate the current state of the field. – To critically examine what is missing. – To discuss promising lines of research. 20 What can CS offer? • We “know” how to deal with complex systems. – Hierarchy • silicon realization of circuits, digital design, microarchitectures, assemble language, programming languages, GUIs, … – separation of concerns. – concepts (models), techniques, tools at each layer and for connecting the layers. 21 What can CS offer? • Deal with other disciplines. – Multi-media – Control – Manufacturing – Communications – Business! • Using computing power via algorithms and data structures! • Computational thinking?! 22 What can CS offer? • Find the right level abstractions. – approximations • Handle distributed dynamics • Deal with hybrid behaviors • Build tools. 23 What the Course is NOT about. • We will not deal with: – Traditional “Bio-Informatics” topics • data mining, sequence analysis, … – Computational aspects of structural biology • Proteins structure, folding… 24 Contents • Bio-chemical networks – The basics of chemical kinetics • Three types of bio-chemical networks – Gene networks – Metabolic networks – Signaling pathways 25 Bio-pathways • Many studies of biological sub-systems boil down to studying: – bio-pathways • A network of bio-chemical reactions. 26 The hierarchy of bio-chemical networks Bio-Chemical reactions Metabolic pathways A network of Bio-Chemical reactions Signaling pathways Gene regulatory networks Interacting networks of Bio-Chemical reactions Cell functions 27 Biopathways 28 Gene Regulatory networks • Boolean models • Differential equations • Bayesian networks. 29 Metabolic pathways • Petri nets • Linear algebra • Flux analysis 30 Signaling Pathways • • • • Differential equations. Hybrid functional Petri nets Hybrid automata Stochastic models. 31 Our Research • ODEs based modeling. – Parameter estimation techniques • Stochastic approximations of ODEs dynamics. – Parameter estimation, sensitivity analysis • GPU implementations • Probabilistic (statistical ) model checking 32 Our Research • Collaboration with biologists: – Signaling pathways: • • • • AKT/MAPK pathway Complement pathway TLR3-TLR7 signaling pathways DNA damage/repair pathways • www.comp.nus.edu.sg/~rpsysbio 33 Expected Outcomes • Have a sound grasp of: – current modeling and simulation techniques (Signaling pathways) – Reaction kinetics – stochastic models and simulations – Analysis techniques: • Parameter estimation, sensitivity analysis 34 Expected Outcomes • Be aware of the limitations of current techniques and state of knowledge • Be ready to undertake modeling and simulation work. 35 Let us get started. 36 Basic Biology: Sources • Chapter 2 (Biology in a Nutshell) of the book “Systems Biology in Practice” by E. Klipp et.al. • Chapter 1 (Molecular Biology for Computer Scientists) of the book “Artificial Intelligence and Molecular Biology” by Lawrence Hunter. • The internet! 37 A major goal of biology • Understand the molecular biology of eukaryotic cells. • Cell: the basic building block. – Two major families: Prokaryotes and Eukaryotes. – Eukaryotes • More complex; genetic material is contained in the nucleus; • Most multi-cellular organisms are made up of eukroyotes.; WE are made up of these types of cells. 38 Cells • In multi-cellular organisms; – Cells are differentiated. – Different types of cells have different functions (and composition). – Groups of cells for specific functionalities • tissues. • we have 14 different types of tissues. 39 Source ? 40 Major Classes of Bio-Molecules • • • • Carbohydrates Lipids Proteins Nucleic acids 41 Proteins • Many functions! • Build up the cytoskeletal structure of the cell (the scaffolding) • Responsible for cell movements (motility) • Serve as catalytic enzymes for bio-chemical reactions. • Induce signal transductions. • Control transcriptions and translation of genes • Control degradation of proteins. 42 Proteins • Proteins consist of polypeptides. – Polypeptide - a LONG chain of amino acids bonded together by peptide bonds between adjacent amino acid residues. • The order of amino acids constituting a peptide is fundamental. – Primary structure – coded by genetic information 43 Proteins • 20 (23?) different amino acids • A protein can have 50 – 4000 amino acids sequence. (50 – 1000 is the typical range) • 201000 possible proteins! • Actually, only a tiny fraction is found in nature. 44 Nucleic Acids • DNA (Deoxyribonucleic acid) molecules store genetic information. – Present in all living organisms • RNA (Ribonucleic acid) takes part in a large number of processes. – Transferring hereditary information in the DNA to synthesize proteins. 45 The Central Dogma • First enunciated by Francis Crick in 1958[1] – re-stated in a Nature paper published in 1970:[2] • Three major classes of information-carrying biopolymers: – DNA, RNA, proteins – Information encoded as sequences of molecules. 46 The Central Dogma • In principle there can be 9 types of transfers: DNA RNA Proteins DNA RNA Proteins 47 The Central Dogma • The “simple” form of central dogma states: DNA RNA Proteins 48 The Central Dogma • Information cannot be transferred back from protein to either protein or nucleic acid. • 'once information gets into protein, it can't flow back to nucleic acid.' 49 Current Known Information Flows Special flows occur in retro viruses ! 50 Information Flows (Replication) DNA DNA DNA mRNA (Transcription) Proteins (Translation) mRNA 51 Mechanism of Cellular Functions – Replication (of DNA) – Transcription of RNA and Processing –by splicingto yield mRNA which migrates to the cytoplasm. – Translation (by ribosomes) of the code carried by mRNA into proteins. 52 Legend: 53 Post-translational Modifications • Proteins undergo many modifications to implement cellular functions – Phosphorylation (Activation of proteins) – Dephosphorylation (Deactivation of proteins) – Methylation and acetylation (Gene silencing. Plays a role in cell differentiation) – Cleavage (Cutting of genes and proteins. For degradation and apoptosis) – Ubiquitination (Marking of proteins for further degradation) 54 Interaction roles of proteins • Proteins have specific roles in the form of chemical interactions. – Kinase (Catalyzes phosphorylation, thereby activating other proteins) – Phosphatase (Catalyzes dephosphorylation) – Transcriptional Co-factors 55 Role of Bio-pathways • Apoptosis – programmed cell death • Differentiation – Cells getting specialized for specific functions • Cell-cycle – Growth and replication of cells • Many others! 56 Example: Wnt Signaling Pathway • Most studies on each of the two types of pathways (Signaling and GRN) done in isolation • Wnt canonical pathway, starts with the binding of the Wnt ligand to Frz receptor • Chain of chemical reactions occur, which results in the transcription factor β-Catenin being translocated to the nucleus • Cofactor with TCF/LEF to up-regulate the transcription of several genes 57 Wnt Signaling Pathway (Canonical) Wnt Dsh Degradation Cytoplasmic When It will inhibit Wnt ligand the Complex B-catenin formation bindscan will to form be Frz, of then the phosphorylated Dsh translocate when degradation is GSK-3B recruited to by binds tothe the and phosphorylates plasma complex nucleus membrane where and gets it binds ‘marked’ and APC to anddegradation for gets co-factors Axin activated Tcf and Lef p Dsh GSK-3B p B-Catenin Lef B-Catenin p APC p Axin Tcf 58 Reaction Kinetics Sources • Chapter 5 (Metabolism) of the book “Systems Biology in Practice” by E. Klipp et.al. • Other related material to be uploaded. 60 Bio-Chemical Reactions • Bio-Chemical reaction: – A basic unit of biological processes. – Convert molecules of one type into another • Can be modeled at different levels of abstraction (time scales). – Microscopic: single molecules and their interactions – Macroscopic: Concentrations and rates (changes of concentration per time unit). 61 62 Reactions • Bio-chemical reaction: – Involves bio-molecules. • Proteins, carbohydrates, lipids,… – Creation and transformation of bio-molecules. – Control the flow of energy , materials and information through the cell. 63 Kinetic Models of Reactions • Reaction: – A chemical process resulting in inter-conversion of the reactants. • motion of electrons cause chemical bonds to break and form. • Reaction types – Isomerization • structural rearrangement (transform one isomer to another) • no change in net atomic composition 64 Reaction Types • Direct combination or synthesis: – two or more chemical elements or compounds unite to form a more complex product. • 2H2 + O2 → 2H2O • Chemical decomposition – a compound is decomposed into smaller compounds: • 2H2O → 2H2 + O2 65 Reaction Types • Single displacement or substitution – an element being displaced out of a compound by a more reactive element: • 2Na + 2HCl → 2NaCl + H2 • Double displacement – two compounds in aqueous solution exchange elements or ions to form different compounds. • NaCl + AgNO3 → NaNO3 + AgCl 66 Reaction Kinetics • Kinetics: – Determine reaction rates • Fix reaction law and • determine reaction rate constant • Solve the equation capturing the dynamics. • The reaction rate for a product or reactant in a particular reaction: – the amount (in moles or mass units) per unit time per unit volume that is formed or removed. 67 Reaction Rates • Influenced by: – Temperature – Concentration – Pressure – Light – Order (zero, first, second) – catalyst 68 Rate Laws • Rate law: – An equation that relates the concentrations of the reactants to the rate. • Differential equations are often used to describe these laws. • Assumption: The reactants participating in the reactions are abundant. 69 Rate Laws • Mass action law: – The reaction rate is proportional to the probability of collision of the reactants – Proportional to the concentration of the reactants to the power of their molecularities. 70 Mass action law S1 + S2 V P V = k. [S1] [S2] [S1] is the concentration (Moles/ litre) of S1 [S2] is the concentration (Moles/ litre) of S k is the rate constant V, the rate of the reaction 71 Mass action law S1 + S2 V1 V2 2P V = (V1) - (V2) = k1. [S1] [S2] – k2 [P]2 [S1] ([S2]) is the concentration (Moles/litre) of S1 (S2) k1 and k2 are the rate constants V1, the rate of the forward reaction V2, the rate of the backward reaction V, the net rate Molecularity is 1 for each substrate (reactant) of the forward reaction and 2 for the backward reaction 72 Mass-action Kinetics k1 E+S ES k3 E+P k2 73 To be continued…….. 74 75 • Assuming mass law kinetics we can write down a system of ordinary differential equations for the 6 species. • But we don’t know how to solve systems of ordinary (non-linear) differential equations even for dimension 4! • We must resort to numerical integration. 76 Given: 77 Initial values chosen “randomly” 78 Michaelis-Menton Kinetics • Describes the rate of enzyme-mediated reactions in an amalgamated fashion: – Based on mass action law. – Subject to some assumptions • Enzymes – Protein (bio-)catalysts • Catalyst: – A substance that accelerates the rate of a reaction without being used up. – The speed up can be enormous! 79 Enzymes • Substrate binds temporarily to the enzyme. – Lowers the activation energy needed for the reaction. • The rate at which an enzyme works is influenced by: – concentration of the substrate – Temperature • beyond a certain point, the protein can get denatured – Its 3 dimensional structure gets disrupted 80 Enzymes • The rate at which an enzyme works is influenced by: – The presence of inhibitors • molecules that bind to the same site as the substrate (competitive) – prevents the substrate from binding • molecules that bind to some other site of the enzyme but reduces its catalytic power (non-competitive) – pH (the concentration of hydrogen ions in a solution) • affects the 3 dimensional shape 81 Michaelis-Menton Kinetics k1 E+S ES k3 E+P k2 i) A reversible formation of the Enzyme-Substrate complex ES ii) Irreversible release of the product P from the enzyme. This is for a single substrate; no backward reaction; at least negligible if we focus on the initial phase of the reaction. 82 Michaelis-Menten Kinetics 83 Michaelis-Menton Kinetics k1 E+S ES k3 E+P k2 Use mass action law to model each reaction. 84 (1) This is the rate at which P is being produced. Assumption1: [ES] concentration changes much more slowly than those of [S] and [P] (quasisteady-state) We can then write: 85 This simplifies to: (2) 86 Michaelis-Menton Kinetics (1) (2) Define (Michaelis constant) (3) 87 Assumption1: [ES] concentration changes much more slowly than those of [S] and [P] (quasisteady-state) Assumption2: The total enzyme concentration does not change with time. [E0] = [E] + [ES] [E0] - initial concentration 88 Michaelis-Menton Kinetics [ E ] [ E0 ] [ ES ] [ S ]([ E0 ] [ ES ]) K M [ ES ] [ S ][ E0 ] K M [ ES ] [ ES ][ S ] [ S ][ E0 ] [ ES ] [S ] K M 89 Michaelis-Menton Kinetics [ S ][ E0 ] [ ES ] [S ] K M v k3[ ES ] (1) [ S ][ E0 ] v k3 [S ] K M 90 Michaelis-Menton Kinetics Vmax is achieved when all of the enzyme (E0) is substrate-bound. v k3[ ES ] (assumption: [S] >> [E0]) at maximum rate, [ ES ] [ E0 ] Thus, vmax k3[ ES ] k3[ E0 ] 91 Michaelis-Menton Kinetics [ S ][ E0 ] v k3 [S ] K M vmax k3[ E0 ] vmax [ S ] v [S ] K M This is the Michaelis-Menten equation! 92 Michaelis-Menton Kinetics [ S ][ E0 ] v k3 [S ] K M vmax k3[ E0 ] vmax [ S ] v [S ] K M This is the Michaelis-Menten equation! So what? 93 Michaelis-Menton Kinetics vmax [ S ] v [S ] K M Consider the case: vmax v 2 vmax vmax [ S ] 2 [S ] K M [S ] KM 2[S ] K M [S ] The KM of an enzyme is therefore the substrate concentration at which the reaction occurs at half of the maximum rate. 94 Michaelis-Menton Kinetics 95 Michaelis-Menton Kinetics 96 Michaelis-Menton Kinetics • KM is an indicator of the affinity that an enzyme has for a given substrate, and hence the stability of the enzyme-substrate complex. • At low [S], it is the availability of substrate that is the limiting factor. • As more substrate is added there is a rapid increase in the initial rate of the reaction. 97 Modeling Bio-Chemical networks • Enzyme catalyzed reaction (a) E S S+E k1 k2 S.E k3 S.E t1 : k1[S][E] E+P P t3 : k3[S.E] t2 : k2[S.E] (b) E S P t4 : Vmax[S] / (KM + [S]) 98