Work Part II

advertisement
PROBLEM SET 3
SIMULATION OF BIOCHEMICAL KINETICS
Part I: The first goal of Part I of this exercise is to compare two approaches to the
simulation of biochemical reaction kinetics and to see how error may propagate is such
system. The Michaelis-Menten enzyme reaction will constitute the experimental system.
The reaction mechanism for this system is:
k1
k3
[E] + [S] ===== [ES] ===== [E] +[P]
k2
k4
where [E] is an enzyme concentration, [S] is a substrate concentration, [P] is the product
concentration, [ES] is the concentration of the enzyme-substrate complex, and k1, k2, k3,
and k4, are rate constants.
The two approaches to simulation are based on either analytical or numerical
solution of rate equations. The analytical approach for a simple Michaelis-Menten
reaction system is based on a pseudo steady-state assumption and, strictly speaking, is
only valid during initial phase of the reaction. The derivation of the analytical solution
depends on solving the change rules for each of the four reactants:
1. d[E]/dt = [ES] (k2+k3) - k1[E][S] - k4[E][P]
2. d[S]/dt = k2[ES] - k1[E][S]
3. d[ES]/dt = k1[E][S] + k4[E][P] - [ES](k2 + k3)
4. d[P]/dt = k3[ES] - k4[E][P]
Assuming that equation 3 is zero (i.e. that the enzyme substrate complex is at steady
state), the change rules (eqs. 1 to 4 ) may by simplified to:
V 
Vm *[ S ]
, (5)
Km  [ S ]
where Vm = k3*ET, ET=[E] + [ES], and Km = (k2 + k3)/k1. This particular
simplification also requires that either k4 = 0 or [P] is zero or very near zero (as it would
be during the initial part of the reaction).
The numerical solution, of course, solves the change rules explicitly using some
numerical solution procedure (e.g. Euler or Runge-Kutta). In theory, it yields the time
course of any single reaction, but it both is slower than the pseudo steady-state analysis
and also is subject to numerical errors. Your work in this exercise is to explore the
differences and compatibility of these two approaches.
1
Part II: The second goal of this exercise is to compare various approaches to
estimation of parameters of the Michaelis-Menten model of enzyme kinetics. The
approaches that you will examine here are in two classes. The first, and the most
traditional class, is based on transformations of the pseudo steady-state rate law used in
Part I. The second class of approach uses the time course data, which you generate in
Part I, with the numerical solution of rate equations. Methods of estimating parameters
are described below.
1. Methods of Parameter Estimation
1.a. The Michaelis-Menten model specifies a relationship between a substrate
concentration and the reaction rate:
Vm * S
(6)
Km + S
V =
where parameters are defined in Part I. The two parameters in this model (Km and Vm)
are often measured graphically from plots of various linearizations of equation 6.
Three possible linearizations of equation 1 are:
1.
1
1
Km
Lineweaver-Burke


V Vm Vm * S
2.
S Km S


V Vm Vm
3. V  Vm 
Hanes
Km *V
S
Eddie-Hofstee
These three are the most popular in the biochemical literature. These are not only
convenient for plotting the data, but they are also compatible with simple linear
regression. The parameters can thus be estimated by least squares procedures.
For the simple regression model:
y = bo + b1* x
parameters and variables for each transformation are:
Transformation
1
2
3
y
x
bo
intercept
1/V
S/V
V
1/S
S
V/S
1/Vm
Km/Vm
Vm
2
b1
slope
Km/Vm
1/Vm
-Km
Part III: The purpose of this part III of the exercise is to explore ways of improving
precision and accuracy of parameter estimation. Two ways considered here are
alternatives regression procedures and variation of experimental design.
1. Weighted Linear Regression and a Non-Linear Regression Procedures.
The results of Part I should have convinced you that methods of estimating
Michael-Menten parameters vary considerably in accuracy. One way to improve
accuracy of parameter estimation is to use alternative regression procedures. You will try
two alternatives in this exercise: a weighted linear regression procedure and a non-linear
regression procedure. These procedures are contained in two files, WTREG.XLS and
NOLINEAR.XLS, on the Biology 315 Homepage.
1.a. The weighted linear regression procedure, WTREG.XLS, allows you to select one of
two transformations. Run for transformations 1 and 2, on the three data set for the three
VE/VP error ratios (0.1, 1, 10).
1.b. The non-linear regression program, NOLINEAR.XLS, uses a solver tool in EXCEL.
You will have to import your eight S,V pairs for each replicate. It should be run for all
three VE/VP error ratios files.
2. Design of Experiments:
The data sets generated in Part II were based on an implicit experimental design.
Eight substrate levels were selected and were evenly distributed over a range 5 to 40 mM.
At each substrate level, furthermore, the velocity measurement was replicate.
Experimental design, therefore, consisted of the range of substrate concentration, spacing
of substrate levels over the range, and the number of replicates at each substrate level. In
this Part III, you are to explore the influence of experimental design on accuracy and
precision of the parameters estimates by varying range and spacing of substrate levels.
Work Part I :
Ia. For the analytical model of a simple biochemical reaction system, you must write a
program that uses equation 5 to simulate the relation between initial reaction velocity and
substrate concentration. Your program must generate at least 20 pairs of substrate,
velocity data over the substrate range 0 to 40 mM. Assume Km is 21 mM and Vm is 0.01
mM/min. Create a X-Y plot of velocity vs substrate.
Ib. To compare numerical approaches to the same basic reaction system, substitute the
change rules in equations 1 to 4 into the Lotka-Volterra Population model that you
developed for problem set 2. Be sure to evaluate all change rules before updating
concentrations. Assume the following rate constants: k1=0.005, k2=0.005, k3=0.1 and
k4=0, the initial values of [P] and [ES] should be zero, and the value of ET should be 1.0.
Allow [S] to vary in each simulation. Next, run the simulation model using these new
parameter estimates for three starting substrate concentrations (0.5 Km, Km, and 1.5
Km). for each simulation, you must save at least 20 set of data (using a time step of 1
3
minute). Your output must include the concentration of the four reactants and the
velocity of the reaction (d[P]/dt). Generate five separate x-y plots for reaction velocity
(d[P]/dt), [P], [ES], [E], and [S].
Ic. The final part of this part I of the Problem Set 3 requires modification of your previous
program to a Monte-Carlo form. Error will be assumed to be N(0,s^2), where s is the
standard deviation of the error, and it will affect the initial enzyme concentration and/or
the measured reaction product. Assuming a 10% error on product (i.e. s/P=0.1) and 5%
error on starting enzyme concentration (i.e. s/Vm=0.05), obtain 10 replicate sets of t, Pt
for a starting substrate concentration equal to the Km of the reaction. The reactions
should run long enough to deplete at least 20% of the initial substrate concentration, and
you should have at least 20 pairs of t, Pt for each of the 10 replicates. Produce on scatter
plot of P versus t.
Work Part II
To explore errors you will take advantage of Monte-Carlo simulations to generate
hypothetical data sets from known models and parameters. Based on the error sources
discussed in part 3 of Problem Set 3 Part I, equation 5 becomes:
V
Vm * N 1,VE  * S
Km  S
 N  0,VP (7)
where N(,2) is a Normally distributed random variable with mean  and variance 2,
VE is the variance of the initial enzyme concentration, and VP is the variance of the
product.
You must write a program, that will generate pairs of substrate and observed velocity.
Remember to use a Z-distributed random variable for the error terms in equation 4.
r = 2*RND-1
Z = log{(1+r)/(1-r)}/1.82
IIa. Precision and Accuracy using Method 1.a. Create 12 different data sets containing
<S,V> data pairs. Initial substrate concentrations should be {5, 10, 15, 20, 25, 30, 35, and
40 mM}, Km = 21 mM, and Vm = 0.1 mM/min. The 12 data sets should be three
replicates of four different ratios of VE/VP (0, 0.1, 1.0, and 10.0). Where for VE/VP= 0,
VE=0 and VP=0. In last three cases, total variance (VE + VP) should be 0.012. Next,
obtain estimates of Vm and Km by linear regression with the three transformations.
Finally, for each of the ratios of VE/VP obtain mean and variance or standard deviation of
Km and Vm from the three replicates.
4
Work Part III
IIIa Modify your module in Part II to generate nine new data sets containing substrate and
velocity data pairs. Assume that VE/VP = 1 and VE+VP = 0.012 for all cases. These
new data sets will contain three replicates according to the following schedule:
Replicates
1 to 3
4 to 6
7 to 9
Substrate Concentrations
2.5, 5.0, 10.0, 20.0, 40.0
4.3, 9.1, 13.8, 18.6, 23.3
4.3, 6.6, 10.0, 15.3, 23.3
IIIb Obtain estimates of Km and Vm for transformation 1, 2 and 3 using the linear
regression procedures you employed in Problem Set 3 Part II. From the three replicates,
calculate the mean and variance or standard deviation of Km and Vm for each of the three
experimental designs and transformations.
IIIc. Using WTREG.XLS also obtain estimates of Km and Vm for transformation 1 and
2. Including all three replicates.
IIId. Finally, use solver (from NOLINEAR.XLS template) to obtain 3 replicate estimates
of Km and Vm from each of the experimental designs. From the three replicates,
calculate the mean and variance or standard deviation of Km and Vm each of the three
experimental designs and transformations. Prepare a table showing values of Km and
Vm obtained from these analyses.
SUBMIT:
Part I:
1) An excel file with parts Ia to Ic, including the 7 x-y plots (15 points)
2) Your discussion should include (15 points):
a. Discuss of all your results from Part I
b. Suggest a procedure by which to compare output
from models for Ia and Ib above;
c. Using this procedure, compare the output at the three
substrate levels in Ib, and you comment on three differences
d. Suggest a way to incorporate error observed in Ic into
a model like the analytical model in Part Ia.
Part II:
1 ) a) Your program (10 points)
b) For the four ratios of VE/VP and three transformations prepare a table showing
mean and variance or standard deviation of Km and Vm of the replicates of each
5
ratio/transformation combinations
2 ) Discussion addressing the following questions (15 points):
a) Based on the analysis in IIa, which of the transformations yields better estimates?
Does the ratio of VE/VP affect the relative performance of the various
transformations?
PART III:
1) Provide a grand summary table that summarizes the results of Part IIa and IIIb, IIIc
and IIId). The table should include Km, variance of Km, Vm and variance or standard
deviation of Vm where appropriate (10 points).
2) Discussion of the effect of experimental design and methods of parameter estimation
on the accuracy and precision of estimated parameters (15 points).
6
Download