slides - NCS2014 Non-Clinical Statistics Conference

advertisement
Equivalence margins to assess parallelism between 4PL curves
Perceval Sondag, Bruno Boulanger, Eric Rozet, and Réjane Rousseau
Contact:
perceval.sondag@arlenda.com
Mobile: +32491 22 17 56
Outline
 Bioassay and Relative Potency
 Parallelism Curve Assay
 Four parameters logistic model
 How to compute the Relative Potency?
 Equivalence tests for parallelism
 Frequentist and Bayesian methods VS classical Ref to Ref
comparison to establish equivalence margins
 Method
 Results
 Conclusion
Arlenda © 2014
2
Context
 Relative potency bioassay design:
 The RP is estimated from a concentration or log(concentration)-response
function, as the horizontal difference between sample and standard curves.
 Different functions can be considered according to the kind of response.
 Parallelism between function is required to compute RP !
 We focus here on Parallelism Curve Assay with 4PL curves.
Arlenda © 2014
3
Parallel curve design
Choosing a Non-Linear Model
 The four parameter logistic (4PL) model is a nonlinear function
characterized by 4 parameters:
 a = upper asymptote
b = slope at inflection point
 c = ec50 (inflection point)
d = lower asymptote
0.01
RP
Ref
Test
0.001
Response
𝑑−𝑎
𝑦=𝑎+
𝑥 𝑏
1+
𝑐
1
10
Parallel curve
0.1
1
10
Concentration
Arlenda © 2014
4
Difference tests to assess parallelism
Full
Reduced
 F-Ratio test :
𝑆𝑆𝐸 𝑅𝑒𝑑𝑢𝑐𝑒𝑑 − 𝑆𝑆𝐸 𝐹𝑢𝑙𝑙 /𝑑𝑓1
𝑅𝑆𝑆𝐸𝐹 =
~𝐹𝑑𝑓1,𝑑𝑓2
𝑆𝑆𝐸 𝐹𝑢𝑙𝑙 /𝑑𝑓2
 Chi-Squared test :
𝑅𝑆𝑆𝐸𝜒² = 𝑆𝑆𝐸 𝑅𝑒𝑑𝑢𝑐𝑒𝑑 − 𝑆𝑆𝐸 𝐹𝑢𝑙𝑙 ~𝜒²𝑑𝑓1
Arlenda © 2014
5
Equivalence Approach
 USP 1032 : use historical data to develop equivalence margins
 Comparison of Reference to Reference
 Different methods have been proposed:
 Equivalence with Chi-squared metrics (Rousseau & Boulanger)
 Equivalence of the parameters (Jonkman et al., Yang et al.)
 Derive equivalence margins based on historical data.
 Problem: historical data not always available, and building plates
full of reference product might be very expensive!!
 Objective: find a method to develop a threshold based on few
reference curves.
Arlenda © 2014
6
Method
Qualification plate
Ref to Ref plate
Reference
Sample
Control or empty
Dilution
Dilution
 Fit one curve for 3 replicates (3 rows).
 (Other formats possible).
Arlenda © 2014
7
Simulation material
 8 qualification plates (only 3 reference rows).
 10’000 reference to reference plates
 1 curve with 3 rows  28 possible comparisons per plate.
 Fit model:
 a=3
b = 3.35
Log(c) = 4
d = 0.5
 Standard deviations from 0.2 to 0.8
SD = 0.2
Arlenda © 2014
SD = 0.8
8
Chi² test or F test ?
 Chi² and F metrics test are strongly correlated.
 F doesn’t depend on S  Use F test for straightforwardness.
Arlenda © 2014
9
Comparisons to be done
 Distribution of 𝑅𝑆𝑆𝐸𝐹 using:
 8 reference plates (very expensive).
 8 qualification plates using Frequentist method based on bootstrap.
 8 qualification plates using Bayesian method.
 10’000 reference plates.
 Power comparison
 Progressively spread the upper asymptote of test product from upper
asymptote of reference product.
 At each “delta from parallelism” level, perform 1000 parallelism tests.
 Compare the probability of rejection of each method.
Arlenda © 2014
10
Simulation of reference curves (1)
 Frequentist method:
 Fit one curve by plate.
 Bootstrap on residuals to simulate large amount of reference rows for
each plate.
 Randomly draw 6 rows (2 curves) and compute parallelism metrics.
Repeat operation a large amount of time.
 Combine computed parallelism metrics from each plate.
 Get the 95th percentile of the obtained distribution.
 Repeat all above several times to get the distribution of the percentile
 Working within each plate separately allows not to take into
account the plate to plate variability (which is sometimes huge).
Arlenda © 2014
11
Simulation of reference curves (2)
 Bayesian method:
 Non-linear mixed effect model with random plate effect on one or several
parameters (select best model).
 Use non-informative prior distributions. Software Stan allows to use
uniform prior on every parameter.
 Plate to plate variability can be ignored while simulating curves as it has
no effect on the parallelism metrics when comparing two curves from
the same plate.
 From posterior chains of parameters, draw 6 rows (2 curves) and
compute parallelism metrics. Repeat operation a large amount of time.
 Simulate large amount of reference curves based on posterior chains of
parameters
 Get the 95th percentile of the obtained distribution.
 Repeat all above several times to get the distribution of the percentile
Arlenda © 2014
12
Distribution of the F ratio
sd = 0.2
0.8
Equivalence 10000 plates
Bootstrap estimation
0.4
Bayesian estimation
•
This pictures present the densities of
the F ratios computed with each
methods for sd = 0.2 and sd = 0.8
•
Black vertical line is the “true p95”
(95th percentile of F ratio computed
with 10’000 equivalence plates)
•
Small curves are the distribution of
p95 for 8 equivalence plates, bootstrap
and Bayesian approximation
0.0
0.2
Density
0.6
Equivalence 8 plates
0
1
2
3
4
F ratio
0.6
0.4
0.0
0.2
Density
0.8
1.0
sd = 0.8
0
1
2
3
4
F ratio
Arlenda © 2014
13
Power by threshold
These pictures present the probability
of rejection as a function of the
departure from parallelism.
•
Median 95th percentile has been used
as a threshold for Bayesian and
bootstrap method for simulation of
reference curve.
•
For residual error = 0.2, only one
curve can be seen as the three curves
are exactly confounded.
•
For residual error = 0.8, Bayesian
curve is exactly confounded with “true
curve” (power of threshold computed
with
10’000
reference
plates),
bootstrap method gives very similar
results.
0.2
0.4
0.6
0.8
•
Equiv 10000
Bootstrap
Bayes
0.0
Probability of rejection
1.0
Residual Error = 0.2
0.0
0.5
1.0
1.5
2.0
2.5
Delta from parallelism
0.8
0.6
0.4
0.2
Equiv 10000
Bootstrap
Bayes
0.0
Probability of rejection
1.0
Residual Error = 0.8
0.0
0.5
1.0
1.5
2.0
2.5
Delta from parallelism
Arlenda © 2014
14
Conclusions
 With only 8 plates with three rows of reference product, both
Bootstrap and Bayesian methods give results that are
comparable to the use of 10’000 plates with 8 rows of reference.
 These results are both better and much cheaper (in money and
time) than building 8 plates full of reference product.
 When the residual variability is high, simulation of reference
curves using Bayesian method continues to give same results as
using 10’000 reference plates.
 ! Here, we consider homoscedasticity, that’s usually not true,
weighting to apply.
Arlenda © 2014
15
Ongoing next steps
 Define equivalence margins on joint posterior distribution of
parameters based on simulated curves.
 Perform a power analysis to compare all methods together.
 Paper in preparation.
Arlenda © 2014
16
Main references
 Gottschalk, Paul G., and John R. Dunn. "Measuring parallelism,
linearity, and relative potency in bioassay and immunoassay
data." Journal of Biopharmaceutical Statistics 15, no. 3 (2005):
437-463.
 USP <1032>, “Design and Development of Biological Assays”,
2010.
 Rousseau, Réjane and Boulanger, Bruno. “How to develop and
assess the parallelism in a bioassays: a fit-for-purpose strategy.”
Lecture, European Bioanalysis Forum, Barcelona, 2013.
 Yang, Harry et al., “Implementation of Parallelism Testing for
Four-Parameter Logistic Model in Bioassays”, PDA J Pharm Sci
and Tech, 66(2012): 262-296
Arlenda © 2014
17
Download