Sample Size Estimation in Clinical Trial

advertisement
The Sample Size Estimation and its SAS Code For Binary Response Endpoints Clinical
Study- A Review
*
1
Singh Rajneesh 1, Namdev Kuldeep Kumar 2 and Dr. Deepak 3
Biostatistician, Fortis Clinical Research Ltd, 2 Pharmacokinetic Scientist, Fortis Clinical
Research Ltd, 3 Head Clinical Operation, Fortis Clinical Trial Clinical Research Ltd.
Abstract:
The aim of this article is to provide basic idea about sample size estimation in clinical study of
binary response endpoints. Every clinical trial must be planned and it describes the objective,
primary and secondary endpoint, method of collecting data, inclusion & exclusion criteria,
sample size with scientific justification, statistical methodology. The calculation of sample size
with sufficient levels of significance and power is an essential part of protocol which needs to be
submitted to drug authorities for approval. In this review emphasis has been made to describe the
different kind of design i.e. equivalence, non-inferiority, superiority and equality and their SAS
codes for sample size calculation in clinical study.
Key: Significance Level (), Power, Margin, Non-inferiority, Superiority.
Introduction:
The most important aspire of a sample size calculation is to determine the number of participants
needed to detect a clinically relevant treatment effect. The determinants of sample size are the
study design and the clinical end point's scale level.
*Correspondence Address: Rajneesh Singh, Biostatistician
Fortis Clinical Research Ltd.,Sunflag Hospital & Research Centre, Sector 16-A Faridabad – 121 002 India
E-Mail:rajneesh.singh@fortis-cro.com
However, information needed to determine sample sizes are power, level of significance; size of
the treatment effect, variation and others factors (e.g. drop-out rate or withdrawn rate). Statistical
study designs are available to achieve objectives. Typical designs that may be employed are
parallel group design, crossover design etc. The description of primary study end point should
cover whether it is discrete or continuous or time-to-event. Sample size is estimated differently
for each of these end points. Sample size is adjusted if primary end point involves multiple
comparisons. The sample size estimation is challenging for complex designs such as noninferiority and/or time to event end points. 1, 2
Statistical Considerations:
1.
Null Hypothesis (H0): The null hypothesis typically corresponds to a general or default
position. For example, the null hypothesis might be that there is no relationship between
two measured phenomena or that a potential treatment has no effect or no difference in the
response exists between treatment and control groups.
2.
Alternative Hypothesis (Ha): The alternative hypothesis is what we are attempting to
demonstrate in an indirect way by the use of our hypothesis test. If the null hypothesis is
rejected, then we accept the alternative hypothesis. If the null hypothesis is not rejected,
then we do not accept the alternative hypothesis or a difference of a specified amount (δ)
exists between treatment and control.
3.
Significance Level (): The Probability of detecting the false effect. The probability of
rejecting H0 given that H0 is true,  = 0.1, .05, .025, .001. However the investigator is
willing to accept a 5% chance of erroneously reporting a significant effect. In "traditional
statistics" a probability of something occurring of less than .05 (= 5% = 1 chance in 20) is
conventionally considered "unlikely".
4.
Power (1 - ): The probability of detecting of true effect. We can also calculate a power
needed to detect a difference of δ as statistically significant, with N participants. The power
of a statistical hypothesis test measures the test's ability to reject the null hypothesis when it
is actually false.
5.
Margin (δ): smallest difference hopes to detect
6.
Test Statistics: Test statistics is nothing but it is function of sample values. For Ex. z =
test statistic, zα = critical value: If | z| > zα: Reject H0.
7.
Others: Missing data elements in retrospective studies, Loss to follow-up in prospective
studies.
Type of Testing:
1. Test for equivalence: This is a trial with the primary objective of showing that the response to
two or more treatments differs by an amount which is clinically unimportant. This is usually
demonstrated by showing that the true treatment difference is likely to lie between a lower and
an upper equivalence margin of clinically acceptable differences.
2. Test for Non-inferiority: To demonstrate that the new drug is as less effective as the
Standard treatment (i.e. the difference between the new treatment and the standard is less than
the smallest clinically meaningful difference). It is not possible to perform a non- inferiority trial
in all situations. The decision to perform non-inferiority trial should be justified considering both
the therapeutic area and the profile of the reference product.
3. Test for Superiority: This is a trial with the primary objective of showing that the response to
the investigational product is superior to a comparative agent (active or placebo control). To
demonstrate that the new treatment is more superior that standard treatment (i.e. the difference
between the new treatment and the standard is greater than the smallest clinically meaningful
difference).
To show the superiority is the gold standard in clinical trial research. Of the three types of trials,
the results of a superiority trial are the simplest regarding interpretation. Due to the difficulty of
launching more powerful drugs on the market, it is not easy to prove superiority.
The Method for Calculation of Sample Size:
Here we illustrating the sample size based on the binary clinical endpoints. The protocol should
provide below information and justification of sample size.
4, 5
1. Statements of the (primary) objectives of the study.
2. Desired level of significance: However the investigator is willing to accept a 5% chance of
erroneously reporting a significant effect.
3. Desired power: Power = 0.80, 0.90, it should be at least 0.80 for design.
4. Assumptions made about the distribution of the outcomes.
5. Summary of test statistic will be used for analysis.
6. Smallest difference (margin): Selection of the margin is based upon a combination of
statistical reasoning and clinical importance
A.
Equivalence Trial:
Defining "Equivalence": Since no two treatments are exactly equal, we must define what
difference in effects would be clinically important. This clinically important effect size (delta)
will differ depending on the condition being studied.
Formula of calculating sample size is
N = [(Zα + Zβ)2 × {(PT (1- PT) + (PR (1- PR))}] / (|PT – PR|- δ)2
B.
Non-inferior Trial:
The methodology for sample size and power calculations is developed based on an exact
unconditional test of non-inferiority. It may be possible to justify a wider non-inferiority margin
for efficacy if the product has an advantage in some other aspect of its profile. 11
N = [(Zα + Zβ)2 × {(PT (1- PT) + (PR (1- PR))}] / (PT - PR- δ)2
For Example δ = - 0.10, Non-inferior margin, based on clinical relevance.
C.
Superior Trial:
A superiority trial is one where we desire to demonstrate that one treatment or intervention is
better than another. The methodology for sample size and power calculations is developed based
on an exact unconditional test of superiority.
N = [(Zα + Zβ)2 × {(PT (1- PT) + (PR (1- PR))}] / (PT - PR- δ)2
For Example δ = 0.10, Superiority margin, based on clinical relevance.
Above formula for sample size determination published in different published articles 3, 4, 7, 8, and 9
Where N = sample size required in each group, PT = proportion of subject cured by Test Drug,
PR = proportion of subject cured by Reference Drug Zα: This depends on level of significance,
for 5% Zβ: This depends on power, for 80%. δ = degree of difference.
Summary: Clinical trials should have sufficient statistical power to detect difference between
groups considered to be of clinical interest. In general, an increase in sample size will be caused
by either decreasing the significance level α, by raising the statistical power 1− β, by demanding
smaller minimum detectable differences δ and by larger variation. A minimum necessary group
size based on this consideration, however, ensures that a mean group difference δ can be detected
at the significance level α with a minimum statistical power of 1− β.6
It is important to investigate the sensitivity of the sample size estimate to a variety of deviations
from these assumptions and this may be facilitated by providing a range of sample sizes
appropriate for a reasonable range of deviations from assumptions 5.
Planning the sample size of a clinical study requires prior information. The type of prior
information depends on the statistical methods which are to be used. If the desired parameters
cannot be estimated, it may be desirable to perform a pilot study in advance, in order to estimate
the appropriate population parameters. The sample size is one of the critical steps in planning a
clinical study and any negligence in its estimation may lead to rejection of an efficacious drug
and an ineffective drug may get approval.2. If there is no sample size planned, this indicates that
the quality of the study is poor.10
Key Messages 10

Sample size planning is an essential step in the performance of clinical studies.

Sample size planning requires the expert knowledge of clinicians or physicians, who
provide an estimate of the relevant effect.

Sample size planning depends on the planned method of statistical evaluation and thus on
the medical question to be answered.

The chances of success in a clinical study and the quality of the research results are
highly dependent on sample size planning.

Sample size planning should always be carried out in collaboration with an expert
statistician or biometrician
SAS Procedure for Estimation of Sample Size: To test the responses of two binary variables,
we used TWOSAMPLEFREQ statement in Proc Power for estimation of sample size. In Proc
Power we have several statements. 13:
Title2 "SAS Program of Sample Size of Non-Inferiority Trial";
Proc Power ;
TwoSampleFreq
Test
= PChi
Sides
=U
Alpha
= 0.05
NullProportionDiff = -.10 /* Margin*/
ProportionDiff
= 0.10 /* Difference of corresponds to responses between group */
RefProportion
= 0.65 /* proportion for Reference group A */
Power
Ntotal
= 0.80
=.
;
Run ;
Title2 "SAS Program of Sample Size of Superiority Trial";
Proc Power ;
TwoSampleFreq
Test
= PChi
Sides
=U
Alpha
= 0.05
NullProportionDiff = .10 /* Margin*/
ProportionDiff
= 0.20 /* Difference of corresponds to responses between group */
RefProportion
= 0.65 /* proportion for Reference group A */
Power
Ntotal
;
Run ;
= 0.80
=.
Table – 1: QC Program- Sample Size of Non-Inferiority Trial
Data Sample ;
MARGIN = -0.10;
PT
= 0.85;
PR
= 0.65;
ZALPHA = 1.64;
ZBETA = 0.84;
POWER = 80 ;
ARM
= 2 ;
DROPOUT = 60 ;
run;
Data SS ;
Set Sample ;
Denom = (PT-PR)- (margin) ;
Denom1 = Denom*Denom ;
Zstatistic = (Zalpha+Zbeta)**2 ;
SS1 = PT*(1-PT)+ PR*(1-PR);
SS2 = Zstatistic* SS1 ;
SS3 = SS2/Denom1 ;
SS = SS3*ARM ;
SSFINAL = (SS*(DROPOUT/100)+SS);
run;
Data Sample ;
MARGIN = 0.10;
PT
= 0.85;
PR
= 0.65;
ZALPHA = 1.64;
ZBETA
= 0.84;
POWER
= 80 ;
ARM
= 2 ;
DROPOUT = 60 ;
run;
Proc Print data = SS ;
Var PT PR MARGIN POWER SS SSFINAL ;
Format SS SSFINAL 10.0 ;
Label PT = 'Efficacy Rate of Test Product' ;
Label PR = 'Efficacy Rate of Reference Product' ;
Label MARGIN = 'Non-Inferiority Margin';
Label POWER = 'Power of Study' ;
Label SS = 'Sample Size Without Drop-out Rate '
;
Label SSFINAL = 'Sample Size Including Drop-out
Rate' ;
run;
Table – 2: QC Program- Sample Size of Superiority Trial
Proc Print data = SS ;
Var PT PR MARGIN POWER SS SSFINAL ;
Format SS SSFINAL 10.0 ;
Label PT = 'Efficacy Rate of Test Product' ;
Label PR = 'Efficacy Rate of Reference Product' ;
Label MARGIN = 'Superiority Margin';
Label POWER = 'Power of Study' ;
Label SS = 'Sample Size Without Drop-out Rate '
;
Label SSFINAL = 'Sample Size Including Drop-out
Rate' ;
run;
Data SS ;
Set Sample ;
Denom = (PT-PR)- (margin) ;
Denom1 = Denom*Denom ;
Zstatistic = (Zalpha+Zbeta)**2 ;
SS1 = PT*(1-PT)+ PR*(1-PR);
SS2 = Zstatistic* SS1 ;
SS3 = SS2/Denom1 ;
SS = SS3*ARM ;
SSFINAL = (SS*(DROPOUT/100)+SS);
run;
Acknowledgement:
We have taken efforts in this research paper. However, it would not have been possible without
the kind support and help of Fortis Clinical Research LTD. I would like to extend my sincere
thanks to all of them.
Conflicting Interest: There is no conflict of interest assign with this article.
Source of Funding: We did not need financial support from any agency to make this article
Ethical Clearance: The article is under review category and any human or animal subject are
not involve in this research hence there is nix function of ethics committee however we have
clearance from internal committee.
References:
1. Shein-Shung C, Jun S, Hansheng W. Sample Size Calculation in Clinical Trial. New York:
Marcel Dekker Inc; 2003. Chapter 1, page 11, section 1.2.3.
2. Tushar Vijay Sakpal Sample Size Estimation in Clinical Trial, Perspect Clin Res , v.1(2);
Apr,Jun 2010
3. Lehana Thabane, Sample Size Determination in Clinical Trials HRM-733 CLass Notes
4. Pamela J. Atherton Skaff, Sloan, Mayo Clinic, Rochester, MN 55905, Design and Analysis of
Equivalence Clinical Trials Via the SAS® System , SAS.
5.
ICH E 9 Statistical Principles for Clinical Trials.
6. KP Suresh, S.Chandrashekara, Sample Size Estimation and Power Analysis for Clinical
Research Study: Journal of Human Reporductive Science/volume 5.
7. Laura Lee Johnson, Ph.D, Sample Size and Power, Presentation, National Center for
Complementary and Alternative Medicine in NIH USA
8. Lihong Qi, Division of Biostatistics, EPP 246 Clinical Biostatistics, Lecture 3, Jan 19, 2007
9. F Krummenauer, B Dick, O Schwenn, and N Pfeiffer, The determination of sample size in
controlled clinical trials in ophthalmology, Br J Ophthalmol. 2002 September; 86(9): 946–947.
10. Bernd Röhrig, Dr. rer. nat., Jean-Baptist du Prel, Dr. med. MPH, Daniel Wachtlin, Dipl.-Kfm.,
Robert Kwiecien, Dr. rer. nat., and Maria Blettner, Prof. Dr. rer. nat., Sample Size Calculation in
Clinical Trials, Dtsch Arztebl Int v.107(31-32); Aug 2010.
11. Committee For Medicinal Products For Human Use: Guideline On The Choice Of The NonInferiority Margin.
12. Emmanuel Lesaffre, Dr.Sc., Superiority, Equivalence, and Non-Inferiority Trials: Bulletin of the
NYU Hospital for Joint Diseases 2008;66(2):150-4
13. Proc Power: The Power Procedure (Experimental) Chapter-56
Download