Statistical Power

advertisement
STAT 3130
Statistical Methods II
Session 6
Statistical Power
STAT3130 – Statistical Power
Ho TRUE
Reject Ho
FTR Ho
Ho FALSE
Type 1 Error (alpha)
Power (1-beta)
1-alpha
Type 2 Error (beta)
When we engage in Hypothesis Testing, we have focused
on the p-value and the alpha value…recall what these
actually mean.
STAT3130 – Statistical Power
Ho TRUE
Reject Ho
FTR Ho
Ho FALSE
Type 1 Error (alpha)
Power (1-beta)
1-alpha
Type 2 Error (beta)
A second calculation of interest is the Power of the test –
or 1- the probability of committing a Type 2 error. In
words, the Power of the test is “The probability of correctly
rejecting the Null Hypothesis when it should be rejected”.
STAT3130 – Statistical Power
In other words, it is the probability that if a true difference exists, it
will be discovered. It is sometimes referred to as the “sensitivity” of
the test.
Statistical Power is heavily used in medicine, clinical psychology
and biology.
Typically, a test must have a Statistical Power of 80% or greater to
be considered valid. Why would that be?
STAT3130 – Statistical Power
Power is a function of three factors:
1.
Effect size – i.e., the difference between the two groups or
measurements. As the effect size goes up, the power increases.
2.
Alpha – As the chance of finding an incorrect significant effect is
reduced (Type I error), the probability of correctly finding an effect
is also reduced. Typically, alpha is set to be .01 (most conservative
and lowers power), .05 or .10 (most risk tolerant and increases
power). As alpha goes down, the power increases.
3.
Sample Size - Increased sample sizes will always produce greater
power. But, increasing the sample size can also produce “too
much” power – smaller and smaller effects will be found to be
significant until at a large enough sample size, any effect is
considered to be significant. As sample size goes up, power
increases.
STAT3130 – Statistical Power
When conducting a one tailed test, the power calculation is
executed as:
1-() = 1-P[z<z- (| 0 - a|/ (s/SQRT(n))]
A two tailed test is conducted similarly, with the alpha value associated
with the z score divided by two.
STAT3130 – Statistical Power
Example…
Lets say that we have a treatment group taking a cholesterol
lowering drug. Lets say that the average decrease in cholesterol
after taking the drug is 2 points. (n=100 and s = 10). What do you
conclude.
1-(2) = 1-P[z<1.645- (| 2|/ (10/SQRT(100))]
Now…what if the drop is 3 points. What happens?
STAT3130 – Statistical Power
Lets build some power curves…
Power versus effect size
Power versus sample size
STAT3130 – Statistical Power
If you are starting an experiment from scratch,
how do you determine the appropriate sample
size?
n= 2*(Z +Z )2/Δ2
A two tailed test is conducted similarly, with the alpha value
associated with the z score divided by two.
(Or…you can just use SAS).
Download