1 1. Statistical significance and decision levels (continue discussion from last week). Z scores, t values and F values. Sampling distributions for the null hypothesis (assumes no difference – calculate test statistic based on sample size and number size): Type 1 error: p<.05 (alpha level) Type 2 error: Language for talking about the null hypothesis. Truth for population Do not reject null hypothesis Reject null hypothesis Null is true Null is not true Try exercise from handout (equivalent to homework problem). 2. Effect sizes: R-squared, r and Cohen's D Formula for Cohen’s D (way to represent differences between two groups as opposed to relationships between two scores). Large Medium Small For a small effect size, .01, The change in success rate is from 46% to 54% For a medium effect size, .06, The change in success rate is from 38% to 62%. For a large effect size, .16, The change in success rate is from 30% to 70% Some similar examples. Predictor Outcome R-squared r 2 Vietnam veteran status Testosterone AZT Psychotherapy Alcohol abuse Juvenile delinquency Death improvement Meta-analysis: 3. Statistical power: probability that study will detect relationship if research hypothesis is correct. Required sample size for 80% statistical power (for p <.05) r=.10 r=.30 Two tailed One tailed Power: Power = 1 - type 2 error Power = 1 - beta How increase power? 1) 2) 3) 4) What determines power? 1. 2. 3. What is adequate power? How do you know how much power you have? r=.50 3 Two ways to use power: 1. 2. Study feature Practical way of raising power Disadvantages Sample Size Conclusion Predicted difference Standard deviation Standard deviation Sample size Significant level One tailed vs. two tailed test Outcome statistically significant Yes Yes No No 4 An example (after Shapiro & Shapiro, 1983). Is psychotherapy effective? Therapy target Anxiety & depression Phobias Physical and habit problems Social and sexual problems Performance anxieites Number of studies 30 76 106 76 126 Cohen’s D .67 .88 .85 .75 .71 r .31 .54 .52 .43 .37 R2 9.6% 29% 27% 18% 14% If you think that the effect is small (.01), medium, (.06) or large (.15), and you want to find a statistically significant difference defined as p<.05, this table shows you how many participants you need for different levels of “sensitivity” or power. Power -> Effect size | .01 .06 .15 .10 .20 .30 .40 .50 21 5 3 53 10 5 83 14 6 113 144 19 24 8 10 .60 .70 179 219 30 36 12 14 .80 .90 271 354 44 57 17 22 If you think that the effect is small (.01), medium, (.06) or large (.15), and you want to find a statistically significant difference defined as p<.01, this table shows you how many participants you need for different levels of “sensitivity” or power. Power -> Effect size | .01 .06 .15 Reject null hypothesis Do not reject null hypothesis .10 .20 .30 .40 70 13 6 116 20 8 156 26 11 194 232 32 38 13 15 Null hypothesis is false Merit pay works and we know it We decided merit pay does not work but it does. .50 .60 .70 274 323 45 53 18 20 .80 .90 385 478 62 77 24 29 Null hypothesis is true We decided merit pay worked, but it doesn’t. Merit pay does not work and we know it.