ANOVA STEP-BY-STEP The Sums of Squares: 1) Computing SStotal: The SStotal is the SS based on the entire set of scores in the study. So computing this SS is the same as if you just stacked your different treatment samples together to form a single sample and then computed the Sum of 2 SS total = X 2 G N Squares on that one larger sample. In terms of our new notation: So for the Alcohol data, you first need to make X2 columns for each of the treatment levels, then add up the columns and then add the three ΣX2 's together. Then you subtract G2/N: 1oz 0 1 0 2 1 4 X2 0 1 0 4 1 6 3oz 2 3 0 3 1 9 X2 4 9 0 9 1 23 5oz 4 6 3 2 3 18 X2 16 36 9 4 9 74 k = 3, n = 5, N = 15, T1 = 4, T2 = 9, and T3 = 18, and G = 31 ΣX2 = sum of the ΣX2 for each level of the factor = 6 + 23 + 74 = 103 So SStotal = 103 - 312 = 103 - 961/15 = 38.933 15 2) Computing SSwithin: To compute SSwithin you first need to compute the SS within each level of the IV using the regular SS formula: ( X) SS = X n 2 2 Then to get SSwithin you simply add up the SS from within each level of the IV: SSwithin = ΣSS PSY295-001 Spring 2003 1 of 4 ANOVA STEP-BY-STEP So for the Alcohol data there are three SS you need to compute first, one for each level: SS1 = 6 - 42/5 = 2.800 SS2 = 23 - 92/5 = 6.800 SS3 = 74 - 182/5 = 9.200 Then you add these three SS up to get SSwithin: SSwithin = 2.800 + 6.800 + 9.200 = 18.800 3) Computing SSbetween: Recall that the variance between treatments measures the differences or variance between the treatment means. This implies one way we could find the SSbetween would be to compute a SS using the X - 's as the scores. That is, we could consider our deviations (that we will square and sum) as the deviation of each individual mean from the grand mean (the grand mean is the over all mean of the entire set of data or G/N). Of course, there is a computational formula that looks different from that, but is much easier to use: 2 2 G T SS betw = [ ] n N So for the Alcohol data, SSbetween = (42/5 + 92/5 + 182/5) - 312/15 = 84.2 - 64.067 = 20.133 Note: You should Always check to see if: SStotal = SSbetween + SSwithin If this check does not come out right - you have made a mistake in your calculations. So for the Alcohol data: Does 20.133 + 18.8 = SStotal? Yes 38.933 = 38.933. PSY295-001 Spring 2003 2 of 4 ANOVA STEP-BY-STEP In computing the degrees of freedom, you should keep in mind that: 1) Each df is associated with a specific SS 2) The df are approximately equal to the number of items that went into computing the corresponding SS minus 1. So if n things went in, then df = n - 1. Computing the df: 1) dftotal = N - 1 Because SStotal was computed using the entire set of N scores. So for the Alcohol study, dftotal = 15 - 1 = 14 2) dfwithin = N - k To get the SSwithin we first computed the SS for each level and then added them up. This is the same for dfwithin in a sense. For each level we have "n - 1" degrees of freedom. Then we sum those n - 1 degrees of freedom across the levels: (n - 1) + (n - 1) + (n - 1) + ... If you simplify this, you get N - k which is the right number for the dfwithin. So for the Alcohol study, dfwithin = 15 - 3 = 12. 3) dfbetween = k - 1 Because SSbetween is really based on the deviations of each treatment mean from the grand mean, the number of items in this SS is the number of treatment means = k. So the dfbetween = k - 1. So for the Alcohol study, dfbetween = 3 - 1 = 2. Note: You should Always check to see if: dftotal = dfbetween + dfwithin If this check does not come out right - you have made a mistake in your calculations. For the Alcohol study: does 2 + 12 = dftotal? Yes 14 = 14! So now we have the 3 SS and the corresponding 3 df. What we need now is to compute the variances. PSY295-001 Spring 2003 3 of 4 ANOVA STEP-BY-STEP Computing the between and within variances: Recall that a variance is SS/df. In Anova the variances we compute are called Mean Squares, symbolized "MS" (Because they are essentially mean squared deviations) So we can compute: MSbetween = SSbetween dfbetween = the variance between treatments and MSwithin = SSwithin dfwithin = the variance within treatments So for the Alcohol study, MSbetween = 20.133/2 = 10.067 MSwithin = 18.800/12 = 1.567 Note: In general we do not compute MStotal. Also, it is NOT TRUE that MStotal = MSbetween + MSwithin. Finally, because the F test is the variance between divided by the variance within, we get our F-ratio: F = MSbetween MSwithin So for the Alcohol study, F = 10.067/1.567 = 6.426. Finally, you should always present what is called an Anova Summary Table that contains the results of all of these calculations. It should look like: Source SS df MS F _____________________________________________________________ Between 20.133 2 10.067 6.426 Within 18.800 12 1.567 Total 38.933 14 _____________________________________________________________ PSY295-001 Spring 2003 4 of 4