MANOVA Multivariate Analysis of Variance Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Michael J. Kalsher Department of Cognitive Science PSYC 4310 Advanced Experimental Methods and Statistics Statistical Analysis of Data © 2012, Michael Kalsher 1 of 42 38 MANOVA: What is it? Used to determine main and interaction effects of categorical variables on multiple DVs. ANOVA tests the differences in means of a single DV for two or more categories of IVs. MANOVA tests whether the vectors of means for the two or more groups are sampled from the same sampling distribution. In other words, MANOVA gives a measure of the overall likelihood of picking two or more random vectors of means out of the same hat. Purposes for MANOVA • To compare groups formed by categorical IVs on group differences in a set of interval-level DVs. • To use lack of difference for a set of DVs as a criterion for reducing a set of IVs to a smaller, more easily modeled number of variables. • To identify the IVs which differentiate a set of DVs the most. Statistical Analysis of Data 2 of 42 38 Why Use MANOVA? Advantages: • Improves chances of discovering changes as a result of different variables and their interactions. • Protects against inflated Type I error due to multiple tests of correlated dependent variables. • Can detect whether groups differ along a combination of variables (i.e., a variate), whereas ANOVA can detect only if groups differ along a single variable. Disadvantages: • More complicated and less powerful than ANOVA. • Analysis often ambiguous in interpretation of the effects of IVs on any single DV. Statistical Analysis of Data 3 of 42 38 MANOVA: Parts of the Analysis Main Analysis Four commonly used ways of assessing the overall significance of a MANOVA: Pillai’s trace (V); Hotelling’s trace, Wilks’s lambda (), and Roy’s largest root. • Debate over which method is best in terms of power and sample size considerations. • Rules of thumb: If group differences are concentrated on the first variate, Roy’s statistic most powerful followed by Hotelling’s trace, Wilks’s lambda and Pillai’s trace. When groups differ along more than one variate, the power ordering is reversed. Approaches to Follow-up Analysis • Perform separate ANOVAs for each DV (controlling for Type I error). • Transform linear combinations of DVs to z scores, add them together, then evaluate the combined scores using ANOVA. • Use discriminant function analysis (DFA) (yields one or more uncorrelated linear combinations of DVs that maximize differences among the groups) Statistical Analysis of Data 4 of 42 38 MANOVA test statistics: A Comparison Statistical Analysis of Data 5 of 42 38 Pillai-Bartlett trace (V) (Pillai’s trace) Given by the equation below in which represents the eigenvalues for each of the discriminant variates, and s represents the number of variates. Pillai’s trace is the sum of the proportion of explained variance on the discriminant functions and is similar to the ratio of SSM/SST s V= i=1 Statistical Analysis of Data i 1 + i 6 of 42 38 Hotelling’s trace The Hotelling-Lawley trace is the sum of the eigenvalues for each variate and is computed by the equation below. This test statistic is the sum of SSM/SSR for each of the variates and so it compares directly to the F ratio in ANOVA. s T= i i=1 Statistical Analysis of Data 7 of 42 38 Wilks’s lambda () Wilks’ lambda is the product of the unexplained variance on each of the variates. The symbol is similar to the summation symbol () except that it means multiply rather than add up. Wilks’s lambda represents the ratio of error variance to total variance (SSR/SST) for each variate. = Statistical Analysis of Data s 1 i=1 1 + i 8 of 42 38 Roy’s largest root Roy’s largest root is the eigenvalue for the first variate. In a sense, it is the same as the Hotelling-Lawley trace, except for the first variate only. This statistic represents the proportion of explained variance to unexplained variance (SSM/SSR) for the first discriminant function. This value is conceptually the same as the F-ratio in univariate ANOVA and represents the maximum possible between-group difference given the data collected. Largest root = largest Statistical Analysis of Data 9 of 42 38 MANOVA: Assumptions Independence: Observations should be statistically independent. Random Sampling: Data should be score level and randomly sampled from the population of interest. Multivariate Normality: In ANOVA, we assume that our DV is normally distributed within each group. In MANOVA, we assume that the DVs (collectively) have multivariate normality within groups (cannot be tested directly by SPSS). Homogeneity of Covariance Matrices: In ANOVA, it is assumed that the variances in each group are roughly equal (homogeneity of variance). In MANOVA, we assume this is true for each DV, but also that the correlation between any two DVs is the same in all groups. As a preliminary test, Levene’s test should not be significant for any of the DVs. Since Levene’s test doesn’t take account of the covariances, Box’s M test should be used to test whether the population variance-covariance matrices of the different groups in the analysis are equal Note: The F test from Box’s M statistics should be interpreted cautiously in that a significant result may be due to violation of the multivariate normality assumption for the Box’s M test, and a non-significant result may be due to a lack of power. Statistical Analysis of Data 10 of 42 38 Effect Size Statistics for MANOVA The multivariate GLM procedure computes a multivariate effect size index. The multivariate effect size associated with Wilks’s lambda () is the multivariate eta square. Multivariate 2 = 1 - 1 s Here, s is equal to the number of levels of the factor minus 1 or the number of DVs, whichever is smaller. • This statistic should be interpreted similar to a univariate eta square and ranges in value from 0 to 1. • A 0 represents no relationship between the factor and the DV, while a 1 indicates the strongest possible relationship. • Unclear what should be considered a small, medium, and large effect size for this statistic. Statistical Analysis of Data 11 of 42 38 Controversies and MANOVA Ability of MANOVA to detect true effects – Ramsey (1982): As correlation between DVs increases, power of MANOVA decreases. – Tabachnick & Fidell (2001): MANOVA works best with highly negatively correlated DVs and acceptably well with moderately correlated DVs in either direction; MANOVA wasteful when DVs are uncorrelated. – Cole et al. (1994): Power of MANOVA depends on combination of the correlation between DVs and effect size. • Expecting large effect: MANOVA most powerful if the measures are somewhat different and if the group differences are in the same direction for each measure. • If 2 DVs differ in terms of group differences (one large, one small), then power increased if DVs are highly correlated. Statistical Analysis of Data 12 of 42 38 The Matrix: Revisited The MANOVA test statistic is derived by comparing the ratio of a matrix representing the systematic variance of all DVs to a matrix representing the unsystematic variance of all DVs Off-diagonal components 5 3 6 10 3 12 4 6 6 4 2 7 10 6 7 6 Square Matrix 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 Identity Matrix Statistical Analysis of Data 2 6 4 8 Row Vector Single person’s score on four different variables Diagonal components 8 6 Column Vector 10 Four participants’ score on one variable 15 13 of 42 38 Partitioning the Variance SST = Sums of Squares Total SSm = Sums of Squares Model (Systematic Variance) SST SSR = Sums of Squares Error (Unexplained Variance) MANOVA Test Statistic Sum of squares and cross-products matrices SSR SSM Systematic Variance H Hypothesis sum of squares and cross-product matrix or hypothesis SSCP Unsystematic Variance E Error sum of squares and cross-products matrix or error SSCP Total Variance T Total sum of squares and cross-products matrix or total SSCP Cross-product deviations represent a total value for the combined error between two variables, so in some sense, they represent an unstandardized estimate of the total correlation between two variables. Statistical Analysis of Data 14 of 42 38 MANOVA: Performing MANOVA Using SPSS Statistical Analysis of Data 15 of 42 38 Move “Social Dimension Summed Scale” and “Pragmatic Dimension Summed Scale” to the Dependent Variables box. Move “Gender (q047_r01)” to the Fixed Factors box. Then, click “Options”. Statistical Analysis of Data 16 of 42 38 Move “Gender (q047_r01” to the Display Means box. Click “Descriptive statistics”, “Estimates of effect size”, and “Homogeneity tests” in the Display box. Click “Continue” Statistical Analysis of Data 17 of 42 38 Statistical Analysis of Data 18 of 42 38 SPSS Output Statistical Analysis of Data 19 of 42 38 SPSS Output: Main Analysis Statistical Analysis of Data 20 of 42 38 SPSS Output: Univariate ANOVAs Statistical Analysis of Data 21 of 42 38 Results Section A one-way MANOVA on the two dependent variables, the summed scales for the Social and Pragmatic dimensions, was significant for gender, Wilks’s = .995, F(2,1340) = 3.289, p<.05. Table 1 contains the means and the standard deviations on the dependent variables for males and females. Analyses of variance (ANOVAs) on each dependent variable were conducted as follow-up tests to the MANOVA. The ANOVA on the Social Dimension summed scale scores was significant, F(1,1341) = 6.29, p<.05, 2 =.005. Males had significantly higher scores (M=16.66, SE=.31) than females (M=15.59, SE=.297). The ANOVA on the Pragmatic Dimension summed scale scores was not significant, F(1,1341) = .072, p>.05. Statistical Analysis of Data 22 of 42 38 Practice Problem: Effects of study strategies on learning A researcher investigates the effectiveness of different study strategies on learning. Thirty undergraduates are randomly assigned to one of three study conditions. All participants receive the same set of study questions, but each group receives different instructions about how to study. The write group is instructed to write responses to each question, the think group is instructed to think about answers to the questions, and the talk group is instructed to develop a talk that they could deliver centering on the answers to the questions. At the completion of the study session, all students take a quiz consisting of two types of questions: recall and application. Statistical Analysis of Data 23 of 42 38 Data Set: Subject Group Effects of study strategies on learning Application Recall Subject Group Application Recall 1 1 3 1 16 2 5 7 2 1 4 4 17 2 5 4 3 1 4 3 18 2 5 5 4 1 4 5 19 2 8 7 5 1 2 3 20 2 6 5 6 1 3 2 21 3 4 3 7 1 4 3 22 3 6 6 8 1 3 3 23 3 4 4 9 1 3 5 24 3 4 3 10 1 3 3 25 3 5 6 11 2 6 7 26 3 5 5 12 2 7 4 27 3 5 5 13 2 5 6 28 3 2 4 14 2 6 3 29 3 3 3 15 2 5 2 30 3 4 5 Statistical Analysis of Data 24 of 42 38 Statistical Analysis of Data 25 of 42 38 Move “recall” and “applicat” to the Dependent Variables box. Move “Study Strategies” to the Fixed Factors box. Then, click “Options”. Statistical Analysis of Data 26 of 42 38 Move “group” to the Display Means box. Click “Descriptive statistics”, “Estimates of effect size”, and “Homogeneity tests” in the Display box. Click “Continue” Statistical Analysis of Data 27 of 42 38 Select “Post Hoc” from the Multivariate Screen Move “group” to the Post Hoc Test for box. Select “Bonferroni” Select “Games Howell” Click “Continue” Click “OK” Statistical Analysis of Data 28 of 42 38 SPSS Output for MANOVA Statistical Analysis of Data 29 of 42 38 SPSS Output for MANOVA Statistical Analysis of Data 30 of 42 38 SPSS Output: Univariate ANOVAs Statistical Analysis of Data 31 of 42 38 SPSS Output for Post-Hoc Tests Statistical Analysis of Data 32 of 42 38 Results Section A one-way Multivariate analysis of variance (MANOVA) was conducted to determine the effect of the three types of study strategies (thinking, writing, and talking) on the two dependent variables, the recall and the application test scores. Significant differences were found among the three study strategies on the dependent measures, Wilks’s = .42, F(4,52) = 7.03, p<.01. The multivariate 2 based on Wilks’s was .35. Table 1 contains the means and the standard deviations on the dependent variables for the three groups. Statistical Analysis of Data 33 of 42 38 Results Section - continued Univariate ANOVAs on each dependent variable were conducted as follow-up tests to the MANOVA. The ANOVA on the recall scores was significant, F(2,27) = 17.11, p<.01, 2 =.56, as was the ANOVA on the application scores, F(2,27) = 4.20, p = .026, 2 =.24. Post-hoc analyses to the univariate ANOVA for the recall and application scores consisted of conducting pair-wise comparisons to find which study strategy affected performance most strongly. With respect to the recall scores, the writing group produced significantly superior performance on the recall questions in comparison with either of the other two groups (ps<05). The thinking and talking groups were not significantly different from each other (p>.05). With respect to the application scores, the writing group produced significantly better performance on the application questions than the thinking group (p<.05), but no other comparisons were significant (ps>.05). Statistical Analysis of Data 34 of 42 38 One More Step … Following up MANOVA with Discriminant Analysis The practice of conducting ANOVAs as follow-up tests to significant MANOVA has been criticized because univariate ANOVAs do not take into account the multivariate nature of MANOVA An alternative is to conduct follow-up analyses using discriminant function analysis (DFA). DFA can be used after MANOVA to see how the dependent variables discriminate the groups. DFA yields one or more uncorrelated linear combinations of dependent variables (variates) that maximize differences among the groups. Statistical Analysis of Data 35 of 42 38 Discriminant Analysis: Statistical Analysis of Data Step by Step 36 of 42 38 Discriminant Analysis: Step by Step Predictors Statistical Analysis of Data 37 of 42 38 Discriminant Analysis: Already in the MANOVA output Produces the bs for each variate Statistical Analysis of Data Step by Step Useful for gaining insight into the relationships between DVs for each group Finally, click on “Continue”, then “Classify” on the main dialog box for discriminant analysis 38 of 42 38 Discriminant Analysis: Step by Step Plots the variate scores for each participant groups according to the study strategy they were given Provides an overall gauge of how well the discriminant variates classify the actual participants Statistical Analysis of Data 39 of 42 38 Discriminant Analysis: Step by Step Click on “Save” These scores can be useful because the variates that the analysis identifies may represent important underlying constructs Statistical Analysis of Data 40 of 42 38 Discriminant Analysis Output Shows that only one of the variates is significant. Thus, the group differences shown by the MANOVA can be explained in terms of one underlying dimension Statistical Analysis of Data 41 of 42 38 Discriminant Analysis Output These coefficients tell us the relative contribution of each variable to the variates. These values are comparable to factor loadings. They represent the relative contribution of each DV to group separation. Statistical Analysis of Data 42 of 42 38 Results The MANOVA was followed up with discriminant analysis, which revealed two discriminant functions. The first explained 96.7% of the variance, canonical R2 = .56, whereas the second explained only 3.3% of the variance, canonical R2 = .04. In combination, these discriminant functions significantly differentiated the treatment groups, Λ = 0.42, 2(4) = 22.90, p < .001, but removing the first function indicated that the second function did not significantly differentiate the treatment groups, Λ = 0.96, 2(1) = 1.12, p > .05. The correlations between outcomes and the discriminant functions revealed that total correct on recall questions loaded almost exclusively on the first function (r = .99); total correct on application questions loaded more highly on the second function (r = .89) than on the first function (r = .47). Statistical Analysis of Data 43 of 42 38