10 The Analysis of Variance Copyright © Cengage Learning. All rights reserved. http://www.luchsinger-mathematics.ch/Var_Reduction.jpg ANOVA: Examples 1) Do four different types of steel have the same structural strength? 2) Does the major of the student (math, engineering, life sciences, economics, computer science) have an effect on the student’s grade in STAT 511? 3) Does the percentage of alcohol in gasoline has an effect on the mpg? 4) Does the heat retention in a house depending on the thickness or of insulation in the attic? ANOVA: Graphical ANOVA: notation Xij: jth measurement taken from the ith population sample sizes: n1, …, nI šš š=1 ššš šš. = šš 2 šš ššš − š ššš š=1 2 šš = = š−1 š−1 nT = n1 + … + nI šš š¼ š=1 š=1 ššš š.. = šš ANOVA: Assumptions 1. All samples are independent of each other. 2. Each population or treatment distributions are normal with E(Xij) = ļI. 3. Each population has the same variance (pooled), Var(Xij) = σ2. ANOVA test statistic ANOVA test F Distribution http://www.vosesoftware.com/ModelRiskHelp/index.htm#Distributions/ Continuous_distributions/F_distribution.htm F curve and critical value http://controls.engin.umich.edu/wiki/index.php/Factor_analysis_and_ANOVA Table A.9 Critical Values for F Distribution (first page) ANOVA Table: Formulas Source Model (Between) Error (Within) df SS I I–1 ļ„ļ„ (x iļ½1 jļ½1 I i. ļx..) 2 ni 2 (x ļ x .) nT – I ļ„ļ„ ij i iļ½1 jļ½1 I Total ni ni 2 (x ļ x..) nT – 1 ļ„ļ„ ij iļ½1 jļ½1 MS (Mean Square) F SSM SSM ļ½ dfm I ļ1 MSM MSE SSE SSE ļ½ dfe nT ļ I ANOVA Hypothesis test: Summary H0: μ1 = μ2 = ļļļ = μI Ha: At least one ļi is different Test statistic: š¹ = ššš šššø Rejection Region: F ≥ Fļ”,dfm,dfe ANOVA: Example An experiment was carried out to compare five different brands of automobile oil filters with respect to their ability to capture foreign material. A sample of nine filters of each brand was used. Do the filters capture the same amount of foreign material at a 0.05 significance level? ANOVA: Example (cont) 2. H0: ļ1 = ļ2 = ļ3 = ļ4 = ļ5 The true mean amount of foreign material is the same for all of the filters HA: at least one of the ļi is different The true mean amount of foreign material caught is not the same for all of the filters ANOVA: Example (cont) Source Model Error Total df 4 40 44 SS MS F 13.32 3.33 37.84 3.53 0.088 16.85 Example: ANOVA (cont) 7. The data does provide strong support to the claim that the mean amount of foreign material caught is not the same for all of the filters. Problem with Multiple t tests Overall Risk of Type I Error in Using Repeated t Tests at ļ” = 0.05 Table A.10: Studentized Range ANOVA: Example (Tukey) An experiment was carried out to compare five different brands of automobile oil filters with respect to their ability to capture foreign material. A sample of nine filters of each brand was used. Do the filters capture the same amount of foreign material at a 0.05 significance level? Which one(s) of the filters is best? xĢ 1. = 14.5 xĢ 2. = 13.8 xĢ 3. = 13.3 xĢ 4. = 14.3 xĢ 5. = 13.1 ANOVA: Example (cont) Source Model Error Total df 4 40 44 SS MS F 13.32 3.33 37.84 3.53 0.088 16.85 Example: Tukey (cont) i–j 1–2 1–3 1–4 1–5 2–3 2–4 2–5 3–4 3–5 4–5 xĢ i - xĢ j 0.7 1.2 0.2 1.4 0.5 -0.5 0.7 -1.0 0.2 1.2 CI Same? (0.3, 1.1) (0.8, 1.6) (-0.2, 0.6) yes (1.0, 1.8) (0.1, 0.9) (-0.9, -0.1) (0.3, 1.1) (-1.4, -0.6) (-0.2, 0.2) yes (0.8, 1.6) Example: Tukey (cont) xĢ 5. 13.1 xĢ 3. 13.3 xĢ 2. 13.8 xĢ 4. 14.3 xĢ 1. 14.5 xĢ 5. 13.1 xĢ 3. 13.3 xĢ 2. 13.8 xĢ 4. 14.3 xĢ 1. 14.5