Summary of Formulas/Tests (Chapter 10&12) Chapter 10: Correlation and Regression 1. Linear Correlation Coefficient ๐(∑ ๐ฅ๐ฆ) − (∑ ๐ฅ)(∑ ๐ฆ) ๐= √๐(∑ ๐ฅ²) − (∑ ๐ฅ)² √๐(∑ ๐ฆ²) − (∑ ๐ฆ)² ๏ท Notation: n: the number of pairs of data present r: the linear correlation coefficient for a sample, -1 ≤ r ≤ +1 If r is close to 0, we conclude that there is no linear correlation between x and y, but if r is close to -1 or +1, we conclude that there is a linear correlation between x and y. 2. Coefficient of Determination: r² The value of r² is the proportion of the variation in y that is explained by the linear relationship between x and y. 3. Hypothesis Test for Correlation H0: ρ = 0 (There is no linear correlation.) H1: ρ ≠ 0 (There is a linear correlation.) (ρ: Greek letter rho used to represent the linear correlation coefficient for a population.) ๏ท Method 1: Test statistics is t: Test statistic: ๐ก = ๐ √1−๐² ๐−2 Critical values: t distribution (Table A-3 with df = n – 2) ๏ท Method 2: test statistic is r Test statistic: ๐ = ๐(∑ ๐ฅ๐ฆ)−(∑ ๐ฅ)(∑ ๐ฆ) √๐(∑ ๐ฅ²)−(∑ ๐ฅ)²√๐(∑ ๐ฆ²)−(∑ ๐ฆ)² Critical value: critical values of the person correlation coefficient r (Table A-6) 4. Regression Equation ๏ท Notations for regression equation Population parameter y-intercept of regression equation β0 Slope of regression equation β1 Equation of the regression line Y = β0 + β1x b1= Sample statistic b0 b1 ๐ฆฬ = b0 + b1x ๐(∑ ๐ฅ๐ฆ)−(∑ ๐ฅ)(∑ ๐ฆ) ๐(∑ ๐ฅ²)−(∑ ๐ฅ)² b0= ๐ฆ − b1๐ฅ or ๐0 = (∑ ๐ฆ)(∑ ๐ฅ 2 )−(∑ ๐ฅ)(∑ ๐ฅ๐ฆ) ๐((∑ ๐ฅ 2 )−(∑ ๐ฅ)² ๐ฆฬ = b0 + b1x 5. 6. 7. 8. Residual = observed y – predicted y = y - ๐ฆฬ Total Deviation = y - ๐ฆ, total variation = ∑(๐ฆ − ๐ฆ)² Explained Deviation = y - ๐ฆฬ − ๐ฆ, explained variation = ∑( ๐ฆฬ − ๐ฆ)² Unexplained Deviation = ๐ฆ − ๐ฆฬ, unexplained variation = ∑(๐ฆ − ๐ฆฬ)² 9. Coefficient of Determination: ๐² = 10. Standard Error of Estimate: se= √ ๐๐ฅ๐๐๐๐๐๐๐ ๐ฃ๐๐๐๐๐ก๐๐๐ ๐ก๐๐ก๐๐ ๐ฃ๐๐๐๐๐ก๐๐๐ ∑(๐ฆ−๐ฆฬ)² ๐−2 or se= √ 1 ๐ + − ๐ฆ)² ∑ ๐ฆ²−๐0 ∑ ๐ฆ−๐1 ∑ ๐ฅ๐ฆ 11. Prediction Interval for an Individual y Margin of error: E = tα/2Se√1 + ∑(๐ฆฬ−๐ฆ)² = ∑(y ๐(๐ฅ0 −๐ฅ)² ๐(∑ ๐ฅ²)−(∑ ๐ฅ)² Prediction interval: ๐ฆฬ − ๐ธ < ๐ฆ < ๐ฆฬ + ๐ธ X0: given value of x tα/2: t distribution (Table A-3 with df = n-2) 12. Multiple Regression Multiple regression equation: ๐ฆฬ = b0 + b1x1 + b2x2 + โฏ + bkxk ๐−2 Adjusted coefficient of determination: ๐ ² = 1 − (๐−1) [๐−(๐+1)] (1 − ๐ 2 ) n: sample size k: number of predictor (x) variables Chapter 12: Analysis of Variance 1. One-Way ANOVA with Equal Sample Sizes n H0: µ1 = µ2 = µ3 = ... H1: At least one of the means is different from the others. Test statistic: ๐น = ๐ฃ๐๐๐๐๐๐๐ ๐๐๐ก๐ค๐๐๐ ๐ ๐๐๐๐๐๐ ๐ฃ๐๐๐๐๐๐๐ ๐ค๐๐กโ๐๐ ๐ ๐๐๐๐๐๐ = ๐๐ ๐ฅ2 ๐ ๐2 = ๐๐ ๐ฅ2 2 2 ๐ 2 1 +๐ 2 +โฏ+๐ ๐ ๐ Critical values: F distribution (Table A-5 with numerator df = k – 1 and denominator df = k (n - ), k = number of samples and n = sample size) 2. One-Way ANOVA with Unequal Sample Sizes H0: µ1 = µ2 = µ3 = ... H1: At least one of the means is different from the others. Test statistic: ๐น= ๏ท ๐ฃ๐๐๐๐๐๐๐ ๐๐๐ก๐ค๐๐๐ ๐ ๐๐๐๐๐๐ ๐ฃ๐๐๐๐๐๐๐ ๐ค๐๐กโ๐๐ ๐ ๐๐๐๐๐๐ = ∑ ๐๐ (๐ฅ๐−๐ฅ ฬฟ)² ] ๐−1 ∑(๐ −1)๐ 2 [ ∑(๐๐ −1)๐ ] ๐ [ = ๐๐ (๐ก๐๐๐๐ก๐๐๐๐ก) ๐๐ (๐๐๐๐๐) Notation: ๐ฅฬฟ : mean of all sample values combined ๐: number of population means being compared ๐๐ : number of values in the ith sample ๐ฅ๐ : mean of values in the ith sample ๐ ๐2 : variance of values in the ith sample Critical values: F distribution (Table A-5 with numerator df = k – 1 and denominator df = N – k with k = number of samples and N = total number of values in all samples combined) 3. Two-Way ANOVA H0: There are no effects from the row factor (that is, the row means are equal). H1: There are no effects from the column factor (that is, the column means are equal). Step 1. Interaction Effect Test for an interaction between the two factors using ๐น = ๐๐ (๐๐๐ก๐๐๐๐๐ก๐๐๐) ๐๐(๐๐๐๐๐) Step 2. Row/Column Effect Is there an effect due to interaction between the two factors? ๏ Yes (Reject H0 of no interaction effect.). Stop. Don’t consider the effects of either factor without considering the effects of the other. ๏ No (Fail to reject H0) of no interaction effect.). ๏จ Test for effect from row factor using ๐น = ๐๐ (๐๐๐ค ๐๐๐๐ก๐๐) ๐๐(๐๐๐๐๐) Interpretation: Compare P-value with significance level. If the P-value is less than or equal to the significance level, reject the null hypothesis of no effects from the row factor. If the P-value is greater than the significance level, fail to reject the null hypothesis of no effects from the row factor. ๏จ Test for effect from column factor using ๐น = ๐๐ (๐๐๐๐ข๐๐ ๐๐๐๐ก๐๐) ๐๐(๐๐๐๐๐) Interpretation: Compare P-value with significance level. If the P-value is less than or equal to the significance level, reject the null hypothesis of no effects from the column factor. If the P-value is greater than the significance level, fail to reject the null hypothesis of no effects from the column factor.