Chi Square Classifying yourself as studious or not. Yes 58 No 42 Total 100 Are they significantly different? Studious Read ahead Yes No Total 12 18 30 No 46 24 70 Total 58 42 100 Yes Does reading ahead make a difference? Independence? One variable Choice of PSYA01 Section L01 25 L02 40 L03 15 L30 36 Total 116 Is this more than a chance difference? 2 (O E ) 2 E O = the observed frequency in a category E = the expected frequency in that category We may expect different categories to have the same frequency if chance alone is at work. (25 29) 2 (40 29) 2 (15 29) 2 (36 29) 2 29 29 29 29 = .55 + 4.17 + 6.79 + 1.69 = 13.17 Is this significant? Go to the table. df = k - 1 Two Variable Are the two variables independent of each other? Contingency Table Career Choice Nat. Sci. Soc. Sci Totals 37 16 53 Female 47 62 109 Totals 84 78 162 contingency is another word for “possibility” Male So this is a “table of possibilities” Marginal Totals The key is determining the expected frequencies of the four observed frequencies (the 4 colored cells). Two Variables – Expected Frequencies Testing the null hypothesis that the variables are independent We know that the probability of the joint occurrence of two independent events is the product of their separate probabilities. 37 16 53 47 62 109 e.g., (84/162) X (53/162) = .1696 or 16.96% of the observations are expected in the upper left hand cell. But, N (162) times = 27.48 (expected frequency) 84 78 162 Expected Frequencies 27.48 25.52 Now we can use….. 2 56.52 52.48 (O E ) 2 E Expected Frequencies and Alternative Calculations Eij Ri C j N 53(84) E11 27.48 162 R = the row total C = the column total E12 53(78) 2552 . 162 E21 109(84) 56.52 162 E22 109(78) 52.48 162 (37 27.48) 2 (16 2552 . ) 2 (47 56.52) 2 (62 52.48) 2 27.48 2552 . 56.52 52.48 = 3.30 + 3.55 + 1.60 + 1.78 = 10.18 Is the probability of this Chi-Square value (or larger) less than .05? Degrees of Freedom for Two Variables df = (R-1)(C-1) R = the number of rows C = the number of columns With our example: df = (2-1)(2-1) = 1 Go to Chi-Square Table and you find that the critical value is 3.84. Our Chi-Squared obtained must be larger than 3.84 for us to reject the null hypothesis. What was the null hypothesis? Phi Coefficient Will establish (at the .05 alpha level) whether two variables are related. A significant Chi-Square means we reject the null hypothesis (which assumes that the two variable are independent. We feel we have evidence That the two variable are related. Gives the numerical value to the relation. The value can range from zero to one. Zero meaning no relation at all (independence) and one indicating a prefect relations. If you know one variable’s value you, you can perfectly predict the value of the other variable.