Cross-Tabs With Nominal Variables 10/24/2013 Readings • Chapter 7 Tests of Significance and Measures of Association (Pollock) (pp. 155-169) • Chapter 5 Making Controlled Comparisons (Pollock Workbook) • Chapter 7 Chi-Square and Measures of Association (Pollock Workbook) OPPORTUNITIES TO DISCUSS COURSE CONTENT Office Hours For the Week • When – Friday 10-11 – Monday 10-12 – Tuesday 8-12 – And by appointment Course Learning Objectives 1. Students will be able to interpret and explain empirical data. 2. Students will achieve competency in conducting statistical data analysis using the SPSS software program. A test of statistical significance CHI-SQUARE What is Chi-Square? • A test of significance between two categorical variables • We run the test in conjunction with crosstabs Things about Chi-Square • It is not a test of strength, just significance • Chi-square is inflated by large samples • It is a test that tries to disprove the null hypothesis. • An insignificant chi-square means that no relationship exists. Chi-Square is an up or down measure • if our Chi-Square significance value from our test is greater than .05 we accept the null hypothesis and we have no relationship • If our significance value is less than or equal to.05 table, we reject the null hypothesis- we have a relationship Nominal Variables MEASURES OF ASSOCIATION Why Measures of Association • Chi-Square only tests for significance • It does not say how strongly the variables are related • We Use a Measure of Association to Do this A measure of association is a single number that reflects the strength of the relationship Measures of association for Nominal Variables tell us: • Strength of the Relationship • The statistical significance of the relationship • These go hand in hand Measures of Association for Nominal Variables Measure of Association Range Lambda 0 - 1.0 Phi 0 - 1.0 Cramer's V 0 - 1.0 Characteristics may underestimate, but a PRE measure Use for a 2x2 table only and is Chi-square based Chi-square based and the compliment to PHI. A value of 1.00 means a perfect relationship, a value of .000 means no relationship Lambda • What kinds of variables are needed for Lambda? • Lambda ranges from 0 (no relation) to 1 (a perfect relationship) • It measures how much better one can predict the value of each case on the DV if one knows the value of the IV Interpreting Lambda • .000 to .10 none • .10-.20 weak • .20-.30 moderate • .30-.40 strong • .40 and above- there is a very strong relationship Reading Lambda in SPSS • IN SPSS, LAMBDA GIVES YOU 3 DIFFERENT VALUES • Symmetric- always ignore • Two measures of your dependent variable – always use the lambda associated with your dependent variable. – If you place the dependent variable as the ROW VARIABLE, this will be the middle value. • Help from Rocky IV- And the video Lambda Significance Value • The P-value for the test statistic (p<.05) • Is the association real or happening by chance? The one in the middle Ignore these The significance of the Lambda p<.05 Lambda as a PRE Measure • Proportional Reduction in Error (PRE) • this is defined as the improvement, expressed as a Percentage, in predicting a dependent variable due to knowledge of the independent variable. • How well we can increase our prediction of the dependent variable by knowing the independent variable? Converting a Lambda to a Percent • We take the value of our association measure • Multiply by 100% • this is our PRE value. Problems with Lambda • It fears a TYPE I error (false alarm) so it is very conservative • Lambda can Underestimate relationships, even when there are significant chisquare values. • If the modal category is even, Lambda is pretty useless. SOME LAMBDA PRACTICE EXAMPLES Fracking and the Northeast 30% 25% 20% 15% 10% 5% 0% NE MW S W Phi and Cramer’s V ALTERNATIVES TO LAMBDA Cramer’s V • An alternative to Lambda • Ranges from 0 -1.0 • Not a Pre Measure Phi • Measured similarly to Lambda • You will use this with 2x2 tables only Phi And Cramer’s V Interpreting them • .000 to .10 none Limitations • Neither are PRE Measures • .10-.15 weak • .15-25 moderate • .25.- 40 strong • .40 and above- there is a very strong relationship • They are both Chi-square based so large samples inflate it An Example • Here we can say with a .369 Cramer's V, that we have a strong relationship between our independent and dependent variables. Lambda Underestimating What the Cramer’s V Tells Us • If the Modal category is hard to predict, Lambda falls flat • What we see is a weakto-moderate relationship here. • Independents and Democrats are different Lambda Underestimating Part II D.V.- obama_win08 IV- Region Lambda shows Nothing We have a moderate relationship, but it is not significant (small sample) RUNNING LAMBDA, PHI AND CRAMER’S V Easy to Do • How to do it in SPSS • Open States.SAV • Analyze – Descriptive • Cross-Tabs – Click on the Statistics Tab • Highlight your nominal variable statistics – Choose continue Two Examples Region and Cig Taxes Region and Public Support for Gay Rights Open up the GSS and Try one for yourself