Data in the Classroom CSU Fresno November 1, 2010 3/10/2016 1 Presenters John Korey Cal Poly Pomona, Political Science jlkorey@csupomona.edu Ed Nelson CSU Fresno, Sociology ednelson@csufresno.edu 3/10/2016 2 Workshop Agenda 3/10/2016 Introductions (Ed Nelson) SSRIC (Ed) Data for this workshop (John Korey) Issues and examples Experimental design (John) Sampling and Statistical Inference (Ed) Causality and contingency tables (Ed and John) Fun with graphics (John) Change over time (John) Where can we get the data? (John) What are we doing this year at Fresno State? (Ed) Evaluations 3 SSRIC Social Science Research & Instructional Council http://www.ssric.org 3/10/2016 4 The Council Oldest CSU affinity group -- founded in 1972 Each campus has a representative Works to provide access to data Promotes use of data analysis in research and teaching 3/10/2016 5 The Council Annual student research conference on April 29 at San Jose State University Sponsors attendance at the ICPSR summer workshops in Ann Arbor, Michigan http://www.ssric.org/participate/icpsr_summ er Works with the Field Institute -- selects faculty fellow (12 questions) – proposal due April 15 3/10/2016 6 Datasets for This Workshop Based on SPSS for Windows 16.0: A Basic Tutorial (http://www.ssric.org/trd/spss16) General Social Survey (GSS) 2006 Subset Based on Introduction to Research Methods (http://www.csupomona.edu/~jlkorey/POWERMUTT/i ndex.html) American National Election Study (ANES) 2004 Subset GSS Cumulative File Subset ANES 2000-2002-2004 Panel Study Subset U.S. Senate 3/10/2016 7 Issues and Examples Experimental design Sampling and statistical inference Causality and contingency tables Fun with graphics Change over time 3/10/2016 8 Experimental Design 3/10/2016 9 Design Requirements Experiments Random assignment to groups Manipulation by experimenter of independent (predictor) variable Quasi-experiments 3/10/2016 10 Types of Experiments Laboratory Field 3/10/2016 11 Laboratory Experiment: Prisoner’s Dilemma HOMICIDE DIVISION INTERROGATION ROOM A HOMICIDE DIVISION INTERROGATION ROOM B 3/10/2016 12 Laboratory Experiment: Prisoner’s Dilemma INTERROGATION IN PROGRESS DO NOT ENTER 3/10/2016 13 Laboratory Experiment: Prisoner’s Dilemma JACK’S BAIL BONDS “I’ll get you out if it takes 20 years.” 909/869-4619 24/7 3/10/2016 14 Laboratory Experiment: Prisoner’s Dilemma Outcomes KEY: A'S OUTCOME B'S OUTCOME B TALKS B DOESN’T TALK 3/10/2016 A TALKS A DOESN'T TALK 10 YEARS 10 YEARS DEATH 1 YEAR 1 YEAR DEATH WALK WALK 15 Field Experiments Gosnell (1927) 3/10/2016 Gerber and Green (2000) 16 Resources The Center for Experimental Social Science 3/10/2016 17 Experimental Design in Survey Research Telephone vs. face to face (2000 ANES) Question wording: Do you favor or oppose doing away with the DEATH tax? Do you favor or oppose doing away with the ESTATE tax? 3/10/2016 18 House 3/10/2016 19 Estate http://en.wikipedia.org/wiki/File:Ashford_castle.jpg 3/10/2016 20 Results (2002 ANES) Favor abolishing “death tax”: 74.3% Favor abolishing “estate tax”: 71.5% p = n.s. 3/10/2016 21 Sampling and Statistical Inference 3/10/2016 22 What do we want to make sure our students understand? Populations and samples Parameters and statistics Sampling variability Margin of error Confidence intervals and confidence levels 3/10/2016 23 Basic principle Samples vary What factors influence sampling variability? Size of sample Population variability How sample was selected 3/10/2016 24 Using Simulations to Teach Statistical Inference Draw repeated random samples Compute sample statistic Construct chart showing the distribution of these sample statistics Demonstration – see http://constats.atech.tufts.edu 3/10/2016 25 Estimators and Estimates An estimator is the method and an estimate is the numerical result Demonstration – see http://inspire.stat.ucla.edu/unit_09/teaching_t ips.php 3/10/2016 26 Resources -- Exercises Rolling dice and flipping coins – see http://www.causeweb.org/repository/StarLibr ary/activities/andrews_2003/ M&M’s – see http://www.ropercenter.uconn.edu/education/ assignments/polling_basics.pdf Drawing cards (Aces to Kings) – Xuanning Fu (CSU Fresno) 3/10/2016 27 Resources – Web Sites Roper Center -- Fundamentals of polling: http://www.ropercenter.uconn.edu/education/ polling_fundamentals.html American Association for Public Opinion Research – more on polling -http://www.aapor.org/Poll_andamp_Survey_F AQs.htm Sample size calculator -http://www.surveysystem.com/sscalc.htm 3/10/2016 28 Causality and Contingency Tables 3/10/2016 29 What do we need to do to establish cause and effect? Statistical relationship Causal ordering Eliminate alternative explanations 3/10/2016 30 Example Religiosity and how to regulate the distribution of pornography – data set – gss06_subset_for_classes_modified2.sav RELITEN – how religious the respondent is PORNLAW – how the respondent feels about regulating the distribution of pornography 3/10/2016 31 Spuriousness Are there any alternative explanations (other than the causal one) for the relationship? Can we think of any alternative explanations for RELITEN and PORNLAW? Gender might account for this relationship. Women are more religious than men and also more likely to want to restrict the distribution of pornography In other words, the relationship between X and Y might be spurious. So what we need to do is to test for spuriousness 3/10/2016 32 Testing for Spuriousness Independent variable (X) is RELITEN Dependent variable (Y) is PORNLAW Control variable (C) is SEX 3/10/2016 33 Conclusions We found out that the relationship of RELITEN and PORNLAW was not spurious when we controlled for SEX But does that mean that we can conclude that the relationship is never spurious? What does this say about proving causality? 3/10/2016 34 Applying this to the Classroom Start with examples that make sense to students Move to examples with real data that students can run Generalize to issues of testing causality Can show that a relationship is not causal (i.e., it’s spurious) Can never prove that a relationship is causal. 3/10/2016 35 Example: Specification Open General Social Survey Subset Does level of education influence the relationship between political views and party identification? 3/10/2016 36 Specification (continued) From Menu bar, go to: Analyze Descriptive Statistics Crosstabs Dependent variable (first box): partyid Independent variable (second box): polviews Control variable: (third box): degree Statistics: Kendall’s taub Cells: Column percentages 3/10/2016 37 Specification (continued) Look at pattern of Kendall’s taub statistics 3/10/2016 38 Example: Reactivity We know that the race of the interviewer in face-to-face interviews affects what people tell us about race We know that the perceived race of the interviewer in telephone interviews also influences what people tell us What about the gender of the interviewer in face-to-face interviews? 3/10/2016 39 ANES Example Open anes04s We’ll going to use three variables GENDER – gender of respondent INTGENPO – gender of interviewer WORKMOM – do you agree or disagree [that a] working mother can establish just as warm and secure a relationship with her children as a mother who does not work? Let’s start by using the gender of the interviewer (INTGENPO) as our independent variable and WORKMOM as our dependent variable 3/10/2016 40 ANES Example Continued What did we discover? Respondents interviewed by women are more likely to agree that working mothers can have a warm relationship with their children Now let’s see if this is true for both male and female respondents. Let’s control for GENDER – gender of the respondent We discover that it is true for both men and women. It appears that the gender of the interviewer does influence what people tell us about working mothers and their children 3/10/2016 41 ANES Example Implications Since about 75% of the interviewers in this survey were women, this has some serious implications. This suggests that we will overestimate the percent of people that feel that working mothers can have a warm relationship with their children 3/10/2016 42 Fun with Graphics 3/10/2016 43 Box and Whiskers Plots Open senate file (senate_mod.sav) Compare acu and dwnom scores 1. Graphs Legacy Dialogs Boxplots Clustered Summarize by Separate Variables Define 2. 1st box: acu, dwnom; 2nd box: party; 3rd box: name; OK 3/10/2016 44 Box and Whiskers Plots (continued) Convert acu and dwnom to Z scores 1. Analyze Descriptive Statistics Descriptives 2. Move acu and dwnom to right window 3. Check Save standardized values as variables 3/10/2016 45 Box and Whiskers Plots (continued) Compare Zacu and Zdwnom scores 1. Graphs Legacy Dialogs Boxplots Clustered Summarize by Separate Variables Define 2. 1st box: Zacu, Zdwnom; 2nd and 3rd boxes remain the same; OK 3/10/2016 46 Sample Size and the “Margin of (Sampling) Error” http://www.surveysystem.com/sscalc.htm 3/10/2016 47 Just the Facts http://pollingreport.com/guns.htm 3/10/2016 48 Poll Aggregators 3/10/2016 http://www.pollster.com/polls/ 49 Do It Yourself Prognostication http://uselectionatlas.org/PRED/ 3/10/2016 50 Resources Examples of Assignments (Roper Center) Polling 101: Fundamentals of Polling (Roper Center) Polling 201: Analyzing Surveys (Roper Center) Polling for Dummies Sample size calculator (Creative Research Systems) Sampling Distributions (Tufts) Polling and Survey FAQs (AAPOR) 3/10/2016 51 Change Over Time 3/10/2016 52 Objectives To explain: Trend and cohort analysis (gsscums.sav) Panel studies (anespanl.sav) 3/10/2016 53 Age Cohorts GI Generation (born 1927 or earlier) Silent Generation (1928-1945) Baby Boomers (1946-1964) Generation X (1965-1981) Generation Y (1982 or later) 3/10/2016 54 Procedure 3/10/2016 SPSS line charts 55 Dependent Variables Values recoded into two categories (0 and 100) as nearly equal in size as possible. Example: Confidence in press is recoded as 100 (a lot or only some) and 0 (hardly any or none). The resulting line graph can be interpreted as the percent of respondents coded as 100, that is, having at least some confidence in the press. 3/10/2016 56 Trend Analysis: Daily Newspaper Readership (Commands) Open gsscums.sav Click on Graphs -> Legacy Dialogs -> Interactive -> Line Move NEWS to first window on right, and YEAR to second window. Click on OK 3/10/2016 57 Trend Analysis: Daily Newspaper Readership (Results) 3/10/2016 58 Cohort Analysis To illustrate: Generational replacement Life cycle patterns Across the board change 3/10/2016 59 Cohort Analysis: Daily Newspaper Readership (Commands) Open gsscums.sav Click on Graphs -> Legacy Dialogs -> Interactive -> Line Move NEWS to first window on right, YEAR to second window, and COHORT to third window. Click on OK 3/10/2016 60 Cohort Analysis: Daily Newspaper Readership (Results) 3/10/2016 61 More Cohort Analysis Repeat above commands (first without, then with, COHORT), but instead of NEWS, use TVHOURS (over 2 hours per day watching TV), then CONPRESS (at least some confidence in the press) 3/10/2016 62 Even More Cohort Analysis Repeat above, but try the following: GRASS (favor legalization of marijuana) RACMAR (oppose interracial marriage) TRUST (think most people can be trusted) 3/10/2016 63 Panel Studies Open anespanl.sav Did respondents in 2004 recall their 2000 vote differently than they had in 2000? Click on Analyze -> Descriptive Statistics -> Frequencies Obtain frequency distributions for P200004 and P200000. 3/10/2016 64 Panel Studies Did the relationship between party identification and feelings about Ralph Nader change between 2000 (pre-election) and 2004? Click on Analyze -> Compare Means -> Means. Move NADR00PR and NADR04 to first window on right, and PTYID300 to second window. Click on OK. 3/10/2016 65 Where Can We Get Data? Data resources on or linked from the SSRIC website: http://www.ssric.org/data 3/10/2016 66 Social Science Databases The California State University subscribes to three data bases to support teaching and research Data bases Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan Field Poll in San Francisco Roper Center for Public Opinion Research at the University of Connecticut General Social Survey and American National Election Studies are available through these databases These are available to campuses by annual subscription 3/10/2016 67 Proxy Servers On-campus access to data bases is IP authenticated Off-campus access to ICPSR and Roper through your campus’ proxy server For ICPSR, account only needs to be authenticated from on campus or via proxy server every six months; otherwise, can be accessed from anywhere. Off-campus access not available for Field data Another alternative: set up a VPN on your home computer 3/10/2016 68 Where Do We Get the Data? •SSRIC: http://www.ssric.org/data •Pew: http://people-press.org/dataarchive/ •PPIC: http://www.ppic.org/main/datadepot.asp •Berkeley’s SDA archive: http://sda.berkeley.edu/archive.htm •ICPSR: http://www.icpsr.org •Roper: http://www.ropercenter.uconn.edu •Field Public : ftp://128.32.165.222:2121/ (download spss files) CSU and UC only ( analyze online): http://ucdata.berkeley.edu/data_record.p hp?recid=3#analyze 3/10/2016 69 What are we doing this year at Fresno State? Workshops for faculty and staff Teaching with Data (September 23) Data in the classroom (November 1 with special guest presenter John Korey, Political Science, CSU Pomona) Online statistical packages (SDA) (early spring) SPSS (introductory and intermediate) (late spring) Encourage students to present their research at student research conferences (SRC) SSRIC’s SRC in San Jose on April 29 Santa Clara University’s Anthropology and Sociology SRC in April CSU’s Student Research Competition in Fresno on May 6-7 Presentations at the department level One-on-one consultations with faculty Surveys to get faculty’s input and feelings 3/10/2016 70 Evaluations 3/10/2016 71