PSC 47410: Data Analysis Workshop What’s the purpose of this exercise? The workshop’s research questions: Who supports war in America? How consistent is support for conflict across different types of war? Should we be considering other, related questions? If this project were to be expanded, what could else we look at… how different groups change their support over time, support for other types of conflict, varying patterns in other societies Will you want to do your research paper on work that builds on this workshop? You research paper assignment will ask you to either do a more complex paper on public opinion and war or to test the power of various causes of war to explain the outbreak of a conflict not considered in our class readings How would a research paper using Pew data be different than this assignment with respect to its literature review and what methods you would need to use? PSC 4000-level Stats Workshop Where Do You Get Survey Data? Why are we looking at survey data from the Pew Foundation? How do we download a dataset from Pew? http://people-press.org/category/datasets/?download=20034647 Why do we have to read the methodology sheet first (it’s boring, but critical)? What do we do with the questionnaire? How does scientific method work?: In theory and in the real world Can we operationalize our dependent variable(s)? Are there some questions that we can use to identify who supports war? Do we have the independent variable(s) we need? Are there any questions that identify characteristics (e.g., gender) that we expect to be correlated with whether a person supports war? What specific hypotheses can we explore with our survey? What specific relationships do we expect to find between our independent and dependent variables? PSC 4000-level Stats Workshop What Do You Need to Turn In? Your assignment has two components: The first asks you to complete several data assignments The second is an 1000-word analytical essay The assignments related to SPSS ask you to: Recode and label both new variables and their response categories You will be asked to create dummy (1-0 value) variables. For example, you can use a question asking about religion to create a new variable called “Catholic,” where all Catholic respondents are coded 1 and all other respondents are coded zero Calculate descriptive statistics Mean (the average value of the response categories) Median (the middle value for all of your observations) Mode (the most common response if there’s no natural order to variable’s response categories) Generate a bar graph Do a little bivariate data analysis using SPSS’s cross-tab routine PSC 4000-level Stats Workshop Getting Started with SPSS Open your dataset with SPSS Open a new syntax file What are the four SPSS screens you will be working with? Data editor: Data view Data editor: Variable view The syntax file (Why is it WAY better than just pointing and clicking all of the time? Why is this the ONLY file that you will want to save in most cases? ) The output file PSC 4000-level Stats Workshop Recoding Data in SPSS Why do we need to recode? Need to drop refusals (but we want to be careful to think about what “don’t know means… e.g. if someone doesn’t know that she’s a born-again Christian, she’s not one) Variables may not measure constructs correctly, so we need to create different categories that make sense We’ll want to create dummy (0-1) variables How do we recode? Use the point and click menus at the top of the data editor Use the “Recode Into Different Variable” option Select variable you want to recode Tell SPSS how you want to recode the variable Name and label the new variable Define values Do not hit OK; instead, you want to paste command into syntax Highlight the appropriate syntax and click the run (arrow) button PSC 4000-level Stats Workshop Recoding Data: Step by Step Illustration PSC 4000-level Stats Workshop Recoding Data: Step by Step Illustration PSC 4000-level Stats Workshop Recoding Data: Step by Step Illustration PSC 4000-level Stats Workshop Recoding Data: Step by Step Illustration PSC 4000-level Stats Workshop Recoding Data: Step by Step Illustration PSC 4000-level Stats Workshop Recoding Data: Step by Step Illustration PSC 4000-level Stats Workshop Labeling Values in SPSS Why do we need to label the values used for different response categories (e.g., 1=male, 0=woman)? You may not remember how you recoded values Labels will transfer into tables and graphs How do we label values for our variable categories? Use “Define Variable Properties” option Select variable(s) with values you want to label Tell SPSS how you want to label the values Paste command into syntax Highlight syntax and click run Once you’ve got the syntax down, you may want to work much more quickly with other variables by copying and pasting that syntax and making appropriate substitutions PSC 4000-level Stats Workshop Creating Value Labels for Response Categories: Step by Step Illustration PSC 4000-level Stats Workshop Creating Value Labels for Response Categories: Step by Step Illustration PSC 4000-level Stats Workshop Creating Value Labels for Response Categories: Step by Step Illustration PSC 4000-level Stats Workshop Creating Value Labels for Response Categories: Step by Step Illustration PSC 4000-level Stats Workshop Creating Value Labels for Response Categories: Step by Step Illustration PSC 4000-level Stats Workshop Descriptive Statistics in SPSS What descriptive statistics may interest us? Frequencies Mean (the average), Medians (the middle observation), and Mode (rarely used) How do we compute descriptive statistics? Use “Descriptive Statistics” option Select “Frequencies” Select variable(s) that you want to analyze Select “Statistics” Select the statistics you want to view Option 1 (if you will need the results later): click the paste command to create syntax; highlight the syntax and click run Option 2 (if you are using the results right now): click OK and the syntax won’t be pasted into your syntax file, but the output will be created PSC 4000-level Stats Workshop Descriptive Statistics: Step by Step Illustration PSC 4000-level Stats Workshop Descriptive Statistics PSC 4000-level Stats Workshop Descriptive Statistics: Step by Step Illustration PSC 4000-level Stats Workshop Descriptive Statistics: Step by Step Illustration PSC 4000-level Stats Workshop Descriptive Statistics: Step by Step Illustration PSC 4000-level Stats Workshop Descriptive Statistics: Step by Step Illustration PSC 4000-level Stats Workshop Descriptive Statistics: Step by Step Illustration PSC 4000-level Stats Workshop Graphing in SPSS What kinds of graphs might we create? Bar Line Histogram How do we compute descriptive statistics? Use “Descriptive Statistics” option Select “Frequencies” Select variable(s) that you want to analyze Select “Charts” Select the charts you want to view; check the “percentage” option in most cases so that your charts will be more readable Paste command into syntax Highlight syntax and click run PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Creating Graphs PSC 4000-level Stats Workshop Bivariate Analysis in SPSS What types of bivariate analysis might we conduct? Correlation Crosstabs (this is your best option for tables that will make sense to a general audience) Significance tests How do we conduct bivariate analysis? Use options in the “Analyze” menu Select variable(s) that you want analyze Tell SPSS how you want to analyze the values Paste command into syntax Highlight syntax and click run PSC 4000-level Stats Workshop Calculating Correlation PSC 4000-level Stats Workshop Calculating Correlation PSC 4000-level Stats Workshop Calculating Correlation PSC 4000-level Stats Workshop Calculating Correlation PSC 4000-level Stats Workshop Calculating Correlation PSC 4000-level Stats Workshop Guidelines for Correlation Less than .1: Very weak relationship .1 to .2: Weak relationship .2 to .3: Moderate relationship Above .4: Strong relationship PSC 4000-level Stats Workshop Review: Variable Types Categorical Ordinal Represent categories or names (e.g. gender, religion) Rank ordered indicators (e.g. agree, strongly agree) Interval Ordered values where all unit increases are roughly equivalent (e.g. age) PSC 4000-level Stats Workshop Calculating Significance Crosstabs will allow you to see relationships between two variables in more detail Put dependent variable in the rows Put independent variable in the columns Can also test association: Categorical/categorical: Lambda, Cramer’s V Categorical/ordinal: Cramer’s V Ordinal/ordinal: Gamma, Tau-B (same number of categories) Categorical/Interval: Eta Ordinal/Interval: Spearman’s Rho Interval/Interval: Pearson’s R PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop Crosstabs and Tests of Significance PSC 4000-level Stats Workshop