Social Science Data Bases CSU Dominguez Hills April 15, 2011 April 15, 2011 CSU Dominguez Hills 1 Sponsored by http://www.ssric.org April 15, 2011 CSU Dominguez Hills 2 Presenters John L. Korey, Political Science, Cal Poly Pomona jlkorey@csupomona.edu Eugene J. Turner, Geography, CSU Northridge eugene.turner@csun.edu April 15, 2011 CSU Dominguez Hills 3 Workshop Agenda Overview of the SSRIC Data Archives ICPSR Field Census Roper Other Data Other Workshops Your Feedback April 15, 2011 CSU Dominguez Hills 4 SSRIC http://www.ssric.org April 15, 2011 CSU Dominguez Hills 5 The Council Oldest CSU affinity group (since 1972) Representatives from CSU campuses meet three times per year Negotiates with data providers for access to data Promotes use of data analysis in research and teaching April 15, 2011 CSU Dominguez Hills 6 The Council Annual student research conference April 29, San Jose State http://www.ssric.org/participate/src Sponsors travel to ICPSR summer workshops in Ann Arbor, Michigan http://www.ssric.org/participate/icpsr_summer Field Poll Faculty Fellowship (deadline April 15) http://www.ssric.org/participate/field_institute April 15, 2011 CSU Dominguez Hills 7 Social Science Databases The California State University subscribes to social science databases to support teaching research administration Available to campuses by annual subscription April 15, 2011 CSU Dominguez Hills 8 Social Science Databases Assistance from the SSRIC Note: SSRIC members are not staff data consultants or technical assistance providers, but we will do what we can to help students, faculty, and staff to access data Campus representatives http://www.ssric.org/reps April 15, 2011 CSU Dominguez Hills 9 Social Science Databases Inter-university Consortium for Political and Social Research (ICPSR) Field Poll Census Data Roper Center for Public Opinion Research April 15, 2011 CSU Dominguez Hills 10 Accessing the data bases On-campus access to data bases is IP authenticated Off-campus access Field: not available Roper: uses library as proxy server -- works just like other library data bases ICPSR: first time, uses library proxy server, after that, you can access from anywhere but you must reauthenticate every 6 months April 15, 2011 CSU Dominguez Hills 11 ICPSR Inter-university Consortium for Political and Social Research http://www.icpsr.org April 15, 2011 CSU Dominguez Hills 12 ICPSR Primary social science data archive for the U.S. Founded in 1962; CSU joined in 1972 Over 600 member institutions worldwide Services Vast data archive accessible via the web (ICPSR Direct) Summer program Instructional resources April 15, 2011 CSU Dominguez Hills 13 ICPSR Teaching April 15, 2011 CSU Dominguez Hills 14 ICPSR April 15, 2011 CSU Dominguez Hills 15 ICPSR April 15, 2011 CSU Dominguez Hills 16 Using ICPSR Direct Locate the study Select the data for download Download the data Import the data for statistical analysis April 15, 2011 CSU Dominguez Hills 17 Find and Analyze Data Find data Analyze data online Topical archives April 15, 2011 CSU Dominguez Hills 18 Searching for Data •Type in key words (e.g., gun control) •Click search Search terms are not case sensitive if all lower case April 15, 2011 CSU Dominguez Hills 19 Search Results Sort by time period (newest) Click on link April 15, 2011 CSU Dominguez Hills 20 Downloading the data View study description Download all files April 15, 2011 CSU Dominguez Hills 21 Agree to the Terms of Use April 15, 2011 CSU Dominguez Hills 22 Saving the Files April 15, 2011 CSU Dominguez Hills 23 Select Files and Place Them Where You Want to Save Them April 15, 2011 CSU Dominguez Hills 24 What’s in the File? Look in the layers of subfolders until you find the DS0001 folder It will include the: Codebook (pdf file) Questionnaire (pdf file) SPSS file (in this case, it’s a .sav file) Other files April 15, 2011 CSU Dominguez Hills 25 Open the sav File in SPSS April 15, 2011 CSU Dominguez Hills 26 Other Types of Files you Might Encounter SPSS portable (.por) files Will work on all platforms Tell SPSS to look for a .por file Raw data (ASCII) file with SPSS syntax (.sps) file You’ll have to create your own .sav file See next slide Raw data file with only a codebook You’ll have to create your own syntax file SPSS Intro workshop discusses how to do this April 15, 2011 CSU Dominguez Hills 27 What’s a Syntax File? A syntax file is a set of commands to convert a raw ASCII data file into an SPSS system file ASCII + SPSS syntax file = SPSS system file For More Information: http://www.ssric.org/data/icpsr_direct#syntax April 15, 2011 CSU Dominguez Hills 28 Other ICPSR Resources SSRIC ICPSR Guide ICPSR Data Use Tutorial Social Science Variables Database April 15, 2011 CSU Dominguez Hills 29 The Field Poll April 15, 2011 CSU Dominguez Hills 30 The Field (California) Poll Mervin Field Established in 1947 Independent, non-partisan survey of California public opinion Wide range of political and social topics Multiple polls each year http://www.field.com April 15, 2011 CSU Dominguez Hills 31 Accessing Field Data April 15, 2011 Set up your browser Find polls and data sets Download data sets CSU Dominguez Hills 32 Setting up your browser Requires setting passive FTP access Can set up browser for folder view or FTP directory view See the instructions for details: http://www.ssric.org/data/field_polls You will only need to do this once for your browser and user account April 15, 2011 CSU Dominguez Hills 33 Finding Polls Find the poll number Field Poll Archives (http://field.com/fieldpollonline/subscribers/) American Public Opinion Index The CALPOLL window Check with your library for printed and/or microfiche versions Accessible via UCData You will be using the poll number to select the data set for download. April 15, 2011 CSU Dominguez Hills 34 Searching Polls with UCDATA Uses SDA Does not include last couple of years Gives you the frequency distributions of the variables you select April 15, 2011 CSU Dominguez Hills 35 UC Data http://ucdata.berkeley.edu Click on Field Polls April 15, 2011 CSU Dominguez Hills 36 Field Polls on UCDATA Click on search April 15, 2011 CSU Dominguez Hills 37 Searching I selected 2000’s April 15, 2011 CSU Dominguez Hills 38 Keywords Enter gun control April 15, 2011 CSU Dominguez Hills 39 Search Results Click View April 15, 2011 CSU Dominguez Hills 40 Results from SDA April 15, 2011 CSU Dominguez Hills 41 Downloading the Data Click Download April 15, 2011 CSU Dominguez Hills 42 Choosing the File Double click cal0003 since you want to download that folder Note that this is directory view April 15, 2011 CSU Dominguez Hills 43 Saving the Files April 15, 2011 CSU Dominguez Hills 44 Select the Folder April 15, 2011 CSU Dominguez Hills 45 Open the Data in SPSS 2 1 April 15, 2011 CSU Dominguez Hills 46 Other Field Resources SSRIC Field Poll Guide Field News Releases and California Opinion Indexes April 15, 2011 CSU Dominguez Hills 47 The U. S. Census of Population and Housing April 15, 2011 CSU Dominguez Hills 48 U. S. Census Collections Construction Transportation For. Trade Housing Business Governments Agriculture Mineral Indus. Manufacturing Population 1800 1850 1900 1950 1990 1787 – Framers of Constitution – Needed to resolve 2 Issues How to deal with apportionment of power for House How to assess taxes land value, persons? April 15, 2011 CSU Dominguez Hills 49 Census of Population and Housing April 15, 2011 Began in 1790 U. S. marshals went to each household Continuing fight over additional social and housing questions – cost 1960 – Most questions in sample questionnaire, Beginning of mailouts and digital tabulation CSU Dominguez Hills 50 Modern Census Concerns A. Geography Data aggregated for different area units Choice may affect research results Each level has a unique Summary Level code Selected aggregations by geography Nation Region (4) Division (9) State (52) County Place County Subdivision Tract Tract Block Group Block Group Block Block April 15, 2011 CSU Dominguez Hills 51 Modern Census Concerns B. Question and Tabulation Changes Race, education, income, age, occupation …. can change between censuses Need to consult documentation appendices i.e. Census 2000 allowed people to indicate more than a single race which complicates race tabulations i.e. Census 2010 socioeconomic data collected in an annual sample Some questions have a different Universe or population at risk ie Educational Attainment: Persons 25+ yrs Categories are defined by Census Bureau ie Age: 0-4 yrs, 5-9 yrs, 10-14yrs … April 15, 2011 CSU Dominguez Hills 52 Census 1990 and 2000 Tabulations A. Some data collected on short form from everyone Race, age, family characteristics Reported in Summary Files 1 and 2 B. Some questions collected on long form or sample questionnaire About 1 in 6 households Education, income, occupation, ancestry, commuting time, housing tenure, vehicles…. Reported in Summary Files 3 and 4 C. Note SF2 and SF4 also tabulate data for specified race, Hispanic, and Native American groups April 15, 2011 CSU Dominguez Hills 53 Census 2010 The American Community Survey A. Will replace SF3 and SF4 ACS has similar questions and data Some tables have fewer categories B. Based on annual survey of 3 million households Area units over 65,000 reported each year Area units over 20,000 reported as 3-year average Area units less than 20,000 reported as 5-year average April 15, 2011 CSU Dominguez Hills 54 Census 2010 The American Community Survey C. Most questions report a Margin of Error 90 percent of values will lie within it Increases with smaller sample size (ie tracts) Actual accuracy less than decennial census so data may be more challenging for small areas First release of 5-year average (2005-2009) in 2010 April 15, 2011 CSU Dominguez Hills 55 Census Tabulations Public Use Microdata Sample (PUMS) A. 5% and 1% sample of households Are for individuals so users aggregate to categories in software such as SPSS Provide great flexibility for analysis B. 5% geography reported for Public Use Microdata Areas of at least 100,000 persons May range from parts of a city to multiple counties Some are aggregations of numerous non-contiguous areas April 15, 2011 CSU Dominguez Hills 56 www.census.gov April 15, 2011 CSU Dominguez Hills 57 Will replace legacy factfinder by fall 2011 Census 2000 will be added in May 2011 April 15, 2011 CSU Dominguez Hills 58 April 15, 2011 CSU Dominguez Hills 59 Click Topics and select Race and Ethnicity Then select Ancestry April 15, 2011 CSU Dominguez Hills 60 Click Geography and select City or Town Then select Ancestry April 15, 2011 CSU Dominguez Hills 61 At the top window enter: Los Angeles and click Go April 15, 2011 CSU Dominguez Hills 62 Select Los Angeles city, California and click the Add button. Then close the window. April 15, 2011 CSU Dominguez Hills 63 Choose the QT-P13 button and then the View button. April 15, 2011 CSU Dominguez Hills 64 When done, click the Back to Search button April 15, 2011 CSU Dominguez Hills 65 Click the Download button (Ancestry checked) April 15, 2011 CSU Dominguez Hills 66 Extract the desired files and open in Excel April 15, 2011 CSU Dominguez Hills 67 The IPUMS web site for PUMS data http://www.ipums.umn.edu/ April 15, 2011 CSU Dominguez Hills 68 April 15, 2011 CSU Dominguez Hills 69 April 15, 2011 CSU Dominguez Hills 70 Select the Household menu and the Geographic option. April 15, 2011 CSU Dominguez Hills 71 If you later wish to subset data for a particular state or PUMA check the STATEFIP and PUMA buttons. (green check) April 15, 2011 CSU Dominguez Hills 72 Select the Person menu and then choose variables from within the list of topics. April 15, 2011 CSU Dominguez Hills 73 Here, Race was selected. Note the Codes column for an explanation of content. April 15, 2011 CSU Dominguez Hills 74 April 15, 2011 CSU Dominguez Hills 75 Race comes in both a General version and a Detailed version. April 15, 2011 CSU Dominguez Hills 76 April 15, 2011 CSU Dominguez Hills 77 When done, choose the Select Samples button. April 15, 2011 CSU Dominguez Hills 78 Deselect the Default U.S. sample button and click the 2000 5% button for this example. Click the Submit sample selections button. 3 1 2 April 15, 2011 CSU Dominguez Hills 79 Choose the View Cart option. April 15, 2011 CSU Dominguez Hills 80 In the Data Cart make sure you have all items. Then click the Check Out button. April 15, 2011 CSU Dominguez Hills 81 Enter your email to login or to set up a new account. April 15, 2011 CSU Dominguez Hills 82 Leave the Extract file type as Rectangular and click Continue. April 15, 2011 CSU Dominguez Hills 83 Under the Set variable options window you may limit the scope of the extract with the Select Cases option. Here, choose the Select Cases button under STATEFIP and Continue. The Attach Characteristics option allows you to add characteristics from the mother, father, or spouse to this person. April 15, 2011 CSU Dominguez Hills 84 Choose a state [Colorado] from among all states for your extract. Click Continue. April 15, 2011 CSU Dominguez Hills 85 Check your extract and make any needed changes. Write a brief description of your extract under the Description window. Click the Submit extract button at the bottom to submit your extract request. Races in different states April 15, 2011 CSU Dominguez Hills 86 You may wait awhile for the extract or use the link from an email to download your data. You will receive an SPSS syntax file, a codebook of your variables, and the data. Use the SPSS syntax file to convert the data to a SAV file for processing. April 15, 2011 CSU Dominguez Hills 87 Census Data at ICPSR April 15, 2011 CSU Dominguez Hills 88 Find & Analyze Data April 15, 2011 CSU Dominguez Hills 89 April 15, 2011 CSU Dominguez Hills 90 April 15, 2011 CSU Dominguez Hills 91 Find & Analyze Data April 15, 2011 CSU Dominguez Hills 92 Find & Analyze Data April 15, 2011 CSU Dominguez Hills 93 Other ICPSR Demographic Data April 15, 2011 CSU Dominguez Hills 94 Other ICPSR Demographic Data April 15, 2011 CSU Dominguez Hills 95 The Roper Center for Public Opinion Research April 15, 2011 CSU Dominguez Hills 96 http://www.ropercenter.uconn.edu Roper Express iPoll Roper Explorer April 15, 2011 CSU Dominguez Hills 97 Searching for Data Sets Hover over Quick Links Then click “Search for Datasets” April 15, 2011 CSU Dominguez Hills 98 Sign In April 15, 2011 CSU Dominguez Hills 99 Sign In April 15, 2011 CSU Dominguez Hills 100 Searching for Data Sets Specify Keywords Date range Country April 15, 2011 CSU Dominguez Hills 101 Searching for Data Sets Enter Search Terms Click on “Search” April 15, 2011 CSU Dominguez Hills 102 Search Results Download Data Examine Questions Online Analysis April 15, 2011 CSU Dominguez Hills 103 Dataset Abstract Study Description Download Data April 15, 2011 CSU Dominguez Hills 104 Downloading the Files Click on “Download Dataset” Click on SPSS portable file April 15, 2011 CSU Dominguez Hills 105 Downloading the Files April 15, 2011 CSU Dominguez Hills 106 What if nothing is available? Some documentation and/or data sets are not available on-line. If you are interested in one of these, email the survey number and the title to: Ed Nelson at ednelson@csufresno.edu April 15, 2011 CSU Dominguez Hills 107 View Questions in iPOLL April 15, 2011 CSU Dominguez Hills 108 Get Question Details April 15, 2011 CSU Dominguez Hills 109 Question Details April 15, 2011 CSU Dominguez Hills 110 Online Analysis April 15, 2011 CSU Dominguez Hills 111 Run Analysis April 15, 2011 CSU Dominguez Hills 112 Analysis - Results April 15, 2011 CSU Dominguez Hills 113 More About iPOLL Data base of over 500,000 questions Includes frequency distributions allowing students to do trend analysis online You will not need a user name or password from oncampus, since the CSU will be IP authenticated. You will be prompted for your email address April 15, 2011 CSU Dominguez Hills 114 Log in to iPOLL April 15, 2011 CSU Dominguez Hills 115 Searching iPOLL Search for keyword Click on Boolean or Wildcard for help Let’s search for relig% (note: % is the wildcard) Select the topic of religion Use date from 1/1/2000 to today’s date April 15, 2011 CSU Dominguez Hills 116 Search April 15, 2011 CSU Dominguez Hills 117 Results April 15, 2011 CSU Dominguez Hills 118 Instructional Resources Click here Then choose one of these April 15, 2011 CSU Dominguez Hills 119 Fundamentals of Polling April 15, 2011 CSU Dominguez Hills 120 Analyzing Polls April 15, 2011 CSU Dominguez Hills 121 Teaching Tools April 15, 2011 CSU Dominguez Hills 122 Other Roper Resources SSRIC Roper Guide April 15, 2011 CSU Dominguez Hills 123 Other Data April 15, 2011 CSU Dominguez Hills 124 Other Data Instructional Subsets (http://www.ssric.org/node/361) Other (http://www.ssric.org/data/other) April 15, 2011 CSU Dominguez Hills 125 Other Workshops April 15, 2011 CSU Dominguez Hills 126 Workshops on SPSS SPSS – introductory workshop Creating SPSS data file Transforming data Descriptive statistics SPSS – intermediate workshop Crosstabulations Using means Correlation and regression Charts April 15, 2011 CSU Dominguez Hills 127 Workshop on Online Statistical Analysis SDA (Survey Documentation and Analysis) Requires no site license Extremely easy to learn Excellent for classes where you want to introduce students to statistical analysis Tools available – frequencies, crosstabulations, correlation and regression April 15, 2011 CSU Dominguez Hills 128 Workshop on Data in the Classroom Statistical literacy – issues and examples Causality Experimental and survey design Unintentional consequences Sampling and statistical inference Change over time Where do we get the data? April 15, 2011 CSU Dominguez Hills 129 Feedback? Please fill out an evaluation form. Thanks! April 15, 2011 CSU Dominguez Hills 130