April 15, 2011 59

advertisement
Social Science
Data Bases
CSU Dominguez Hills
April 15, 2011
April 15, 2011
CSU Dominguez Hills
1
Sponsored by
http://www.ssric.org
April 15, 2011
CSU Dominguez Hills
2
Presenters

John L. Korey, Political Science, Cal
Poly Pomona jlkorey@csupomona.edu

Eugene J. Turner, Geography, CSU
Northridge eugene.turner@csun.edu
April 15, 2011
CSU Dominguez Hills
3
Workshop Agenda

Overview of the SSRIC

Data Archives




ICPSR
Field
Census
Roper

Other Data
 Other Workshops

Your Feedback
April 15, 2011
CSU Dominguez Hills
4
SSRIC
http://www.ssric.org
April 15, 2011
CSU Dominguez Hills
5
The Council

Oldest CSU affinity group (since 1972)

Representatives from CSU campuses meet
three times per year

Negotiates with data providers for access to
data

Promotes use of data analysis in research
and teaching
April 15, 2011
CSU Dominguez Hills
6
The Council

Annual student research conference
 April 29, San Jose State
http://www.ssric.org/participate/src
 Sponsors travel to ICPSR summer workshops
in Ann Arbor, Michigan


http://www.ssric.org/participate/icpsr_summer
Field Poll Faculty Fellowship (deadline April
15)
http://www.ssric.org/participate/field_institute
April 15, 2011
CSU Dominguez Hills
7
Social Science Databases

The California State University subscribes to
social science databases to support
 teaching
 research
 administration
 Available to campuses by annual subscription
April 15, 2011
CSU Dominguez Hills
8
Social Science Databases

Assistance from the SSRIC


Note: SSRIC members are not staff data
consultants or technical assistance providers, but
we will do what we can to help students, faculty,
and staff to access data
Campus representatives
http://www.ssric.org/reps
April 15, 2011
CSU Dominguez Hills
9
Social Science Databases

Inter-university Consortium for Political
and Social Research (ICPSR)

Field Poll

Census Data

Roper Center for Public Opinion Research
April 15, 2011
CSU Dominguez Hills
10
Accessing the data bases

On-campus access to data bases is IP
authenticated

Off-campus access



Field: not available
Roper: uses library as proxy server -- works just
like other library data bases
ICPSR: first time, uses library proxy server, after
that, you can access from anywhere but you must
reauthenticate every 6 months
April 15, 2011
CSU Dominguez Hills
11
ICPSR
Inter-university Consortium for
Political and Social Research
http://www.icpsr.org
April 15, 2011
CSU Dominguez Hills
12
ICPSR

Primary social science data archive for the U.S.

Founded in 1962; CSU joined in 1972


Over 600 member institutions worldwide
Services



Vast data archive accessible via the web (ICPSR
Direct)
Summer program
Instructional resources
April 15, 2011
CSU Dominguez Hills
13
ICPSR
Teaching
April 15, 2011
CSU Dominguez Hills
14
ICPSR
April 15, 2011
CSU Dominguez Hills
15
ICPSR
April 15, 2011
CSU Dominguez Hills
16
Using ICPSR Direct

Locate the study

Select the data for download

Download the data

Import the data for statistical analysis
April 15, 2011
CSU Dominguez Hills
17
Find and Analyze Data
Find data
Analyze data online
Topical archives
April 15, 2011
CSU Dominguez Hills
18
Searching for Data
•Type in key words
(e.g., gun control)
•Click search
Search terms are not
case sensitive if all
lower case
April 15, 2011
CSU Dominguez Hills
19
Search Results
Sort by time
period (newest)
Click on link
April 15, 2011
CSU Dominguez Hills
20
Downloading the data
View study description
Download all files
April 15, 2011
CSU Dominguez Hills
21
Agree to the Terms of Use
April 15, 2011
CSU Dominguez Hills
22
Saving the Files
April 15, 2011
CSU Dominguez Hills
23
Select Files and Place Them
Where You Want to Save Them
April 15, 2011
CSU Dominguez Hills
24
What’s in the File?

Look in the layers of subfolders until you
find the DS0001 folder

It will include the:




Codebook (pdf file)
Questionnaire (pdf file)
SPSS file (in this case, it’s a .sav file)
Other files
April 15, 2011
CSU Dominguez Hills
25
Open the sav File in SPSS
April 15, 2011
CSU Dominguez Hills
26
Other Types of Files you
Might Encounter

SPSS portable (.por) files
 Will work on all platforms
 Tell SPSS to look for a .por file

Raw data (ASCII) file with SPSS syntax (.sps) file
 You’ll have to create your own .sav file
 See next slide

Raw data file with only a codebook
 You’ll have to create your own syntax file
 SPSS Intro workshop discusses how to do this
April 15, 2011
CSU Dominguez Hills
27
What’s a Syntax File?

A syntax file is a set of commands to convert a raw
ASCII data file into an SPSS system file
 ASCII + SPSS syntax file = SPSS system file
 For More Information:
http://www.ssric.org/data/icpsr_direct#syntax
April 15, 2011
CSU Dominguez Hills
28
Other ICPSR Resources

SSRIC ICPSR Guide

ICPSR Data Use Tutorial

Social Science Variables Database
April 15, 2011
CSU Dominguez Hills
29
The Field Poll
April 15, 2011
CSU Dominguez Hills
30
The Field (California) Poll
Mervin Field

Established in 1947

Independent, non-partisan survey
of California public opinion

Wide range of political and social
topics

Multiple polls each year
http://www.field.com
April 15, 2011
CSU Dominguez Hills
31
Accessing Field Data
April 15, 2011

Set up your browser

Find polls and data sets

Download data sets
CSU Dominguez Hills
32
Setting up your browser

Requires setting passive FTP access

Can set up browser for folder view or FTP
directory view

See the instructions for details:
http://www.ssric.org/data/field_polls

You will only need to do this once for your
browser and user account
April 15, 2011
CSU Dominguez Hills
33
Finding Polls

Find the poll number
 Field Poll Archives
(http://field.com/fieldpollonline/subscribers/)

American Public Opinion Index


The CALPOLL window


Check with your library for printed and/or
microfiche versions
Accessible via UCData
You will be using the poll number to select
the data set for download.
April 15, 2011
CSU Dominguez Hills
34
Searching Polls with UCDATA

Uses SDA

Does not include last couple of years

Gives you the frequency distributions of the
variables you select
April 15, 2011
CSU Dominguez Hills
35
UC Data
http://ucdata.berkeley.edu
Click on Field Polls
April 15, 2011
CSU Dominguez Hills
36
Field Polls on UCDATA
Click on search
April 15, 2011
CSU Dominguez Hills
37
Searching
I selected 2000’s
April 15, 2011
CSU Dominguez Hills
38
Keywords
Enter gun control
April 15, 2011
CSU Dominguez Hills
39
Search Results
Click View
April 15, 2011
CSU Dominguez Hills
40
Results from SDA
April 15, 2011
CSU Dominguez Hills
41
Downloading the Data
Click Download
April 15, 2011
CSU Dominguez Hills
42
Choosing the File
Double click cal0003 since you
want to download that folder
Note that this is
directory view
April 15, 2011
CSU Dominguez Hills
43
Saving the Files
April 15, 2011
CSU Dominguez Hills
44
Select the Folder
April 15, 2011
CSU Dominguez Hills
45
Open the Data in SPSS
2
1
April 15, 2011
CSU Dominguez Hills
46
Other Field Resources

SSRIC Field Poll Guide

Field News Releases and California Opinion
Indexes
April 15, 2011
CSU Dominguez Hills
47
The U. S. Census of
Population and
Housing
April 15, 2011
CSU Dominguez Hills
48
U. S. Census Collections
Construction
Transportation
For. Trade
Housing
Business
Governments
Agriculture
Mineral Indus.
Manufacturing
Population
1800
1850
1900
1950
1990
1787 – Framers of Constitution – Needed to resolve 2 Issues
How to deal with apportionment of power for House
How to assess taxes
land value, persons?
April 15, 2011
CSU Dominguez Hills
49
Census of Population and Housing
April 15, 2011

Began in 1790

U. S. marshals went to each
household

Continuing fight over
additional social and housing
questions – cost

1960 – Most questions in
sample questionnaire,
Beginning of mailouts and
digital tabulation
CSU Dominguez Hills
50
Modern Census Concerns

A. Geography
 Data aggregated for different area units
 Choice may affect research results
 Each level has a unique Summary Level code
Selected aggregations by geography
Nation
Region (4)
Division (9)
State (52)
County
Place
County Subdivision
Tract
Tract
Block Group
Block Group
Block
Block
April 15, 2011
CSU Dominguez Hills
51
Modern Census Concerns

B. Question and Tabulation Changes

Race, education, income, age, occupation ….
can change between censuses
 Need to consult documentation appendices




i.e. Census 2000 allowed people to indicate more than a
single race which complicates race tabulations
i.e. Census 2010 socioeconomic data collected in an
annual sample
Some questions have a different Universe or population at
risk

ie Educational Attainment: Persons 25+ yrs
Categories are defined by Census Bureau
 ie Age: 0-4 yrs, 5-9 yrs, 10-14yrs …
April 15, 2011
CSU Dominguez Hills
52
Census 1990 and 2000 Tabulations
A. Some data collected on short form from everyone


Race, age, family characteristics
Reported in Summary Files 1 and 2
B. Some questions collected on long form or sample
questionnaire



About 1 in 6 households
Education, income, occupation, ancestry, commuting time,
housing tenure, vehicles….
Reported in Summary Files 3 and 4
C. Note SF2 and SF4 also tabulate data for specified
race, Hispanic, and Native American groups
April 15, 2011
CSU Dominguez Hills
53
Census 2010
The American Community Survey
A. Will replace SF3 and SF4


ACS has similar questions and data
Some tables have fewer categories
B. Based on annual survey of 3 million households



Area units over 65,000 reported each year
Area units over 20,000 reported as 3-year average
Area units less than 20,000 reported as 5-year average
April 15, 2011
CSU Dominguez Hills
54
Census 2010
The American Community Survey
C. Most questions report a Margin of Error


90 percent of values will lie within it
Increases with smaller sample size (ie tracts)

Actual accuracy less than decennial census so data may
be more challenging for small areas

First release of 5-year average (2005-2009) in 2010
April 15, 2011
CSU Dominguez Hills
55
Census Tabulations
Public Use Microdata Sample (PUMS)
A. 5% and 1% sample of households


Are for individuals so users aggregate to categories in
software such as SPSS
Provide great flexibility for analysis
B. 5% geography reported for Public Use Microdata
Areas of at least 100,000 persons


May range from parts of a city to multiple counties
Some are aggregations of numerous non-contiguous areas
April 15, 2011
CSU Dominguez Hills
56
www.census.gov
April 15, 2011
CSU Dominguez Hills
57
Will replace legacy factfinder by fall 2011
Census 2000 will be added in May 2011
April 15, 2011
CSU Dominguez Hills
58
April 15, 2011
CSU Dominguez Hills
59
Click Topics and select Race and Ethnicity
Then select Ancestry
April 15, 2011
CSU Dominguez Hills
60
Click Geography and select City or Town
Then select Ancestry
April 15, 2011
CSU Dominguez Hills
61
At the top window enter: Los Angeles and
click Go
April 15, 2011
CSU Dominguez Hills
62
Select Los Angeles city, California and
click the Add button. Then close the window.
April 15, 2011
CSU Dominguez Hills
63
Choose the QT-P13 button and then
the View button.
April 15, 2011
CSU Dominguez Hills
64
When done, click the Back to Search button
April 15, 2011
CSU Dominguez Hills
65
Click the Download button (Ancestry checked)
April 15, 2011
CSU Dominguez Hills
66
Extract the desired files and open in Excel
April 15, 2011
CSU Dominguez Hills
67
The IPUMS web site for
PUMS data
http://www.ipums.umn.edu/
April 15, 2011
CSU Dominguez Hills
68
April 15, 2011
CSU Dominguez Hills
69
April 15, 2011
CSU Dominguez Hills
70
Select the Household menu and the
Geographic option.
April 15, 2011
CSU Dominguez Hills
71
If you later wish to subset data for a particular
state or PUMA check the STATEFIP and PUMA
buttons. (green check)
April 15, 2011
CSU Dominguez Hills
72
Select the Person menu and then choose
variables from within the list of topics.
April 15, 2011
CSU Dominguez Hills
73
Here, Race was selected. Note the Codes
column for an explanation of content.
April 15, 2011
CSU Dominguez Hills
74
April 15, 2011
CSU Dominguez Hills
75
Race comes in both a General version and a
Detailed version.
April 15, 2011
CSU Dominguez Hills
76
April 15, 2011
CSU Dominguez Hills
77
When done, choose the Select Samples button.
April 15, 2011
CSU Dominguez Hills
78
Deselect the Default U.S. sample button and
click the 2000 5% button for this example.
Click the Submit sample selections button.
3
1
2
April 15, 2011
CSU Dominguez Hills
79
Choose the View Cart option.
April 15, 2011
CSU Dominguez Hills
80
In the Data Cart make sure you have all items.
Then click the Check Out button.
April 15, 2011
CSU Dominguez Hills
81
Enter your email to login or to set up a new
account.
April 15, 2011
CSU Dominguez Hills
82
Leave the Extract file type as Rectangular and
click Continue.
April 15, 2011
CSU Dominguez Hills
83
Under the Set variable options window you
may limit the scope of the extract with the
Select Cases option. Here, choose the Select
Cases button under STATEFIP and Continue.
The Attach Characteristics option allows you to add
characteristics
from the mother,
father,
or spouse to this person.
April 15, 2011
CSU Dominguez
Hills
84
Choose a state [Colorado] from among all
states for your extract. Click Continue.
April 15, 2011
CSU Dominguez Hills
85
Check your extract and make
any needed changes.
Write a brief description of
your extract under the
Description window.
Click the Submit extract
button at the bottom to
submit your extract request.
Races in different states
April 15, 2011
CSU Dominguez Hills
86
You may wait awhile for the
extract or use the link from an
email to download your data.
You will receive an SPSS
syntax file, a codebook of
your variables, and the data.
Use the SPSS syntax file to
convert the data to a SAV file
for processing.
April 15, 2011
CSU Dominguez Hills
87
Census Data at ICPSR
April 15, 2011
CSU Dominguez Hills
88
Find & Analyze Data
April 15, 2011
CSU Dominguez Hills
89
April 15, 2011
CSU Dominguez Hills
90
April 15, 2011
CSU Dominguez Hills
91
Find & Analyze Data
April 15, 2011
CSU Dominguez Hills
92
Find & Analyze Data
April 15, 2011
CSU Dominguez Hills
93
Other ICPSR Demographic Data
April 15, 2011
CSU Dominguez Hills
94
Other ICPSR Demographic Data
April 15, 2011
CSU Dominguez Hills
95
The Roper Center for
Public Opinion
Research
April 15, 2011
CSU Dominguez Hills
96
http://www.ropercenter.uconn.edu
Roper
Express
iPoll
Roper
Explorer
April 15, 2011
CSU Dominguez Hills
97
Searching for Data Sets
Hover over
Quick Links
Then click “Search
for Datasets”
April 15, 2011
CSU Dominguez Hills
98
Sign In
April 15, 2011
CSU Dominguez Hills
99
Sign In
April 15, 2011
CSU Dominguez Hills
100
Searching for Data Sets

Specify
 Keywords
 Date range
 Country
April 15, 2011
CSU Dominguez Hills
101
Searching for Data Sets
Enter
Search
Terms
Click on
“Search”
April 15, 2011
CSU Dominguez Hills
102
Search Results
Download Data
Examine Questions
Online Analysis
April 15, 2011
CSU Dominguez Hills
103
Dataset Abstract
Study
Description
Download
Data
April 15, 2011
CSU Dominguez Hills
104
Downloading the Files
Click on
“Download Dataset”
Click on
SPSS portable file
April 15, 2011
CSU Dominguez Hills
105
Downloading the Files
April 15, 2011
CSU Dominguez Hills
106
What if nothing is available?

Some documentation and/or data sets are not available
on-line. If you are interested in one of these, email the
survey number and the title to:
Ed Nelson at ednelson@csufresno.edu
April 15, 2011
CSU Dominguez Hills
107
View Questions in iPOLL
April 15, 2011
CSU Dominguez Hills
108
Get Question Details
April 15, 2011
CSU Dominguez Hills
109
Question Details
April 15, 2011
CSU Dominguez Hills
110
Online Analysis
April 15, 2011
CSU Dominguez Hills
111
Run Analysis
April 15, 2011
CSU Dominguez Hills
112
Analysis - Results
April 15, 2011
CSU Dominguez Hills
113
More About iPOLL

Data base of over 500,000 questions

Includes frequency distributions allowing students
to do trend analysis online

You will not need a user name or password from oncampus, since the CSU will be IP authenticated.
You will be prompted for your email address
April 15, 2011
CSU Dominguez Hills
114
Log in to iPOLL
April 15, 2011
CSU Dominguez Hills
115
Searching iPOLL

Search for keyword

Click on Boolean or Wildcard for help

Let’s search for relig% (note: % is the
wildcard)

Select the topic of religion

Use date from 1/1/2000 to today’s date
April 15, 2011
CSU Dominguez Hills
116
Search
April 15, 2011
CSU Dominguez Hills
117
Results
April 15, 2011
CSU Dominguez Hills
118
Instructional Resources
Click here
Then choose one
of these
April 15, 2011
CSU Dominguez Hills
119
Fundamentals of Polling
April 15, 2011
CSU Dominguez Hills
120
Analyzing Polls
April 15, 2011
CSU Dominguez Hills
121
Teaching Tools
April 15, 2011
CSU Dominguez Hills
122
Other Roper Resources

SSRIC Roper Guide
April 15, 2011
CSU Dominguez Hills
123
Other Data
April 15, 2011
CSU Dominguez Hills
124
Other Data

Instructional Subsets
(http://www.ssric.org/node/361)
 Other (http://www.ssric.org/data/other)
April 15, 2011
CSU Dominguez Hills
125
Other Workshops
April 15, 2011
CSU Dominguez Hills
126
Workshops on SPSS

SPSS – introductory workshop




Creating SPSS data file
Transforming data
Descriptive statistics
SPSS – intermediate workshop




Crosstabulations
Using means
Correlation and regression
Charts
April 15, 2011
CSU Dominguez Hills
127
Workshop on Online
Statistical Analysis

SDA (Survey Documentation and Analysis)

Requires no site license

Extremely easy to learn


Excellent for classes where you want to introduce
students to statistical analysis
Tools available – frequencies, crosstabulations,
correlation and regression
April 15, 2011
CSU Dominguez Hills
128
Workshop on Data in
the Classroom

Statistical literacy – issues and examples






Causality
Experimental and survey design
Unintentional consequences
Sampling and statistical inference
Change over time
Where do we get the data?
April 15, 2011
CSU Dominguez Hills
129
Feedback?
Please fill out an evaluation form.
Thanks!
April 15, 2011
CSU Dominguez Hills
130
Download