Census 2010

advertisement
Census 2010: Redistricting Data and
Beyond
TUESDAY, July 19 1011:30am
Today
The first detailed results from the 2010 Census became available
this spring. Additional data releases include the Groups Quarters
data at the end of April, Demographic Profiles in May, Summary
File 1 data this summer, Summary File 2 data next winter, and a
variety of additional products over the next two years. In addition,
the 2010 Census will serve as the basis for new geographies and
weights for data from the American Community Survey. This
presentation will provide an overview of what data the Census
Bureau releases, in what form, and when.
Overview of Census Series
Four 1.5 hour sessions




2010 Census, ACS, products & content
Access and Analysis Resources
Geographies
Issues in cross-time, cross-survey analyses
THURSDAY, July 21 10-11:30am
Census 2010: Accessing Census Data. Census data are released in both
Summary File format (aggregate counts) and as microdata (individual
person and household records). Data in both of these formats is bundled
and distributed in different ways by the Census Bureau and other agencies.
This workshop will introduce the tools - American Factfinder, DataFerrett,
SDA, Social Explorer, direct downloads, browser add-ons - that make
working with these data easier.
TUESDAY, July 26 10-11:30am
Census 2010: Census Geographies. One of the principal applications that
researchers and policy-makers use Census data for is to characterize local
neighborhoods, communities, labor markets, cities, or regions. This
workshop will discuss the types of geographies that census data are
produced and distributed for, and how those change over time.
THURSDAY, July 28 10 – 11:30am
Census 2010: Issues in combining Census data over time. This
lunchtime session will talk about some of the issues researchers who want
to look at changes over time using Census data confront, including changing
questions, changing geographies, and changing contexts.
Recurring Questions

Content
What gets asked, what get distributed?

Geography
What data is available for what areas?

Access
What are the best places to explore, download, and understand census products?

Multi-legged stools
Drawing on multiple resources, surveys, time periods and geographies….and the
strengths and drawbacks.
What makes the Census important?

Huge data collection budget
(7 billion in 2010, 3 billion in 2009, up from 1.5 billion 2008, 900
million 2007)


Even more money allocated on basis of data
collection (~400 Billion annually)
Most widely used social science data
–
–
–
High quality sample frames
Large samples sizes, small geographies
Consistency
Broad Data Collections






Population & Housing Census - every 10 years
Economic Census - every 5 years
Census of Governments - every 5 years
American Community Survey – annually
Many additional surveys -- both Demographic &
Economic
Economic Indicators - each indicator is released on a
specific schedule
Sampling Frame

Master Address File (MAF)
–
–

Housing Units
–

Official inventory of known living quarters
Linked to TIGER
Based on Census 2000 MAF and updates from
the USPS’ Delivery Sequence File
Group Quarters
–
… and updates from the administrative records
and the FSCPE
Census 2010 Questionnaire
10 Questions
Name
Sex
Age
Relationship (to Household Head)
Hispanic Origin
Race
Owner/Renter Status
Plus
Whether each member sometimes lives/stays elsewhere
Total number living in residence
Probe for unreported persons
Telephone contact
Name
Sex
Age/DOB
Hispanic Origin
Race
Tenure/Ownership
Name
Relationship to Person 1
Sex
Age/DOB
Hispanic Origin
Race
Tenure/Ownership
Per Census Bureau Technical Documentation:
“The concept of race, as used by the Census Bureau,
reflects self-identification by people according to the
race or races with which they most closely identify.
These categories are socio-political constructs and
should not be interpreted as being scientific or
anthropological in nature. Furthermore, the race
categories include both racial and national-origin groups.”
“Origin can be viewed as the heritage, nationality group,
lineage, or country of birth of the person or the person’s
parents or ancestors before their arrival in the United
States. People who identify their origin as Spanish, Hispanic,
or Latino may be of any race.”
Where’s all the interesting stuff?
In 2000 (and earlier) censuses, the census used more
than one form:

A “short” form, which asked basic demographic data, just like the
2010 census form (AKA – 100% data)

A “long” form, which collected both the items on the short form and a
broader set of items about income, education, ancestry, language,
disability, employment, etc.. (AKA – sample data)
Now, decennial census focuses solely on basic
demographic data, and social and economic data are
collected in the American Community Survey (ACS)
Census 2010
Product Release Schedule
Reapportionment data – December 2010
Redistricting data – February-April 2011
SF 1 – June – August 2011
SF 2 - Dec 2011 – April 2012
Same Sex Couple Summary File – Nov 2011
Congressional District Summary File – Jan 2013
AIAN Summary File – April 2013
State Legislative District Summary File – June 2013
PUMS - TBD
Census 2010 Products
Census 2010 Products
P.L. 94-171 (Redistricting Data)
State and sub-state counts down to the block level are shown
for the total population and the population 18 years and over for
63 race groups; and not Hispanic or Latino origin by 63 race
groups. Also shown are housing unit counts by occupancy
status (occupied units, vacant units).
Basic Census Geography
Census 2010 Products
P.L. 94-171 (Redistricting Data)
In December 1975, the Congress passed Public Law (P.L.) 94-171. This law requires the
Census Bureau to make special preparations to provide redistricting data to the 50 states
no later than April 1 of the year following a census.
5 phases:
1: Census Bureau collects state legislative district boundaries and associated updates to
tabulate legislative districts.
2: Voting District/Block Boundary Suggestion Project (VTD/BBSP) in which states received
TIGER/Line® shapefiles and the MAF/TIGER Partnership Software (MTPS) to
electronically collect voting district boundaries, feature updates, suggested block
boundaries, and corrected state legislative district boundaries.
3: delivery of the data for the 2010 Census. States use this data for their own redistricting.
4: During Phase 4 (2011–2013) the Census Bureau’s Redistricting Data Office will collect the
post-2010 Census state legislative and congressional district plans.
5: evaluation and solicitation of recommendations for the 2020 Census
Census 2010 Products
P.L. 94-171 (Redistricting Data)
Census 2010 Products
P.L. 94-171 (Redistricting Data)
1980
1990
2000
2010
Race
5 Race AND
Spanish Origin
5 Race x
2 Hispanic
63 Race x
2 Hispanic
63 Race x
2 Hispanic
Age
---
Total, Age 18+
Total, Age 18+
Total, Age 18+
Housing
---
Total Housing Units
---
Occupied vs Vacant
Census 2010 Products
VTD Map- Berkeley
Census 2010
Product Release Schedule
Reapportionment data – December 2010
Redistricting data – February-April 2011
SF 1 – June – August 2011
SF 2 - Dec 2011 – April 2012
Same Sex Couple Summary File – Nov 2011
Congressional District Summary File – Jan 2013
AIAN Summary File – April 2013
State Legislative District Summary File – June 2013
PUMS - TBD
100% Summary Files
Summary File 1 (SF1)





About 300 tables
Counts and cross tabulations
Counts for detailed race, Hispanic or Latino groups,
and American Indian/Alaska Native tribes (to the tract)
Tables repeat for major race groups alone, two or
more races, Hispanic or Latino, White not Hispanic or
Latino
Geography: block, census tract
http://www.census.gov/population/www/cen2010/glance/files/SF1_Final_1.5_Internet.xls
Examples of SF1 Tabulations
P1
P3
P8
P12
P14
P15
P17
Total population (1)
Race (71)
Hispanic or Latino (17)
Sex by age (5-year groupings) (49)
Sex by age for the population
under 20 (single years of age) (43)
Households (1)
Average household size (1)
Census 2010
Product Release Schedule
Reapportionment data – December 2010
Redistricting data – February-April 2011
SF 1 – June – August 2011
SF 2 - Dec 2011 – April 2012
Same Sex Couple Summary File – Nov 2011
Congressional District Summary File – Jan 2013
AIAN Summary File – April 2013
State Legislative District Summary File – June 2013
PUMS - TBD
Census 2010 Products
Summary File 2
Detailed tables on age, sex, households, families, relationship
to householder, housing units, and group quarters.
Tables are repeated by 141 race groups, 98 American Indian
and Alaska Native tribes/tribal groupings, and 39 Hispanic or
Latino origin groups.
SF2 - Detailed Asian
Categories
Asian
Asian Indian
Bangladeshi
Bhutanese
Burmese
Cambodian
Chinese
Chinese, except Taiwanese
Taiwanese
Filipino
Hmong
Indonesian
Japanese
Korean
Laotian
Malaysian
Nepalese
Pakistani
Sri Lankan
Thai
Vietnamese
Other Asian
SF2 - Detailed Hispanic/Latino Categories
Hispanic or Latino
(of any race)
Mexican
Puerto Rican
Cuban
Other Hispanic or Latino
Dominican
Central American
Costa Rican
Guatemalan
Honduran
Nicaraguan
Panamanian
Salvadoran
South American
Argentinean
Bolivian
Chilean
Colombian
Ecuadorian
Paraguayan
Peruvian
Uruguayan
Venezuelan
Spaniard
All other Hispanic or Latino
SF 2 - 42 American Indian Categories
American Indian
Apache
Houma
South American Indian
Arapaho
Iroquois
Spanish American Indian
Blackfeet
Kiowa
Tohono O'Odham
Canadian and French American Indian
Lumbee
Ute
Central American Indian
Menominee
Yakama
Cherokee
Mexican American Indian
Yaqui
Cheyenne
Navajo
Yuman
Chickasaw
Osage
American Indian tribes, Other
Chippewa
Ottawa
Choctaw
Paiute
Alaska Native
Colville
Pima
Alaskan Athabascan
Comanche
Potawatomi
Aleut
Cree
Pueblo
Inupiat
Creek
Puget Sound Salish
Tlingit-Haida
Crow
Seminole
Tsimshian
Delaware
Shoshone
Yup'ik
Hopi
Sioux
SF2 Subject Content
36
Population tables at census tract (PCT) level
10
Population tables at county level
10
Housing tables at census tract (HCT) level
Other Deccenial-based tabulations



Same-Sex Tabulation a single table, but
tabulated as reported, not “edited” to
unmarried partner
Congressional and State Legislative
summary files – retabulation of SF1 to new
boundaries
Other summary files draw upon ACS, rather
than decennial
EEOC: Fall 2010
Geography
(type and
level)
Occupation
Sex
Race/ethnicity
Citizenship
Industry
Age
Earnings
Educational
attainment
33
New this time

ACS 2006-2010 5-year file

Margins of error

2010 Census population base

2010 SOC Occupation categories

Additional variable: Citizenship
34
Sample Data Summary Files
Summary File ….. 3?

813 tables of data

Counts and cross tabulations of sample
items (income, occupation, education,
rent and value, vehicles available)

Lowest level of geography: block group
The American Community Survey



Replacement for the “long form” of the
decennial census.
HH sample fully implemented in January
2005, annual sample of around 3 million.
Multi-mode: mail, CATI, CAPI
ACS: Design of the Sample







Annual Sample Size of 3 million addresses
Series of Monthly Samples of 250,000 addresses
HU sample in each of the 3,141 Counties
Areas with smaller populations sampled at higher
rates than those with larger populations
HU Address sampling rate set by Block
Final sampling rate varies between 1.6% and 10%
No HU address can be sampled more than once in 5
years
ACS: Data Collection

HU addresses by three modes
–
–
–

Mailout of paper questionnaire in 1st month
Telephone (CATI) non-response follow-up in 2nd
month
Personal visit (CAPI) non-response follow-up in
3rd month to a sub-sample
GQ
–
Personal visit within 6 weeks of sample selection
ACS: Sample Design


GQ facilities sample for each state
Two stratum
–
–

Small
–
–

Small (15 or fewer residents)
Large ( more than 15 residents)
Data collected on all residents
Facility eligible once in 5 years
Large
–
–
–
Groups of ten residents sub-sampled
Number of groups determined by size of facility
Facility eligible every year
ACS Content - Basic
ACS Content Tests
2006
Health Insurance
Marital History
Veteran's Service-connected Disability
2007
Field of Degree (BA)
2010
Computer Ownership-Internet Access
Parental Place of Birth
Distribution Formats


Like former decennial census data, released
in both aggregate and microdata formats
Because of change to continuous sampling,
however, aggregate data released at different
geographic levels with differing collection
frames
Multi-year estimates


Larger geographies have multiple options for
estimates – 1 year, 3 year, 5 year
Comparing and interpreting overlapping multiyear estimates not intuitive: only differences
come from the non-overlapping period.
How does the decennial feed into the ACS?
Counts
Geographies
(Tentatively, September, October and December)
Demographic (Household) Surveys






Survey of Income and Program Participation
Survey of Program Dynamics
American Housing Survey
Current Population Survey
Consumer Expenditure Survey (CES)
And more…..
Survey of Income and Program
Participation
The Survey of Income and Program Participation (SIPP) program, initiated
in 1983, is a longitudinal, multi-panel survey primarily of adults in
households in the United States.

Sampled households are interviewed at least nine times at four-month
intervals and followed over the life of the panel. New samples (panels) are
drawn periodically, ranging in size from around 13,000 HHs to around
40,000 HH’s. (annually 1984-1993; 1996, 2001, 2004, 2008)

The SIPP attempts to interview all members age 15 and older in the
household during the first wave of interviewing. Subsequent interviews may
be in-person or by phone, with the same interviewer speaking to the same
respondents.

New members who join the household are interviewed after they join;
departing members are interviewed at their new address.
Survey of Income and Program
Participation


SIPP information falls into two categories: the core
information, and other questions (found in "topical
modules") that produce in-depth information on
specific subjects and are asked at only one or two
interviews.
SIPP core content covers demographic
characteristics, work experience, earnings, program
participation, transfer income, and asset income.
Current Population Survey

The Current Population Survey (CPS) is a monthly survey of about 50,000 to 65,000
households conducted by the Bureau of the Census for the Bureau of Labor Statistics. The
survey has been conducted for more than 50 years.

The CPS is the primary source of information on the labor force characteristics of the U.S.
population. The sample is scientifically selected to represent the civilian noninstitutional
population.

Households are in the survey eight times: four consecutive months, eight months off, and
then a final four months.

Estimates obtained from the CPS include employment, unemployment, earnings, hours of
work, and other indicators. They are available by a variety of demographic characteristics
including age, sex, race, marital status, and educational attainment. They are also available
by occupation, industry, and class of worker.

Supplemental questions to produce estimates on a variety of topics including school
enrollment, income, previous work experience, health, employee benefits, and work
schedules are also often added to the regular CPS questionnaire.
Current Population Survey

Annual Social and Economic Supplement (ASEC)
–













(formerly called the Annual Demographic Survey or March Supplement)
Voting and Registration (November)
School Enrollment (October)
Food Security; every year since 1995
Computer Ownership
Fertility and Marital History
Fertility and Birth Expectations
Contingent Workers and Alternative Employment
Displaced Workers
Job Tenure and Occupational Mobility
Race and Ethnicity
Tobacco Use
Work Experience
Work Schedules
American Housing Survey






Provides information on the size and composition of the housing
inventory in the United State, neighborhood characteristics,
characteristics of occupants. household characteristics, income,
housing and neighborhood quality, housing costs, equipment and
fuels, size of housing unit, and recent movers.
The AHS returns to the same housing units year after year to gather
data; therefore, this survey is ideal for analyzing the flow of
households through housing.
Sample of ~ 65,000
Collected for HUD
Separate national (fixed sample for ~50,000, followed since 1985) and
metropolitan samples (~3,200 – 4,800 per area, every 6 years, 12-14
areas/year)
More detailed data, less geographic detail, than census
Consumer Expenditure Survey

The Consumer Expenditure Survey (CES) provides information on the buying habits of
American consumers and also furnishes data to support periodic revisions of the Consumer
Price Index. A new sample is drawn annually, and includes about 60,000 households.

The survey consists of two separate components: (1) a quarterly Interview Survey in which
each consumer unit in the sample is interviewed every three months over a fifteen-month
period, and (2) a Diary Survey completed by the sample consumer units for two
consecutive one-week periods.

The quarterly interview gathers retrospective data on purchases, and focuses on regular
and large expenses.

The Diary Survey contains consumer information on small, frequently-purchased items
such as food, beverages, food consumed away from home, gasoline, housekeeping
supplies, nonprescription drugs and medical supplies, and personal care products and
services. Participants are asked to maintain expense records, or diaries, of all purchases
made each day for two consecutive one-week periods.
Selected Other Data

National Crime Victimization Survey
–
–
–

National Corrections Reporting Program
–

Prison Admission and discharges. Variables include incarceration history, current offenses, and total
time served. Background information on individuals includes year of birth, sex, age, race, Hispanic
origin, and educational attainment.
A variety of surveys for NCHS, e.g.
–
–
–

48,000 addresses in 809 PSU’s in US
Operating since 1972
7 interviews over 3 ½ year period
National Health Interview Survey
National Hospital Discharge Survey
National Survey of Ambulatory Surgery
National Survey of College Graduates
–
Baseline Survey based on Census
 1993 from 1990 census, 2003 from 2000 census
 Follow-up surveys every 2 years (4 total per decade)
On Thursday…


I’ve primarily talked about what’s available,
not how to get it.
On Thursday, I’ll talk about resources for
online analysis and exploration, download
resources, documentation, and local
(Berkeley) resources.
Questions/Comments
email me at: jons@berkeley.edu
Download