Census 2010: Redistricting Data and Beyond TUESDAY, July 19 1011:30am Today The first detailed results from the 2010 Census became available this spring. Additional data releases include the Groups Quarters data at the end of April, Demographic Profiles in May, Summary File 1 data this summer, Summary File 2 data next winter, and a variety of additional products over the next two years. In addition, the 2010 Census will serve as the basis for new geographies and weights for data from the American Community Survey. This presentation will provide an overview of what data the Census Bureau releases, in what form, and when. Overview of Census Series Four 1.5 hour sessions 2010 Census, ACS, products & content Access and Analysis Resources Geographies Issues in cross-time, cross-survey analyses THURSDAY, July 21 10-11:30am Census 2010: Accessing Census Data. Census data are released in both Summary File format (aggregate counts) and as microdata (individual person and household records). Data in both of these formats is bundled and distributed in different ways by the Census Bureau and other agencies. This workshop will introduce the tools - American Factfinder, DataFerrett, SDA, Social Explorer, direct downloads, browser add-ons - that make working with these data easier. TUESDAY, July 26 10-11:30am Census 2010: Census Geographies. One of the principal applications that researchers and policy-makers use Census data for is to characterize local neighborhoods, communities, labor markets, cities, or regions. This workshop will discuss the types of geographies that census data are produced and distributed for, and how those change over time. THURSDAY, July 28 10 – 11:30am Census 2010: Issues in combining Census data over time. This lunchtime session will talk about some of the issues researchers who want to look at changes over time using Census data confront, including changing questions, changing geographies, and changing contexts. Recurring Questions Content What gets asked, what get distributed? Geography What data is available for what areas? Access What are the best places to explore, download, and understand census products? Multi-legged stools Drawing on multiple resources, surveys, time periods and geographies….and the strengths and drawbacks. What makes the Census important? Huge data collection budget (7 billion in 2010, 3 billion in 2009, up from 1.5 billion 2008, 900 million 2007) Even more money allocated on basis of data collection (~400 Billion annually) Most widely used social science data – – – High quality sample frames Large samples sizes, small geographies Consistency Broad Data Collections Population & Housing Census - every 10 years Economic Census - every 5 years Census of Governments - every 5 years American Community Survey – annually Many additional surveys -- both Demographic & Economic Economic Indicators - each indicator is released on a specific schedule Sampling Frame Master Address File (MAF) – – Housing Units – Official inventory of known living quarters Linked to TIGER Based on Census 2000 MAF and updates from the USPS’ Delivery Sequence File Group Quarters – … and updates from the administrative records and the FSCPE Census 2010 Questionnaire 10 Questions Name Sex Age Relationship (to Household Head) Hispanic Origin Race Owner/Renter Status Plus Whether each member sometimes lives/stays elsewhere Total number living in residence Probe for unreported persons Telephone contact Name Sex Age/DOB Hispanic Origin Race Tenure/Ownership Name Relationship to Person 1 Sex Age/DOB Hispanic Origin Race Tenure/Ownership Per Census Bureau Technical Documentation: “The concept of race, as used by the Census Bureau, reflects self-identification by people according to the race or races with which they most closely identify. These categories are socio-political constructs and should not be interpreted as being scientific or anthropological in nature. Furthermore, the race categories include both racial and national-origin groups.” “Origin can be viewed as the heritage, nationality group, lineage, or country of birth of the person or the person’s parents or ancestors before their arrival in the United States. People who identify their origin as Spanish, Hispanic, or Latino may be of any race.” Where’s all the interesting stuff? In 2000 (and earlier) censuses, the census used more than one form: A “short” form, which asked basic demographic data, just like the 2010 census form (AKA – 100% data) A “long” form, which collected both the items on the short form and a broader set of items about income, education, ancestry, language, disability, employment, etc.. (AKA – sample data) Now, decennial census focuses solely on basic demographic data, and social and economic data are collected in the American Community Survey (ACS) Census 2010 Product Release Schedule Reapportionment data – December 2010 Redistricting data – February-April 2011 SF 1 – June – August 2011 SF 2 - Dec 2011 – April 2012 Same Sex Couple Summary File – Nov 2011 Congressional District Summary File – Jan 2013 AIAN Summary File – April 2013 State Legislative District Summary File – June 2013 PUMS - TBD Census 2010 Products Census 2010 Products P.L. 94-171 (Redistricting Data) State and sub-state counts down to the block level are shown for the total population and the population 18 years and over for 63 race groups; and not Hispanic or Latino origin by 63 race groups. Also shown are housing unit counts by occupancy status (occupied units, vacant units). Basic Census Geography Census 2010 Products P.L. 94-171 (Redistricting Data) In December 1975, the Congress passed Public Law (P.L.) 94-171. This law requires the Census Bureau to make special preparations to provide redistricting data to the 50 states no later than April 1 of the year following a census. 5 phases: 1: Census Bureau collects state legislative district boundaries and associated updates to tabulate legislative districts. 2: Voting District/Block Boundary Suggestion Project (VTD/BBSP) in which states received TIGER/Line® shapefiles and the MAF/TIGER Partnership Software (MTPS) to electronically collect voting district boundaries, feature updates, suggested block boundaries, and corrected state legislative district boundaries. 3: delivery of the data for the 2010 Census. States use this data for their own redistricting. 4: During Phase 4 (2011–2013) the Census Bureau’s Redistricting Data Office will collect the post-2010 Census state legislative and congressional district plans. 5: evaluation and solicitation of recommendations for the 2020 Census Census 2010 Products P.L. 94-171 (Redistricting Data) Census 2010 Products P.L. 94-171 (Redistricting Data) 1980 1990 2000 2010 Race 5 Race AND Spanish Origin 5 Race x 2 Hispanic 63 Race x 2 Hispanic 63 Race x 2 Hispanic Age --- Total, Age 18+ Total, Age 18+ Total, Age 18+ Housing --- Total Housing Units --- Occupied vs Vacant Census 2010 Products VTD Map- Berkeley Census 2010 Product Release Schedule Reapportionment data – December 2010 Redistricting data – February-April 2011 SF 1 – June – August 2011 SF 2 - Dec 2011 – April 2012 Same Sex Couple Summary File – Nov 2011 Congressional District Summary File – Jan 2013 AIAN Summary File – April 2013 State Legislative District Summary File – June 2013 PUMS - TBD 100% Summary Files Summary File 1 (SF1) About 300 tables Counts and cross tabulations Counts for detailed race, Hispanic or Latino groups, and American Indian/Alaska Native tribes (to the tract) Tables repeat for major race groups alone, two or more races, Hispanic or Latino, White not Hispanic or Latino Geography: block, census tract http://www.census.gov/population/www/cen2010/glance/files/SF1_Final_1.5_Internet.xls Examples of SF1 Tabulations P1 P3 P8 P12 P14 P15 P17 Total population (1) Race (71) Hispanic or Latino (17) Sex by age (5-year groupings) (49) Sex by age for the population under 20 (single years of age) (43) Households (1) Average household size (1) Census 2010 Product Release Schedule Reapportionment data – December 2010 Redistricting data – February-April 2011 SF 1 – June – August 2011 SF 2 - Dec 2011 – April 2012 Same Sex Couple Summary File – Nov 2011 Congressional District Summary File – Jan 2013 AIAN Summary File – April 2013 State Legislative District Summary File – June 2013 PUMS - TBD Census 2010 Products Summary File 2 Detailed tables on age, sex, households, families, relationship to householder, housing units, and group quarters. Tables are repeated by 141 race groups, 98 American Indian and Alaska Native tribes/tribal groupings, and 39 Hispanic or Latino origin groups. SF2 - Detailed Asian Categories Asian Asian Indian Bangladeshi Bhutanese Burmese Cambodian Chinese Chinese, except Taiwanese Taiwanese Filipino Hmong Indonesian Japanese Korean Laotian Malaysian Nepalese Pakistani Sri Lankan Thai Vietnamese Other Asian SF2 - Detailed Hispanic/Latino Categories Hispanic or Latino (of any race) Mexican Puerto Rican Cuban Other Hispanic or Latino Dominican Central American Costa Rican Guatemalan Honduran Nicaraguan Panamanian Salvadoran South American Argentinean Bolivian Chilean Colombian Ecuadorian Paraguayan Peruvian Uruguayan Venezuelan Spaniard All other Hispanic or Latino SF 2 - 42 American Indian Categories American Indian Apache Houma South American Indian Arapaho Iroquois Spanish American Indian Blackfeet Kiowa Tohono O'Odham Canadian and French American Indian Lumbee Ute Central American Indian Menominee Yakama Cherokee Mexican American Indian Yaqui Cheyenne Navajo Yuman Chickasaw Osage American Indian tribes, Other Chippewa Ottawa Choctaw Paiute Alaska Native Colville Pima Alaskan Athabascan Comanche Potawatomi Aleut Cree Pueblo Inupiat Creek Puget Sound Salish Tlingit-Haida Crow Seminole Tsimshian Delaware Shoshone Yup'ik Hopi Sioux SF2 Subject Content 36 Population tables at census tract (PCT) level 10 Population tables at county level 10 Housing tables at census tract (HCT) level Other Deccenial-based tabulations Same-Sex Tabulation a single table, but tabulated as reported, not “edited” to unmarried partner Congressional and State Legislative summary files – retabulation of SF1 to new boundaries Other summary files draw upon ACS, rather than decennial EEOC: Fall 2010 Geography (type and level) Occupation Sex Race/ethnicity Citizenship Industry Age Earnings Educational attainment 33 New this time ACS 2006-2010 5-year file Margins of error 2010 Census population base 2010 SOC Occupation categories Additional variable: Citizenship 34 Sample Data Summary Files Summary File ….. 3? 813 tables of data Counts and cross tabulations of sample items (income, occupation, education, rent and value, vehicles available) Lowest level of geography: block group The American Community Survey Replacement for the “long form” of the decennial census. HH sample fully implemented in January 2005, annual sample of around 3 million. Multi-mode: mail, CATI, CAPI ACS: Design of the Sample Annual Sample Size of 3 million addresses Series of Monthly Samples of 250,000 addresses HU sample in each of the 3,141 Counties Areas with smaller populations sampled at higher rates than those with larger populations HU Address sampling rate set by Block Final sampling rate varies between 1.6% and 10% No HU address can be sampled more than once in 5 years ACS: Data Collection HU addresses by three modes – – – Mailout of paper questionnaire in 1st month Telephone (CATI) non-response follow-up in 2nd month Personal visit (CAPI) non-response follow-up in 3rd month to a sub-sample GQ – Personal visit within 6 weeks of sample selection ACS: Sample Design GQ facilities sample for each state Two stratum – – Small – – Small (15 or fewer residents) Large ( more than 15 residents) Data collected on all residents Facility eligible once in 5 years Large – – – Groups of ten residents sub-sampled Number of groups determined by size of facility Facility eligible every year ACS Content - Basic ACS Content Tests 2006 Health Insurance Marital History Veteran's Service-connected Disability 2007 Field of Degree (BA) 2010 Computer Ownership-Internet Access Parental Place of Birth Distribution Formats Like former decennial census data, released in both aggregate and microdata formats Because of change to continuous sampling, however, aggregate data released at different geographic levels with differing collection frames Multi-year estimates Larger geographies have multiple options for estimates – 1 year, 3 year, 5 year Comparing and interpreting overlapping multiyear estimates not intuitive: only differences come from the non-overlapping period. How does the decennial feed into the ACS? Counts Geographies (Tentatively, September, October and December) Demographic (Household) Surveys Survey of Income and Program Participation Survey of Program Dynamics American Housing Survey Current Population Survey Consumer Expenditure Survey (CES) And more….. Survey of Income and Program Participation The Survey of Income and Program Participation (SIPP) program, initiated in 1983, is a longitudinal, multi-panel survey primarily of adults in households in the United States. Sampled households are interviewed at least nine times at four-month intervals and followed over the life of the panel. New samples (panels) are drawn periodically, ranging in size from around 13,000 HHs to around 40,000 HH’s. (annually 1984-1993; 1996, 2001, 2004, 2008) The SIPP attempts to interview all members age 15 and older in the household during the first wave of interviewing. Subsequent interviews may be in-person or by phone, with the same interviewer speaking to the same respondents. New members who join the household are interviewed after they join; departing members are interviewed at their new address. Survey of Income and Program Participation SIPP information falls into two categories: the core information, and other questions (found in "topical modules") that produce in-depth information on specific subjects and are asked at only one or two interviews. SIPP core content covers demographic characteristics, work experience, earnings, program participation, transfer income, and asset income. Current Population Survey The Current Population Survey (CPS) is a monthly survey of about 50,000 to 65,000 households conducted by the Bureau of the Census for the Bureau of Labor Statistics. The survey has been conducted for more than 50 years. The CPS is the primary source of information on the labor force characteristics of the U.S. population. The sample is scientifically selected to represent the civilian noninstitutional population. Households are in the survey eight times: four consecutive months, eight months off, and then a final four months. Estimates obtained from the CPS include employment, unemployment, earnings, hours of work, and other indicators. They are available by a variety of demographic characteristics including age, sex, race, marital status, and educational attainment. They are also available by occupation, industry, and class of worker. Supplemental questions to produce estimates on a variety of topics including school enrollment, income, previous work experience, health, employee benefits, and work schedules are also often added to the regular CPS questionnaire. Current Population Survey Annual Social and Economic Supplement (ASEC) – (formerly called the Annual Demographic Survey or March Supplement) Voting and Registration (November) School Enrollment (October) Food Security; every year since 1995 Computer Ownership Fertility and Marital History Fertility and Birth Expectations Contingent Workers and Alternative Employment Displaced Workers Job Tenure and Occupational Mobility Race and Ethnicity Tobacco Use Work Experience Work Schedules American Housing Survey Provides information on the size and composition of the housing inventory in the United State, neighborhood characteristics, characteristics of occupants. household characteristics, income, housing and neighborhood quality, housing costs, equipment and fuels, size of housing unit, and recent movers. The AHS returns to the same housing units year after year to gather data; therefore, this survey is ideal for analyzing the flow of households through housing. Sample of ~ 65,000 Collected for HUD Separate national (fixed sample for ~50,000, followed since 1985) and metropolitan samples (~3,200 – 4,800 per area, every 6 years, 12-14 areas/year) More detailed data, less geographic detail, than census Consumer Expenditure Survey The Consumer Expenditure Survey (CES) provides information on the buying habits of American consumers and also furnishes data to support periodic revisions of the Consumer Price Index. A new sample is drawn annually, and includes about 60,000 households. The survey consists of two separate components: (1) a quarterly Interview Survey in which each consumer unit in the sample is interviewed every three months over a fifteen-month period, and (2) a Diary Survey completed by the sample consumer units for two consecutive one-week periods. The quarterly interview gathers retrospective data on purchases, and focuses on regular and large expenses. The Diary Survey contains consumer information on small, frequently-purchased items such as food, beverages, food consumed away from home, gasoline, housekeeping supplies, nonprescription drugs and medical supplies, and personal care products and services. Participants are asked to maintain expense records, or diaries, of all purchases made each day for two consecutive one-week periods. Selected Other Data National Crime Victimization Survey – – – National Corrections Reporting Program – Prison Admission and discharges. Variables include incarceration history, current offenses, and total time served. Background information on individuals includes year of birth, sex, age, race, Hispanic origin, and educational attainment. A variety of surveys for NCHS, e.g. – – – 48,000 addresses in 809 PSU’s in US Operating since 1972 7 interviews over 3 ½ year period National Health Interview Survey National Hospital Discharge Survey National Survey of Ambulatory Surgery National Survey of College Graduates – Baseline Survey based on Census 1993 from 1990 census, 2003 from 2000 census Follow-up surveys every 2 years (4 total per decade) On Thursday… I’ve primarily talked about what’s available, not how to get it. On Thursday, I’ll talk about resources for online analysis and exploration, download resources, documentation, and local (Berkeley) resources. Questions/Comments email me at: jons@berkeley.edu