Moving from the Census to the American Community Survey Richard Lycan Population Research Center Portland State University North American Cartographic Information Society October, 2008 PRC – Who we are PRC ► An applied research center: College of Urban and Public Affairs Portland State University ► The Oregon link with the U.S. Census Bureau for: Population Estimates Census State Data Center ► Courses in applied demography ► Contract demographic research ► Staffing a mix of demographers, geographers, and planners ► George Hough, PRC Director, usually with me to answer the tough questions NACIS 2008 2 Outline for Presentation PRC ► Some background and history on the Census of Population and Housing and the American Community Survey (ACS) ► What demographers and other scholars are saying about the pros and cons of the ACS ► Examples showing use of data from the ACS ► Sources for more information NACIS 2008 3 Part 1 – Introduction to the Census and the ACS PRC ► There will be no 2010 Census long form data on income, poverty, housing values, and dozens of other socio-economic characteristics of persons, households, and housing units ► Instead, these data are becoming available from the American Community Survey (ACS) on an annual basis NACIS 2008 4 A Very Short History of the Census ► 1787 - Article 1, Section 2 of the U.S. Constitution requires that a census of the population be conducted every ten years so that the representatives in Congress and direct taxes might be apportioned. ► 1790 - Federal marshals conduct the first census by going door-to-door through the 13 states plus the districts of Maine, Vermont, Kentucky, and the Southwest Territory (Tennessee). ► 1940 - Statistical sampling techniques are introduced, which allow the Census Bureau to create a “long form” answered by only a subset of the population, the source for topics such as income and poverty. ► Mail surveys, computers, the internet, wars, Elvis, TIGER ► 1990’s – Initial planning for “continuous measurement” approach used in ACS ► 2000 - Last use of the “long form” in the Decennial Census. To be replaced by the annual American Community Survey (ACS) ► 2006 - Initial public release of the ACS data. ► 2010 – Release of the ACS data for all geographies down to block group level PRC NACIS 2008 5 An Oversimplified View of the 2000 Census PRC ► Build master address file (MAF) of all residential addresses in US ► Refine and finalize the questionnaire ► Build the maps and geography files ► Test the census – survey, tabulate, evaluate ► Hire thousands of temporary workers ► Collect the data: mail-out, field follow-up ► Tabulate the results: make many adjustments, create summary tables ► Publish the results: print, CD, Internet ► Get ready for the 2010 Census NACIS 2008 6 The Paired Surveys ► Since 1940 there have been two key components to the Decennial Census The “short form” survey is comprised of approximately 7 items asked of every household, the results sometimes known as Summary File 1 The “long form” survey comprised of approximately 40 items asked “long form” the in 2010 of about 1 in 6No households, results sometimes known as Summary File 3 The two components are linked in that data from the “short form” are used to adjust results from the “long form” by age, race, housing units, etc. PRC NACIS 2008 7 How the ACS Collects Data PRC ► Surveys approximately 250,000 households each month ► Uses professional survey staff rather than temporary hires as in the Decennial Census ► Uses a Master Address File to attempt to identify all households in the US. Also gathers data from group quarters residents ► Uses a mail-out census form with follow-up telephone and field interviews ► The survey contains 55 questions for persons and 30 for housing units, about same as long form 2000 census NACIS 2008 8 Tabulation and Publication of the ACS ► Similar to the 2000 Census Tables are similar to those for the sample data (SF3) in the 2000 Census Geographies are similar to those in the 2000 Census data – states, counties, cities, tracts, school districts, zip code areas, and many others, but no block level data. There will be Public Use Microsample (PUMS) data ► Differs from the 2000 Census Data will become available sooner after collection Data will be available on an annual basis Some data will be available for a single year but much will only be published for 3 and 5 year averages There will be no paired “short form” data to use to adjust the ACS survey results PRC NACIS 2008 9 Geographies for the ACS ► The geographies for the ACS are basically the same as for the 2000 Census The hierarchy of states-counties-tracts-block groups Metro areas, cities, census designated places Other geographies – congressional districts, zip code areas, school districts, TAZ’s, rural areas of 60,000+ population ► Some boundaries change: political jurisdictions annually, school districts every two years ► Block groups generally the smallest geography available, no block level data ► Reference maps in PDF format and boundary files in SHP format on WWW.CENSUS.GOV . PRC NACIS 2008 10 The Release Schedule ► ► One year, three year, and five year data Data now available for larger areas such as Single Year Estimates 65,000+ counties, larger school 3 Year Estimates 20,000+ districts, PUMS areas Type of data 5 Year Estimates ► Year of release 2006 2007 2008 2009 2010 2011 2012 Population Size and Area Census tracts and Block Groups Issues of statistical reliability and confidentiality limit detail Averaging for five year data 2005 ► PRC Multi-year data a solution 2006 2010 Release 2007 X 2008 2011 Release 2009 2010 X Averaging for three year data 2005 2008 Release 2009 Release NACIS 2008 2006 X 2007 X 2008 Note that only the beginning and end years differ in the data 11 For Example: ACS Data For Oregon School Districts ► Now – Single year data only available for districts 65,000 and over ► Nov. 2008 – three year data for districts 20,000 and over ► Late 2010 – five year data for all districts, plus 3 year and 1 year data PRC NACIS 2008 12 Part 2 – What Demographers and Others Are Saying about the ACS PRC NACIS 2008 13 Pay Attention to Sampling Errors PRC ► ACS sample smaller: For five year data the ACS sample is smaller than that for the Census long form ► We have sinned: Sampling errors a problem with previous census data, but we tended to ignore, especially in mapping of the data ► You can’t ignore it: The ACS provides explicit data on confidence limits whereas the 2000 census provided calculating formulas NACIS 2008 14 Data Will Be Better for Large Areas, Poorer For Small Areas PRC ► Annual data: At the state, county, large school district level publication of annual data will allow tracking of changes ► Better surveys: The use of professional surveyors and careful use of telephone and field follow-ups will result in more complete questionnaires and less “imputation” for missing entries ► Larger sampling errors: For small areas such as census tracts data will only be available as five year averages. Sampling errors may be large NACIS 2008 15 Weighting of the Sample PRC ► Need to inflate sample: If you sample 1 in 10 households, the results need to be inflated by a factor of roughly 10 (more complicated than this) ► The long and the short of it: The Census long form data could be weighted from the short form data by households, population, race , age, sex ► Weighting weaker for ACS: The ACS data will be weighted from various Census Bureau estimates NACIS 2008 16 Some Suggestions on Using ACS Data – Linda Gage ► ► ► ► ► ► ► PRC Don’t try to analyze all of the data at once, even if you use all of the items. Concentrate on the data items that you already use in your work. Don’t assume the Census data are more accurate than ACS. Compare Census and ACS data to administrative records that you have available. Consider whether the data make sense. Learn to use and provide the standard errors provided with the ACS. Share what you learn about using the ACS with the Census Bureau and other professionals. NACIS 2008 17 Part 3 – Examples Using the ACS PRC NACIS 2008 18 Example 1 – Viewing Tabular Data from the 2005 ACS ► The next data are for Public Use Micro-Sample Areas (PUMAs) and school districts with populations 65,000 and greater ► PUMA’s have more than 65,000 persons and single year data are available for all of them. ► However, sampling errors can be large. PRC Data for sub-county areas Several counties combined to Reach 100,000 threshold NACIS 2008 19 Data for Oregon School Districts ► The “margin of error” is for the 90% confidence level. This is the type of error reporting used in the ACS. These data are for a PUMA, many of which have over 100,000 population ► Note the range for kindergarten enrollment of 1,046 – 2,068 Selected Social Characteristics PUMA 00100, Oregon Population 3 years and over enrolled in school Nursery school, preschool Kindergarten Elementary school (grades 1-8) High school (grades 9-12) College or graduate school PRC Estimate Margin of Error Lower Bound Upper Bound 27,935 1,510 1,557 13,179 6,900 4,789 +/-1,271 +/-468 +/-511 +/-852 +/-745 +/-1,033 26,664 1,042 1,046 12,327 6,155 3,756 29,206 1,978 2,068 14,031 7,645 5,822 EDUCATIONAL ATTAINMENT Population 25 years and over Less than 9th grade 9th to 12th grade, no diploma 75,473 5,263 6,974 +/-1,139 +/-717 +/-982 74,334 4,546 5,992 76,612 5,980 7,956 High school graduate (includes equivalency) Some college, no degree Associate's degree Bachelor's degree Graduate or professional degree 25,060 19,635 4,920 9,021 4,600 +/-2,142 +/-1,657 +/-926 +/-1,185 +/-803 22,918 17,978 3,994 7,836 3,797 27,202 21,292 5,846 10,206 5,403 83.8 NACIS 2008 18 +/-1.5 +/-1.8 82.3 16.2 85.3 19.8 Percent high school graduate or higher Percent bachelor's degree or higher 20 Or, consider the variability in enrollment data for school districts ► PRC The values in red show estimates that have a margin of error at least 20% of the estimate, then at least 10% NACIS 2008 21 Example 2 - Analysis of change using census tract level data PRC ► Housing tenure – Large universe, modest change from year to year. We examine the change from 2001 to 2003 using ACS five year data for 19992003 and 2001-2005. ► Citizenship – Small universe (foreign born) and modest change from year to year. We use the same data as above. NACIS 2008 22 Housing Tenure Change - Census Tracts ► A large universe. Overlapping time periods. ► ► ► % in 2001 % in 2003 Varies from county mean ► ► ► ► ► % Change @ 60% significance @ 80% significance @ 90% significance @ 95% significance PRC Tracts with values significantly different There remain a number of tracts for which change 1999 2000 2001 2002 2003 than the county average over time is statistically significant and they appear of 42.7% 2001 2002 2003 2004 2005 not to be random in space. NACIS 2008 23 Housing Tenure – Grid Map Generalization ► ► ► Choropleth mapping Allocation to census block centroids The highly generalized form of the contours may lend itself to easier verbalization of the spatial patterns. How would one assess statistical significance? Grid map generalization PRC 0.25 sq 0.50 sq 1.00 sq 1.50 sq mi mi mi mi NACIS 2008 24 Change in Foreign Born Who Are Citizens – Census Tracts ► A smaller universe than for housing. Overlapping time periods. 1999 1999 2000 2000 2001 2001 2002 2003 2004 2005 Only a2003 few tracts show that 2002 2004 Significantly 2005 change different from is statistically significant, fewer than county percent of 37.4 one would expect by chance. I would who are citizens not publish this map. % in 2001 ► % in 2003 ► Varies from county mean? ► ► ► ► ► ► This doesn’t look good! % Change @ 60% significance @ 80% significance @ 90% significance @ 95% significance PRC NACIS 2008 25 Example 3 – The Published Tables Are But One Representation of Reality ► The following example shows the large sampling variability in the block group data using % rental households of all single family households. ► The universe of households was recreated from the 100% data from the 1990. ► 20 random samples were drawn on a 1 in 6 basis. ► Choropleth maps were draw for each sample? ► Which map is the right one, the best representation of reality? PRC NACIS 2008 26 Here are 20 “might have been” samples for 1990 SF3 sample data ► The following sequence shows 20 possible sets of data that might have been obtained in the sampling process. ► Note the considerable local variation but also the persistence of some of the high values ► The lesson: Be cautious about interpreting small area sample census data. PRC Which of these 20 representations of reality is closest to the truth? NACIS 2008 27 No telling what you can find in some weird sample ► ► ► PRC On running 100’s of sampling simulations for row houses and condos we think we came up with an image of Elvis ? Can you see an image of Elvis in this map? ? Beware: you can get some strange geographical coincidences in maps of sample data ? NACIS 2008 28 Suggestions on mapping with ACS data PRC ► Reliability - Inform your readers that the map is based on sample data and the reality might vary. Where you can, provide quantitative measures of error. ► Period averages - Inform your readers when the data are for a 3 or 5 year period, and what this means in the particular context. ► Aggregate, Aggregate - Reduce standard errors by aggregating over longer time periods or for larger geographies. Avoid mapping with block group level data. ► Broaden class intervals - Look at the standard errors in the data when setting class intervals so as not to give a false impression of precision. ► Educate your clients - Let them know the limits of the ACS data for mapping. Say “no” to ill conceived requests. ► Share - Share what you learn about mapping with the ACS with colleagues. Provide feedback to the Census Bureau. NACIS 2008 29 Where does one get the ACS data? ► ACS: http://www.census.gov/acs/www/ ► American Factfinder: ► FTP Site: http://www2.census.gov/acs/MultiYearEstimates/ PRC NACIS 2008 30 Additional Resources ► US Census Bureau ACS: http://www.census.gov/acs/www/ ► Population Reference Bureau: The American Community Survey, http://www.prb.org/pdf05/60.3The_American_Community.pdf ► National Academy of Sciences: Using the American Community Survey: Benefits and Challenges, forthcoming in paperback, http://books.nap.edu/catalog.php?record_id=11901 ► Population Research and Policy Review (2006), No 25: A Special Issue on the ACS: http://www.springerlink.com/content/102983/ ► Missouri Census Data Center: Ten Things to Know about The ACS: http://mcdc2.missouri.edu/pub/data/acs2005/Ten_things_to_know.shtml ► Portland Demographic Trends CD – Copies available, see presenter. PRC NACIS 2008 31 The End Richard Lycan Population Research Center Portland State University Portland, Oregon, 97207-0751 lycand@pdx.edu PRC NACIS 2008 32