Statistical Analysis & Dissemination of Census Data Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Statistical Analysis and Dissemination of Census Data Outline The Power of Maps Dynamic Census Atlases Overview & Examples Digital Geographic Data for Dissemination Overview & Example Spatial Analysis Techniques Introduction and Example Overview & Cost and Benefits Digital Data Dissemination Strategies and Users Overview of Users Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Anyone or anything can be associated with a known location in the world Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 CHILE: HOUSING AND POPULATION CENSUS DISTRICTS 2002 Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Tsunami Affected Areas in Gizo, Solomon Islands Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 The power of maps Maps Maps Maps Maps Maps Maps communicate a concept or an idea. are often meant to support textual information appeal to the viewer’s curiosity summarize large amounts of information concisely can be used for description, exploration, confirmation, tabulation encourage comparisons: Between different areas on the same map: where are population densities highest? Between different maps: is child mortality higher in the districts of province A than in province B? same area: where and by how much do literacy rates for males and females differ in the districts? Between maps for different time periods: did fertility rates decline since the last census? Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Dynamic census atlases Alternative to a static census atlas Publishing of a digital map and database together with mapping software can allow users to produce custom maps of census indicators. Normally includes digital boundary files at a lower resolution than the full census database to allow fast drawing and low disk usage closely integrated attribute table should contain only a selected number of census indicators. Densities and ratios that are appropriate for mapping should already be calculated. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Dynamic census atlases The data provider should therefore provide an easy-to-use package together with the boundaries and data. The use of that package should require minimal training and experience. The application should be “plug-and-play”—after installation, the user should immediately be able to produce maps Drill-down options for different geographic selections Interactive area delineation options (e.g. select schools in a district) Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 A screenshot of Ukraine’s dynamics census atlas Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques the main use of spatial analysis is for census products and services Techniques include: buffering, linear interpolation, point pattern analysis, and cartograms, etc. All offer functionality beyond standard thematic (choropleth) mapping, with many tools now available in both commercial and open-source software programs. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Some prevalent forms of spatial analysis especially useful for use with population data include: Queries Distance measurements Transformations Buffering point-in-polygon analysis Polygon overlay analysis Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Queries: Often this is the first step in an analysis, where one seeks to create a subset of units such as populated places with certain characteristics, allowing the user to check how typical an observation is against other observations They use a GIS program to answer simple questions posed by the user, with no changes in the database and no new data produced. An example of a query using geocoded census data is, select all towns with a population greater than 1,000 persons. These towns can then have their attributes summarized, for instance, to measure their total fertility rates against smaller towns and villages, then map the results The term exploratory data analysis refers to investigations of patterns and trends in data using such techniques as querying Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Area delineation E.g. Interactive determination of school districts with the same number of children in each school grade by aggregating census dissemination areas Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Distance measurements Easily done with all GIS programs, using the centroids (or center points) of cities, towns, and villages. An analysis can be done to select villages located more than a kilometer from a school, clinic, or water source. These can then be further analyzed using the attribute information for the populated places themselves. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Transformations Methods of spatial analysis that use simple geometric, arithmetic or logical rules to create new datasets Transformations can include operations that convert raster into vector data, or a stream of GPS coordinates into a route or a boundary Of all the transformational techniques, buffering is the most well known and important Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Buffering (transformation) Involves building a new data layer by identifying all areas that are within a certain specified distance of the original. Buffering can be performed on points, lines and polygons and can be weighted by attribute values. Buffering can be used to model travel time, for instance, by creating a “catchment area” around a particular feature such as a school or a clinic. This provides a measure of accessibility that can be mapped across the extent of a country. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 “is near to”: Buffer Operations • Point buffer • Affected area around a Hospital • Catchment area of a water source Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Buffer Operations • Line buffer • How many people live near the polluted river? • What is the area impacted by highway noise? Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Buffer Operations • Polygon buffer • Area around a reservoir where development should not be permitted Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques point-in-polygon analysis Determines whether a point lies inside or outside a polygon. Can be used to compare geocoded village centroids lying inside and outside hazardous areas such as tropical storm tracks or earthquake zones. Polygon overlay analysis Involves comparison between the locations of two different polygonal data layers. For example, the boundaries of two administrative districts could be compared to troubleshoot errors in the field enumeration process Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Spatial interpolation A spatial analysis method designed to fill in values that lie between observations A variety of methods including inverse-distance weighting and kriging are used to estimate the values of unsampled sites based on Tobler’s first law that all nearby objects are more similar than distant objects Kriging: interpolation technique for obtaining statistically unbiased estimates of spatial variation of known points such as surface elevations or yield measurements utilizing a set of control points In kriging, the general properties of a surface are modeled to estimate the missing parts of the surface Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Example of linear interpolation creating contours Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Thiessen polygons Have the unique property that each polygon contains only one input point (e.g. settlements), and any location within a polygon is closer to its associated point than to the point of any other polygon. This method assumes that the values of the unsampled data are equivalent to those of the sampled points. Thiessen polygons illustrated Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Areas of influence Commuting distances: daily commuters flow Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Descriptive summaries are a spatial equivalent of descriptive statistics (such as mean and standard deviation) that represent the essence of a dataset in 1 or 2 numbers Centers of population are the two-dimensional equivalent of a statistical mean and are often used to display the center of population using the weighted average of x and y coordinates of populated points Point pattern or cluster analysis regards the distribution of points in space irrespective of their actual locations to determine whether patterns are random, clustered, or dispersed hot spots are where high values are surrounded by high values, or cold spots, where low values are surrounded by low values. These are particularly useful for identifying populations at risk as well. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Spatial Analysis Techniques Cartograms sometimes used to display census results The areas of the original polygons are expanded or contracted based on their attribute values such as population size or voting habits Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Modelling: smoothing Evolution of the population beetwen two censuses Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Digital Geographic Data for Dissemination Demand for digital databases that consist of extractions of the census agency’s digital geographic master database will only increase Census data are an important input in policy planning and academic analysis in many fields. Health service provision, educational resource allocation, design of utilities and infrastructure, and electoral planning are some applications where government agencies require spatially referenced small area population statistics. Commercial users employ such data for marketing applications and location decisions. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Digital Geographic Data for Dissemination Benefits and costs Benefits: Unsurpassed detail and precision, the potential use of census data in numerous applications--especially when overlaid on other geographic data such as terrain, and the relative ease of management and storage of thousands of units Costs: expense in processing and data management, possible data disclosure issues, and quality control;costs of metadata production should be factored into the equation as well Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Digital Data Dissemination Strategies and Users The wide range of potential users of disaggregated census data means that the NSO needs to pursue a multi-leveled digital data dissemination strategy. Broadly, we can distinguish between the following types of users: Advanced GIS users Computer literate users Novice users Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Digital Data Dissemination Strategies and Users Advanced GIS users work easily with large datasets and can use ftp to access them Require extensive metadata. Sometimes called data extractors or “power users” They will want access to spatial and attribute information in a comprehensive digital geographic format The census office needs to supply comprehensive documentation on the geographic parameters used for the geographic database as well as on the individual census variables The spatial information will be distributed in an open geographic format that can be easily converted into any number of commercial GIS formats Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Digital Data Dissemination Strategies and Users Computer literate users Government, commercial or private sector users who want to be able to browse the thematic information in a census database spatially. Want to produce thematic maps and thus need to be able to perform simple manipulation of cartographic parameters. Simple analytical functions such as aggregation of census units to customdesigned regions should also be possible. This group of users is best served with a comprehensive, pre-packaged application that is designed for a commercial or freely available desktop mapping package. Documentation requirements are somewhat smaller, since the users are unlikely to change the geographic parameters of the database or perform more advanced GIS operations. Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 Digital Data Dissemination Strategies and Users Novice users Largely want to view pre-prepared maps on a computer and perhaps perform some basic queries Best data distribution strategy is often to produce a self-contained digital census atlas This atlas could consists of a series of static map images, for example, in the form of a slide show Or it could be a very simple mapping interface with pre-designed map views that allow basic queries Both, static maps and a simple map interface, can be made accessible through the Internet Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008 GRACIAS POR SU ATENCIÓN Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing Minsk, Belarus, 8-12 December 2008