Spatial Statistics I RESM 575 Spring 2011 Lecture 10 Spatial Statistics Measuring geographic distributions Testing statistical significance Identifying patterns Measuring geographic distributions Identify spatial characteristics of a distribution Where is the center? What feature is most central? How are features dispersed around the center? 3 ArcTools in ArcGIS 4 Where is the center? Mean Center tool Computes the average X and Y coordinates of all features Creates a new point feature 5 Example 6 Example 7 Mean center tool More common use: To compare distributions of different types of features or to find the center of features based on an attribute value 8 Example 9 Example 10 Example 11 What is the most central feature? Central feature tool Identifies the most centrally located feature Feature having the lowest total distance to all other features 12 Example 13 Example 14 What is the most central feature? More interestingly is by adding a weight in the analysis such as population We are now finding not just which site is most central but which is the most accessible to the greatest number of people. 15 Example 16 Measuring feature distribution Standard distance tool Measures distribution of features around the mean Result is a summary statistic representing distance If circle is large, incidents are widespread If small, incidents are more localized 17 Example 18 Distributional trends Directional distribution (standard ellipse) tool Identify spatial trends in the distribution of features Uses Compare distributions Examine different time periods Show compactness and orientation 19 Example 20 Testing statistical significance The next section of identifying patterns or later spatial relationships allows us to perform significance tests on the results before accepting them 21 Using significance tests with spatial data Spatial data contradicts some of the assumptions of inferential statistics You need to be aware of these limitations! 22 Assumptions Testing a random sample With GIS data in a database you may not know if the data was randomly sampled How large the sample is in relation to the population? Even with randomness assumed, spatial data often violates the independence of observations in a sample Spatial data is rarely evenly distributed across a region 23 For spatial pattern analysis… The null hypothesis is that features are evenly distributed across the study area Hard to imagine this being true You have to make one of two common sampling assumptions: randomization or normalization 24 Identifying Patterns Why study patterns? Range from completely clustered to completely dispersed 25 Identifying patterns (applications) Forestry applications USFS may measure the pattern of clear cuts to ensure sufficient contiguous forest habitat remaining Agency may allow a level of clustering of clear cuts and then make sure it is not exceeded Wildlife studies if population is dispersed then species can live in a wide range of habitats, if clustered then it has very specific habitat requirements 26 Goal for analyzing spatial patterns Are there underlying spatial processes influencing the locations of our features? Are our features randomly located throughout the study area, or are they displaying a clustering or dispersed pattern? 27 Approaches for analyzing spatial patterns Approach #1 Global calculations Identifies overall patterns or trends in the data Effective for complex messy data Interested in broad overall results Work by comparing feature locations and/or attributes to a theoretical random distribution to determine if you have statistically significant clustering or dispersion 28 Analyzing spatial patterns Can be found with the Average Nearest Neighbor which does not require specifying an attribute, or If based on an attribute we can test for Spatial autocorrelation using the (Moran’s I) tool Things that are closer are more alike than things that are not Measures similarity of neighboring features Identifies if features are clustered or dispersed 29 After tool runs, go to Geoprocessing - > Results 30 Double click to open the html report for your latest run 31 32 After tool runs, go to Geoprocessing - > Results 33 34 Global SA interpretation I > 0 positive spatial autocorrelation I < 0 negative spatial autocorrelation The more positive or more negative, the greater amount of spatial autocorrelation 35 2nd Approach Local calculations Identify the extent and locations of clustering Answer where do we have spatial clustering Process every feature within the context of its neighboring features in order to determine whether it represents a spatial outlier, or if part of a statistically significant spatial cluster 36 37 38 Locate the hot spots This is a local question that requires a hot spot analysis (Getis-Ord Gi*) tool Indicates the extent to which each feature is surrounded by similarly high or low values Where do features with similar attribute values cluster spatially together 39 Getis-Ord Gi* tool Identifies where clustering occurs in both high and low values Calculates a Z score for each feature High Z = hot spot (when a feature has a high value and it is surrounded by other features with high values) Low Z = cold spot (when we have features with low values surrounded by other features with low values) 40 Notes on the Z score Z score is a measure of standard deviation It is a reference value that’s associated with a standard normal distribution A very high or low Z score would be found in the tails 41 More notes on Z score A very high or low Z score means that the pattern deviates significantly from a hypothetical random pattern For example, when using a 95% CI, Z scores are -1.96 and +1.96 If Z is between these -1.96 and +1.96 you can’t reject the null You are seeing one version of a random pattern If very high or low (ie -2.5 or +5.4) you have a pattern that’s too unusual to be a pattern of random choice so we reject the null hypoth REMEMBER: The null hypothesis is that features are evenly distributed across the study area 42 Reference Mitchell, A. 2005. ESRI Guide to GIS Analysis, Volume 2. ESRI press, Redlands, CA. ArcGIS 10 Help 43