TOPICS COVERED SO FAR (as of 14 February 2011) NOTE THAT

advertisement
TOPICS COVERED SO FAR (as of 14 February 2011)
NOTE THAT THE EXAM WILL ALSO COVER GLOBAL CLUSTERING AND RIPLEY’S K-FUNCTIONS.
As per our discussions in class – this is a challenging exam and requires that you have completed
the readings online as well as the laboratory materials and kept up with my lectures and the inclass discussions.
GIS TOPICS
 Shapefiles and vectors vs gridded data and raster
 Discrete vs continuous
o Known locations with values or known cases occurrences
 Deterministic versus probabilistic techniques for deriving continuous surfaces
KDE VS IDW

Kernel density estimates a measure of CONCENTRATION (SMOOTHING)
o Kernel – Spatial filter (exclude beyond the filter)
o Bandwidth is the size of the filter we use
 Size; WHAT IS OPTIMAL?
 Fotheringham et al. hopt
 Has a tendency to “oversmooth”
 BASED ON STANDARD DISTANCE
EXAMPLE:
Counts of phenomenon in certain geography


Descriptive spatial statistics
o Mean Center – mean x, y of our point distribution
o Standard distance – Dispersion around the mean center
 THIS IS NOT A DIRECTIONAL MEASURE
o Standard deviation ellipse – this can be directional
o BOTH SD and SDEV are centered on the spatial mean
INTERPOLATION – estimating some value from an un-sampled location from a
known point distribution with measured values
o Inverse Distance Weighting (measures the phenomenon of distance
decay)
o 1/distance^power (default is sq but will see optimal routine in GEOSTAT
ANALYST)
o We calculate UNKNOWN LOCATIONS WITH THE GRIDDED SURFACE WE
ASSIGN
 Grid cell size, power value, number of neighbors (12…. User
assigned value)
o CROSS VALIDATION – hold out some known data from the model, train
the model with the rest, test the model with the hold out after the model
is built known as post hoc testing
 Training for model building
 Testing for model accuracy assessment or evaluation
o RMSE – measure of predicted value at a point versus the actual value at
that point; “lower is better measure”
o Kriging can leap from deterministic to probabilistic through the fitting of
a semi-variogram
 IDW IS NOT SENSITIVE TO DIRECTIONALITY
Example
Contour (these could be derived from the IDW gridded surface) of climate measure; disease
rate surface
Monte Carlo? WTH?









POINT DISTRIBUTION IS RANDOM OR CLUSTERED?
CSR – Complete spatial randomness –
IN DMAP CSR IS DERIVED FROM 999 iterations of point distributions from a random
reassignment of case data to locations
Fixed lattice over the landscape (this is an x, y distribution)
FILTER SIZE TO DETERMINE
LATTICE SIZE (determine how far apart)
Adaptive filter in DMAP IS DEMONINATOR DEPENDENT – literally the filter gets as big as
required to meet the minimum denom setting
Selective filtering – URBAN SETTING; RURAL SETTING (Classes)
Fixed filter – IN DMAP THESE ARE ONLY CIRCULAR
CONFINDENTIALITY
NUMERATOR RULE: any count less than 5 associated with demographic data cannot be
published
DENOMINATOR RULE IN FL: 20k or less has to be aggregated up to a large areal unit until 20k is
exceeded
SPATIAL MASKS OR RANDOM SHAPES
Download