Sparse Versus Dense Spatial Data R.L. (Bob) Nielsen Professor of Agronomy Purdue University West Lafayette, IN 47907-1150 Email: rnielsen@purdue.edu Spatial data & GIS Spatial data are the fundamental components of agricultural GIS. Growers hope to minimize or manage spatial yield variability in order to increase or maximize profitability. The causes of yield variability must therefore be determined, which requires the acquisition of additional spatial data sets or ‘layers’ of information. Spatial data sets can be ... Dense Many data points per acre e.g., grain yield data sets often consist of 300 to 600 data points per acre Sparse Fewer data points per acre e.g., typical grid soil sampling results in an average of 0.4 data point per acre GIS software … Interpolates or fills in the spatial 'holes' in the data to create pretty color maps that mysteriously become the essence of truth for believers. Dense data sets have fewer 'holes' per acre than do sparse Thus, less interpolation is required Thus, the resulting map is intuitively more believable Yield data are dense … One sec. readings at 3 mph equal to 1 data point every 4.4 ft 600 data points per acre with a 6-row combine header Yield maps are believable … Data Map Very little interpolation required to create yield map. Soil sample data are sparse Typical 2.5 acre sampling grid Only 0.4 point per acre Organic matter surface map Interpolated from o.m. values of 2.5 acre soil sample data Reality check Soil surface color from reclassified aerial IR Mediocre correlation Soil o.m. surface map interpolated from 2.5acre samples Half-acre soil sampling More intense sampling Five times as many data points as before Still sparse relative to aerial imagery Reality check Soil surface color from reclassified aerial IR Improved correlation Soil o.m. surface map interpolated from halfacre samples Consequence of sparse sampling Poor interpolation of spatial variability 2.5 ac soil O.M. map Aerial image, reclassified half-ac soil O.M. map The challenge … In order to interpret yield maps wisely, you will need far more data layers than just soil nutrient levels and soil types. Many factors influence yield! Acquiring these data will require forethought, time, timeliness, attention to detail, and (of course) money! The good news Some of the additional data sets you will acquire will be dense and, therefore, satisfactory for creating spatial maps Topography Soil EC Aerial photography Satellite imagery The bad news Some of the additional data sets you will acquire will be sparse data sets, the maps from which must be taken with the proverbial ‘grain of salt’. Soil nutrients Plant populations Stand uniformity Plant height Insect pressure Disease pressure Weed pressure Soil compaction Bottom Line: Data collected by field scouting, including soil nutrient sampling, are often too sparse for GIS programs to accurately interpolate spatial relationships Yet, more intensive data collection is often cost- and time-prohibitive Example: Plant Counts in Late Planted Soybean Approx. 10 plant population checks per acre on a fairly equal grid basis 292 total data points on 30 acres Cost: Three hikers, two GPS units, one day Directed sampling Added another 80 population checks on the fly as our eyeballs dictated 372 data points Cost: Included in first day’s work Revisited field, second day GIS map did not agree completely with our eyeballs, so revisited field Added another 54 population checks Total of 426 data points on 30 ac. Cost: Three hikers, one GPS unit, one day Soy population map Based on original grid samples (10 per acre) > 200k 150 to 200k 100 to 150k 50 to 100k < 50k Did add’nl sampling help? Original data Minor, but potentially useful improvements Original data plus directed samples on the fly Including revisit Reality check Green vegetation index (NDVI) from IR aerial image (8 July) Not perfect, but acceptable Our map of populations (17 June) Recommendations Sample as densely as time and money will allow. From the perspective of crop scouting or monitoring, you can never have too much data! Remember, you rarely have a visual idea of what the true spatial pattern is! So, sometimes directed sampling is not feasible. Recommendations Sample in as much of an equidistant pattern as is logistically possible. Better for GIS software, easier on the person in the field. Begin with a grid pattern, modify with additional directed sampling as suggested by other data layers or your own eyes. Thanks for your attention! Farming is a gamble, so let’s practice …. Pick a card and concentrate on it! I will make your card disappear! Did you concentrate hard? I believe your card is missing!