Document 11863919

advertisement
This file was created by scanning the printed publication.
Errors identified by the software have been corrected;
however, some errors may remain.
Explicit Consideration of Multiple
Landscape Scales While Selecting
Spatial Resolutions
-
John B. Collins and Curtis E. Woodcock
Abstract. - The information content of a digital image - defined as
the spatial variation of the image data - depends on data resolution.
Spatial data may vary at a number of different scales simultaneously,
so the choice of an appropriate resolution depends on the information
content at the landscape scale relevant to a particular application. A
useful framework for studying variation at different scales in digital
images is provided by a nested hierarchical model of the landscape.
The landscape is modeled as being composed of spatial units of a typical size, which are themselves composed of smaller units, and so on.
The overall variation of the image data can be decomposed into variation due to effects of different levels of such a hierarchy. The hierarchical decomposition of the variation in a digital image is reflected by a
similar hierarchical decomposition of the scene semivariance. Decomposed semivariograms are used to assess the relationships between data
resolution and information content at particular landscape scales.
INTRODUCTION
The issue of scale is of fundamental importance to disciplines concerned
with phenomena which vary spatially. Yet the relationship between the scale at
which a phenomenon is observed and that phenomenon's characteristic scale of
variation is not completely understood. In this discussion, scale of observation
is defined as the spatial interval over which observations are made. When data
are collected by remote sensing devices, this term corresponds to sensor resolution, or size of the Instantaneous Field of View (IFOV). The characteristic
scale of a phenomenon is the range of resolutions over which that phenomenon
may be characterized. These definitions correspond to those proposed by Quattrochi ( 1993).
Research Assistant, Boston Liniversiry Department of Geography. Boston. MA,
Associm Professor of Geography. Boston Universily, Boston, MA.
In remote sensing investigations, resolution is an important consideration.
The information contained in an image, which can be identified as the overall
variance of the image data, declines as resolution becomes more coarse. The
simplest view of the issue is one which regards the selection of an appropriate
resolution as a compromise between greater information content (fine resolu-.
tions) and more manageable data volume (coarse resolutions). But for some
applications, greater spatial variation is not always desireable. For example,
classification of remotely sensed imagery is most successful when pixel size is
large enough that some amount of averaging of within-class variation occurs
(Markham and Townshend, 1981).
The fact that variation of spatial data occurs at multiple scales has an
explanation in the field of landscape ecology. It is becoming an accepted tenet
of that field that landscapes are often arranged hierarchically (O'Neill et al.,
1986). That is, landscapes are often composed of objects which are internally
homogeneous and externally variable with respect to spatially varying
processes. These objects may be composed of smaller objects, and so on. For
example, a landscape may be composed of a number of watersheds, each containing a collection of vegetation stands, which each contain individual plants.
Associated with each level of such a hierarchy is a set of processes acting at
that level. The range of sizes of objects on a given level approximately defines
the spatial frequencies at which those processes vary - i.e. their characteristic
scales. So multiple scales of variation of spatial data arise from a scene's
hierarchical structure.
The choice of an appropriate resolution for a particular investigation
depends on prior knowledge of the characteristic scale of variation of the
phenomenon of interest. This paper presents methods to help make such a
choice under explicit consideration of the hierarchical nature of landscapes. A
review of the use of hierarchical partitionings of spatial data is presented, followed by an explanation of geostatistical methods for investigating resolutiondependent characteristics of digital images.
SPATIAL HIERARCHIES
A number of studies use a hierarchical model to identify multiple
landscape scales. A common paradigm involves partitioning a scene into
objects of regular size and shape - usually squares or rectangles - each of
which are characterized by some aggregation of the underlying spatially distributed variable (Grieg-Smith, 1963; Townshend and Justice, 1995). Then there
is a clear and explicit relationship between any level of the hierarchy and a
scale of observation. By examining some attribute of each level of the hierarchy - especially the data variance - multiple scales of variation may be
revealed.
An alternate approach is the use of hierarchies of irregular-shaped objects,
such as political boundaries (Moellering and Tobler, 1972). The characteristic
scales associated with any level are identified with the range of sizes of objects
on that level. However, the hierarchies so constructed can be rather arbitrary.
Woodcock et al. (1994) describe a forest mapping project which makes use of a
hierarchical model dividing the landscape into ecologically meaningful units.
(see also Woodcock and Harward, 1992; Collins et al., 1995). Under this
model, the entire image is considered to be composed of a number of vegetation classes such as conifer, hardwood, and brush. Each of these can be broken down into smaller homogeneous patches, referred to as regions. Within
the conifer class, for example, a region corresponds to a forest stand. Regions,
in turn, are collections of individual plants. Such a hierarchy can be associated
with ecosystem processes, which tend to be more homogeneous within units
and heterogeneous between units at any level.
The partitioning of a scene into ecologically meaningful units prior to
analysis constitutes explicit identification of the characteristic scales under
study. Knowledge of these characteristic scales is important for assessing the
relationship between resolution and information content for a given application.
Statistical Analysis of Hierarchal Data
A general and mathematically precise definition of a hierarchical scene
model is needed. Consider a regionalized variable Z which depends on vector
location s within some domain set D!o). When discussing analysis of digital
images, D(o) is a subset of two-dimensional space R ~ and
, Z could be radiance
in some wavelength band or any other spatially-varying quantity. The hierarchical model consists of partitioning the domain set D(o) into a family of objects
(sets) {D(*)},which form level 1 of a hierarchy. Each of the level-1 objects
may be divided into a number of level-2 objects, which can collectively be
referred to as
Further partitions proceed in an obvious manner. If level
n is the highest level of the hierarchy, then the family of domain sets {D(,)}
is
the collection of infitesimal points on which Z is defined. Any point s E D(o)
will be contained in exactly one set associated with each level of the hierarchy.
Such a set of level k is D ( k ) ( ~ )This
.
terminology is illustrated by Figure 1.
Figure 1. A hierarchical partitioning of a scene. A. The set of level-1 objects
forming a partition of D(0). B. The level-2 objects {D(2)}further partitioning the level-1
objects.
Determination of the relative contributions of different levels of a hierarchy to overall variation in a data set is the goal of Hierarchical Analysis of
Variance (Moellering and Tobler, 1972). Under the Hierarchical ANOVA
model, the observed value at a point in the scene is considered equal to the
overall mean for the area, plus a deviation, or effect, characteristic of each level
o_f the big-archy. The effect of level k of a hierarchy at point s is defined as
D(k)(s)-D(k-l)(~),
where D(k)(s)is the mean value of Z over the level-k object
containing s . Note that each effect constitutes a regionalized variable in its
own right. Further elaboration of the Hierarchical ANOVA model can be
found in standard texts on the subject (e.g. Dunn and Clark, 1974). The model
leads to the following important result.
where o * ~represents the overall variance of Z , and 02(i)represents the variance of the effect of level i . The overall data variance decomposes into the
sum of variances of each effect. Unfortunately, the ability to make statistical
inferences from a hierarchical partitioning of variance relies on the assumption
of independence of observations. Since this assumption is badly violated for
most spatial data sets, the technique is suited only for descriptive purposes.
When the regionalized variable Z exhibits appropriate stationarity characteristics, its variation is described by the scene semivariogram (Isaaks and
Srivastava, 1989; Cressie, 1993).
where h is a vector lag. Similarly, relationships between two regionalized variables Y and Z are described by the cross-semivariogram.
The parameters describing the scene semivariogram - its range, sill, and
derivative at the origin - can be shown to be related to quantifiable attributes
of a scene (Woodcock et al. 1988a+b).
Since the semivariogram provides a better characterization of the variation
of a regionalized variable than does the variance, it is useful to ask whether
there is a geostatistical analogy to Eq. 1. Collins et al. (1995) present an
example to show that under the Hierarchical ANOVA model, the scene
semivariogram decomposes as
where yZ represents the overall scene semivariogram, y(, represents the
semivariogram associated with the effect of level i of the hierarchy, and yG)(k)
is the cross-semivariogram between the effects of levels j and k. It is interesting to note that when overall scene variance is considered (Eq. 1) all crossproduct terms vanish, but that this is not the case for spatial variance. The
cross-semivariograms in (4) arise as the result of fine-scale continuity across
boundaries of larger-scale objects (Collins and Woodcock, manuscript in
preparation). They generally have small magnitudes and relatively short
ranges, and approach zero at large lags. The relevance of Eq. 4 to the present
study is its indication that semivariograms associated with particular levels of a
hierarchy - i.e. with different characteristic scales in the landscape - may be
discussed individually.
EFFECT OF RESOLUTION ON SPATIAL STRUCTURE
The scale-specific semivariograms included in Eq. 4 may be used to
retrieve quantitative information about scene variation at arbitrary resolutions.
In geostatistical parlance, sensor resolution - the size and shape of the IFOV is called the support of the measurements. There exists a well-developed
theory of the effect of changes of support on regionalized variables (Rendu,
1978; Zhang et al., 1990), and its applications in the context of remote sensing
(Jupp et al., 1988+1989; Atkinson and Curran, 1995). This section discusses
some of the ways in which information relating to scale of observation can be
derived from semivariograms. Notationally, subscripts indicative of levels in a
hierarchy are dropped, as the ideas apply to any regionalized variable with
finite variance. The relevance to multiple scales of variation comes from the
indication of Eq. 4 that these methods can be applied to scale-specific
semivariograms.
Regularization and Deregularization
As was noted above, the effect of a level in a hierarchy is a regionalized
variable, and is defined at all points in a scene. However, point-scale observations are never available from a remote sensing device, since there is always
some averaging of scene radiance over the sensor IFOV. That is, the data and
the semivariograms derived from it are regularized. It is desireable to derive
the point-scale, or punctual semivariograms from the regularized ones.
In the theoretical development which follows, the support of observations
is modeled as a set in the domain space D(o) over which the continuouslyvarying values of some effect are averaged. A useful quantity is the average
punctual semivariance between two sets in the scene, denoted A and B .
"XA ,B ) =
1
J J
Mes(A )Mes(B ) A
y(z-z') dz' dz
Mes(*) denotes the Lebesque measure of a set (i.e. its area in this context).
This definition is particularly useful when the sets A and B are shifts of a sensor IFOV. Specifically, let w denote some support, and let w h denote its translation by distance h . The regularized semivariogram at a lag of h can be
found from the punctual semivariogram by (Rendu, 1978)
yw(h) =y(w;wh) -y(w,w)
(6)
This relationship cannot be inverted to find the punctual semivariogram from
its regularized counterpart. But Atkinson and Curran (1995) present a numerical technique based on this relationship which may be used to estimate it.
The effects of regularization are most pronounced on higher levels of a
scene hierarchy, where the characteristic scales of variation are the smallest.
When scales of variability are much smaller than the support, 'microstructures'
in the data are undetected and spatial variation is lost. This phenomenon plus
the effect of measurement error can possibly lead to an observed 'nugget
effect' or a non-zero intercept of a regularized semivariogram. At scales larger
than that at which data are collected, the nugget effect shows no spatial structure. This leads to a simple relationship between the nugget effect observed
via support w (Nw) and that observed at a larger support W (Nw):
The nugget effect behaves differently under changes of support than does
the spatially correlated component of variation. Thus it is useful to express the
regularized semivariogram as (Zhang et al., 1990):
where j, is a regularized zero-nugget semivariogram, from which a punctual
zero-nugget semivariogram y* may be estimated. Then using 6, 7, and 8 the
semivariogram for any support W can be expressed as
Since parameters describing semivariograms relate to properties of the scene
under study, the ability to determine semivariograms for arbitrary supports is
useful. When used with a hierarchical decomposition of a scene
semivariogram, it can yield useful information concerning the relationship
between image resolution and spatial variation at different characteristic scales.
Determination of Variance at Arbitrary Resolutions
As was mentioned above, the overall scene variance is perhaps the best
measure of its information content. As a rule, the variance decreases as the
size of support increases. But the precise rate of decrease depends on the autocorrelation structure of the image, as revealed by the semivariogram. If autocorrelation between points is strong, then regularization will have little effect
since averaging over the IFOV will tend to occur between points with similar
values.
This assertion is justified mathematically. Let D(o) represent an area being
studied. If this area has been sampled on supports w , then the variance of supports W within L) (0) is (Zhang et al., 1990)
r
VW,D(,)) =
-I
The first term accounts for the nugget effect among the supports W , and the
last two terms deal with the spatial1y-correlated component of variation. Eq.
10 gives the variance for arbitrary resolutions, and so is useful for a number of
purposes.
CONCLUSION
Methods to determine the effect of sensor resolution on different
landscape scales have been presented. Characteristic scales of variation can be
identified by partitioning a scene hierarchically into ecologically meaningful
units. In order not to restrict the discussion to a particular classification of
scene objects, a general hierarchical model has been used. Use of different
landscape hierarchies may be called for in different situations, but the ideas discussed here continue to apply.
Semivariograms may be calculated characterizing the spatial variation at
each landscape scale, and may be used to derive information about scene structure at arbitrary resolutions. Throughout the discussion above, the assumption
has been made that the data are second-order stationary at all scales, a requirement for the calculation of semivariograms. This assumption needs to be
justified in order for the analysis of semivariance to be appropriate. If this condition is met, the relationships between resolution and semivariance presented
here are general with respect to directional qualities of variation. All results
apply equally for isotropic and non-isotropic semivariograms.
REFERENCES
Atkinson, P. M. and Curran, P. J. (1995). Defining an optimal size of support
for remote sensing investigations, IEEE Transactions on Geoscience and
Remote Sensing, 33(3):768-776.
Collins, J. B., Woodcock, C. E. and Jupp, D. L. B. (1995). Spatial dependence
and nested hierarchical scene models, 1995 ACSM/ASPRS Annual
Convention and Exposition Technical Papers:
535-544.
Cressie, N. A. C. (1993). Statistics for Spatial Data, John Wiley and Sons,
Inc., New York.
Dunn, 0. J. and Clark, V. A. (1974). Applied Statistics: Analysis of Variance
and Regression, John Wiley and Sons, Inc., New York.
Hobbs, R. J. and Mooney, H. A., Eds. (1990). Remote Sensing of Biosphere
Functioning, Springer-Verlag, New York.
Isaaks, E. H. and Srivastava, R. M. (1989). An Introduction to Applied
Geostatistics, Oxford University Press, Oxford.
Jupp, D. L. B., Strahler, A. H., and Woodcock, C. E. (1988). Autocorrelation
and regularization in digital images I: basic theory, IEEE Transactions on
Geoscience and Remote Sensing, 26(4):463-473.
Jupp, D. L. B., Strahler, A. H., and Woodcock, C. E. (1989). Autocorrelation
and regularization in digital images 11: simple image models, IEEE
Transactions on Geoscience and Remote Sensing, 27(3):247-258.
Markham, B. L., and Townshend, J. R. G. (1981). Land cover classification
accuracy as a function of sensor spatial resolution, Proceedings of the 15th
Int. Symp. on Remote Sensing of Environment, Ann Arbor, Mi.: 10751090.
Matheron, G. (1963). Principles of geostatistics, Economic Geology, 58: 12461266.
Mollering, H. and W. Tobler (1972). Geographical variances, Geographical
Analysis, 4 (1):35-50.
O'Niell, R. V., DeAngelis, D. L., Waide, J. B., and Allen, T. F. H. (1986). A
Hierarchical Concept of Ecosystems, Princeton University Press,
Princeton, New Jersey.
Quattrochi, D. A. (1993). The need for a lexicon of scale terms in integrating
remote sensing data with geographic information systems, Journal of
Geography, 92(3):206-212.
Rendu, J. M. (1978). An Introduction to Geostatistical Methods of Mineral
Evaluation, South African Institute of Mining and Metallurgy,
Johannesburg.
Townshend, J. R. G. and Justice, C. 0. (1995). Spatial variability of images
and the monitoring of changes in the normalized difference vegetation
index, International Journal of Remote Sensing, 16(12):2187-2195.
Woodcock, C.E., Collins, J., Gopal, S., Jakabhazy, V., Li, X., Macomber, S.,
Ryherd, S., Wu, Y., Harward, V. J., Levitan, J., and Warbington, R.
(1994). Mapping Forest Vegetation Using Landsat TM Imagery and a
Canopy Reflectance Model, Remote Sensing of Environment, 50:240-254.
Woodcock, J. E. and J. Harward (1992). Nested-Hierarchical Scene Models
and Image Segmentation, International Journal of Remote Sensing,
l3(16):3 167-3187.
Woodcock, C. E., Strahler, A. H., and Jupp, D. L. B. (1988). The use of
variograms in remote sensing: I. scene models and simulated images,
Remote Sensing of Environment, 25:323-348.
Woodcock, C. E., Strahler, A. H., and Jupp, D. L. B. (1988). The use of
variograms in remote sensing: 11. real digital images, Remote Sensing of
Environment, 25 :349-379.
Zhang, R., Warrick, A. W., and Meyers, D. E. (1990). Variance as a function
of sample support size, Mathematical Geology 22(1): 107-121.
Download