TREE CROWN DELINEATION FROM DIGITAL ELEVATION MODELS AND HIGH RESOLUTION IMAGERY Christopher Meia , Sylvie Durrieub a INRIA, ICARE Project Team, 2004 Route des Lucioles - BP 93, 06902 Sophia Antipolis Cedex, France – christopher.mei@sophia.inria.fr b UMR3S, Maison de la Télédection, 500 rue Jean-François Breton, 34093 Montpellier Cedex 5, France – sylvie.durrieu@teledetection.fr KEY WORDS : forestry, high resolution multi-spectral imagery, infrared imagery, DEM/DTM, watershed algorithm, LIDAR, feature detection. ABSTRACT : The aim of this study was to develop a methodology to delineate accurately tree crowns using digital elevation models independently of source (laser scanned data or stereoscopic pairs) and to identify the complementary information that can be obtained from high resolution imagery. This methodology uses the Watershed algorithm as the baseline. However the direct application of this algorithm generates over-segmentation for complex tree-crowns and non-wooded areas. To solve these problems a preprocessing stage consisting of a mixture modeling adaptive thresholding technique was developed to remove non-wooded areas. Results obtained on different plots of land are discussed putting forward the influential factors. 1. INTRODUCTION There is an increasing need for accurate and cost effective forest resource information for operational and strategic applications (ecological sustainability, forest exploitation, rural development). The combination of high resolution digital elevation models, obtained for example from laser scanned data, and multispectral images appears particularly promising as a source of forest survey tools (Leckie et al., 2003). Research in the area of high resolution imagery (HRI) has led to the development of several methods for tree crown delineation. Tree tops often appear as maxima in intensity in satellite pictures and cast shadows around the tree crowns. In a 3D model where height corresponds to spectral intensity, tree crowns can be seen as peaks and shadows as valleys. Gougeon (Gougeon, 1998) developed a tree crown delineation method using these properties with a valley following approach. A similar technique was used by (Warner et al., 1998) but this time using texture to allow the detection of trees with a lower intensity. The results are generally good in high to medium density conifer forests. Preprocessing to remove non-vegetated areas is necessary for lower density areas. Methods that use region approaches have also been studied (Brandtberg & Walter, 1998; Culvenor et al., 1998). The zero-crossings of the second derivative (Laplace operator) can be used to detect parts of the tree crowns. A region growing algorithm can then be used with the tree tops (maxima spectral response) as seeds and light intensity as growing function right up to the pre-calculated boundaries. These methods are more robust for detecting trees which are partly in shadow. However preprocessing is also needed to avoid detecting light patches in lower density areas. Pollock (Pollock, 1998) uses model matching, training and spatial information to locate tree crowns. The advantage of this type of approach is that it avoids improbable regions. However in Pollock’s method, the training is performed by an operator. This could be a disadvantage in some circumstances. Voronoı̈ diagrams with a fuzzy approach have also been tested (Dubé et al., 1998). The use of a probabilistic model is also an efficient way to avoid un- likely regions. Research in the field of high resolution radar sensors is recent. At first these were used as an efficient way to generate digital terrain models (DTM) for orthorectification of digital satellite imagery. Later they were used to study tree heights (Andersen et al., 2001) and to identify trees by using the pulse response of the vegetation (Pyysalo & Hyyppä, 2001). Numerous studies have confirmed the importance of small footprint LiDAR data for forest inventories. Stand characteristics, like tree heights, basal area and stand volume can be accurately estimated by using laser scanning (Maltamo et al. (2004); Naesset & Bjerknes (2001); Means et al. (2000)). Some countries are considering in a close future the use of LiDAR technologies to perform their forest inventories (Nilsson et al., 2003; Wulder, 2003). Digital elevation models (DEM) obtained by LiDAR sensors or stereoscopic views have more rarely been used for tree delineation. The chief advantage of DEMs is that they are not affected by non-uniform light intensity. This means that they can be used to detect trees that would otherwise be in shadow. Obtaining tree heights is also an interesting characteristic. However DEMs are often less precise than HRI and trees that are very close cannot be extracted. The aim of the work described in this article is to use the complementary information from both a DEM and HRI of the same region to obtain a robust tree crown delineation algorithm. Although the DEMs employed in this study have been obtained from LiDAR data and consequently DTMs can also be extracted, it has been decided that the approach should be DTM independent in order to be able to use DEMs obtained from stereoscopic views. 2. STUDY SITE AND DATA The study site is a Mediterranean forest located near Montpellier city, in the south of France. In order to identify the limits of the application of the method according to the stands characteristics, six plots dissimilar in dominant species, diameter-class distribution and tree spatial distribution were selected : a closed and an open mature - 218 - International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXVI - 8/W2 Aleppo pine (Pinus halepensis) stand, an umbrella pine (Pinus pinea) young plantation, a mature poplar plantation and an olive orchard. The data used in this study consists of : • 1 m resolution digital elevation and terrain models (DEM and DTM) produced by an engineering firm, GeoLas Consulting, from LiDAR data with a 15 cm horizontal precision and a 0.5 m vertical precision. These data were acquired on 26th June 2002 with a small-footprint Toposys laser scanning system. DEM HRI • orthorectified multispectral images (blue, green, red and Infra Red channels) with a 0.5 m resolution, acquired with a digital line camera that was operated in parallel with the LiDAR instrument. All the data are georeferenced to the RGF 93 coordinate system. Pre−processed DEM Watershed A ground truth database was available for the two Aleppo pine stands and the Umbrella pine plantation. It includes, for each plot, 1) individual tree location recorded in the RGF 93 coordinate system and 2) dendrographic measurements (mean or both south-north and east-west crown diameters; total height, stem height, stem diameter at 1,30 m above ground). For the poplar plantation and the olive orchard, trees were easily identifiable on the IRC images and were manually delineated. The resulting vector layers were considered as the ground reference. 3. DTM Pre−processing Extraction of vegetated areas Mask Regions HRI Region Improvement METHOD Tree crowns 3.1 The watershed algorithm DEMs can be represented as grayscale images with the pixel values representing altitude. By inverting the images, trees appear as catchment basins. These provide the right configuration for the application of the Watershed algorithm (Vincent & Soille, 1991). Figure 1: Methodology for tree crown delineation The results obtained by the direct application of the algorithm on a DEM led to 1) over-segmentation of most of tree crowns and 2) imprecise borders obtained in low tree density areas To overcome this problems a specific methodology was developed (Fig. 1). 3.2 Pre-processing the DEM The over-segmentation of regions is not due to the classic problem of the nature of the image (which would have led to the application of a distance transform, a geodesic erosion or a marker-controlled segmentation). In this figure the over-segmentation is due to the complex tree crowns which do not conform to the basin assumption. Preprocessing with the application of a median or averaging filter with a mask approaching the actual size of the trees in the image can improve the results, the disadvantage being that some regions are then under-segmented. 3.3 Extracting wooded areas The second problem to be solved is the imprecise borders obtained for the tree crowns in low density areas. In this case, the Watershed borders do not correspond to tree crowns (Fig. 2). Separating terrain and non terrain areas (including trees) would solve this problem. Where a DTM is available, vegetated area extraction is possible by thresholding the image difference DEM − DT M . This is not possible with the DEM on non flat areas because the altitude of the terrain True tree crown border Tree crown border found by the algorithm True tree crown border Tree crown border found by the algorithm Figure 2: Position of the watershed borders for high density wooded areas (left) and for low density wooded areas (right) - 219 - International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXVI - 8/W2 is irregular. • (µ1 − µ2 )2 A method has been developed to extract wooded areas from DEMs (it was again decided to use DEMs instead of HRIs to avoid the problem of shadowed trees). The histogram of a DEM on a small vegetated region frequently contains two classes : the first corresponds to ground pixels, the second to tree tops. Spliting the DEM into squares and estimating a threshold elevation value for each square makes it possible to separate the trees from the ground locally (Fig. 3). • σ 1 , σ2 DEM Image of thresholds Gaussian blurring Thresholded image Incorrectly thresholded regions have been identified by comparing their variables to the Gaussian distribution generated by the variables of all the thresholded regions. This approach can be seen as a way of learning the global distribution of trees over a given region. A region can be incorrectly thresholded either because it is homogeneous (only trees or only ground) or because the trees are different to those in the rest of the DEM. By increasing the size of the square used for the region, it may be possible to make the region ressemble the surrounding regions more closely and find a satisfactory threshold (Fig. 5). Threshold Figure 3: Adaptative thresholding for wooded areas extraction This procedure is simplistic because some squares may not contain two classes (both ground and trees). It is essential to take into consideration the size of squares required to obtain a robust separation of these two regions. These difficulties have motivated the use of a Gaussian gray-level histogram modelling algorithm where both classes were modelled by a Gaussian that minimized the approximation error (Fig. 4). The threshold is at the intersection of both Gaussian functions. The functions’ characteristics can then be used to identify the regions that have been incorrectly thresholded. Figure 5: Result of the adaptive thresholding algorithm before correction (left) and after correction (right) 3.4 After masking by applying the adaptative threshold, the watershed algorithm is applied. Regions obtained in that way may contain buildings. These non vegetated regions can easily be identified using the normalized difference vegetation index (NDVI) on the HRI. The higher precision of the HRI can also be used to improve borders of the identified tree crowns. 3.5 Max1 Analysis Max2 Thresholding 0 mu1 mu2 Threshold 255 Altitude Ground and tree heights are modeled by two Gaussian functions Figure 4: Mixture modelling applied to the histogram The variables used to identify the incorrectly thresholded regions must be independent of altitude (you can not just compare the thresholds). The following variables were used taking this into consideration : • • max1 max2 , number of pixels number of pixels per square per square (max1 − max2 )2 number of pixels per square Automatic tree delineation Validation The results of the automatic terrain/non-terrain classifications are compared to a reference classification by analysing the contingency matrices. The reference classification was obtained by thresholding the Canopy Height Model (CHM = DEM − DT M ). The threshold was set manually so as to best suit the wooded areas visible on the IRC image. Evolution of the classification accuracy with the size of the square that was chosen to split the DEM is analysed for the different stands and related to the stand characteristics. The results were analysed before the iterative improvement step. The theoretical optimal size of the square used on the DEM image in the terrain/non-terrain separation method can be estimated for a regular stand. The window must contain at least one pixel of each class (terrain/non-terrain) whatever its position. The optimal size of the window is then a function of the tree diameters, distance between trees and direction of the plantation. It varies quite a √ lot according to the stand characteristics, from d/2 to d 2 for stands composed of trees at a distance d with perfectly round crowns. For real stands we assumed that the mean distance between trees for open stands, or between gaps for closed stand, could give a rough approximation of the optimal square size. This mean distance was estimated - 220 - International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXVI - 8/W2 from the density of trees (respectively gaps). The results of the automatic tree delineation (called segmentation in what follows) are compared to ground reference delineation of trees. The delineation used as the reference were obtained either by photo-interpretation of the IRC images for uniform stands with well individualised trees or using a ground truth data base for the other stands. A statistical analysis is first performed. The number and the size of automatically delineated trees are analysed for each selected plots and compared to the ones of the reference delineation. A more precise analysis, which can be called ”the spatial analysis of the segmentation”, is performed by comparing the segment boundaries resulting from the automated recognition method to the ground reference. The layers are compared in a vector format. Seven types of overlaps are defined for segmentation accuracy assessment. A tree is well identified when there is a one-to-one correspondence (only one segment associated with one ground delineation and vice versa), with a tolerance of a two pixels gap between both boundaries and with an overlap area greater than 80 % of the delineated tree area. The other classes are : over-segmented trees (more than one segment associated with one ground delineation) ; under-segmented trees (a segment includes significant part (>10 %) of more than one tree); trees that are both ”over-segmented” and ”under-segmented” that is to say that among the several segments associated with the ground delineation at least one is common to two or more trees; trees that are well identified but with an overlap area <80 % of the delineated tree area ; omitted trees ; and finally segments not associated with a tree (commission errors). 4. 4.1 RESULTS Extraction of wooded land For five stands (olive orchard, poplar, dense pine stand, sparse pine stand, young pine plantation) the evolution of the classification accuracy with respect to the initial square size for the extraction algorithm is analysed. For olive orchard, a large range of values was studied, from 1 to 40 meters with a one meter step. For other stands the studied size are spaced between 5 and 10 m. The maximum classification rate and the corresponding window size are related in table 1. The classification rate corresponding to the square size which is the closest to the mean distance between gaps or trees is also reported in table 1. The maximum classification rates are high for all stands (from 87.9 % to 98.7 %). For most of the stands the classification rates obtained with a square size close to the mean distance between trees or gaps stay high except for poplar and young pine plantations. The sharp decrease of the classification accuracy rate (>10 %) for these stands can be explained by the non homogeneous spatial distribution of the gaps (all around the plot for example for the pine plantation). In that case mean distance is no more a good indicator of the optimal window size. These results show the good potential of the proposed method to separate terrain and non terrain. The adaptive step decreases the sensitivity to the initial size square as expected. 4.2 Tree segmentation The number of trees of the ground reference data and the automatically delineated per radius classes are shown in Fig. 8. The results of the spatial comparison between the automated recognition method and the ground reference segmentation are presented in table 2. This spatial analysis was only realised for two plots. For heterogeneous stands (closed and open pine stands) the reference vector layers obtained from the ground truth data base did not fit enough the IRC images to be usable (tree crowns approximated with circles, shifts in tree locations). The best results are obtained for the poplar plantation. Only a slight over-segmentation is noticed (+6 % of the trees). The mean radius of segments is greater than the tree radius (+41 %) and this over estimation is observed for all the radius classes (see the shift in the histograms on fig $). However 80 % of the trees are well identified which indicates that the over-estimation of the radius stays in the limits set for an accurate segmentation. Taking into account the class of ”well identified trees with a problem of area” the number of well identified trees reaches 90 %. For the olive orchard, an homogeneous stand with 71 % of the trees in the same radius class, the results are not as good. We can notice an over estimation of the number of trees obtained by automatic segmentation for all the classes of radius except for the [2 m − 3 m[ radius class that contains most of the trees. Trees were either over segmented or under segmented. This is finally expressed by a number of segments greater than the real number of trees (+26 %) and by a mean of segment radius approximately the same as the real tree one (0.27 pixels). The well identified trees represents only 52.6 % of the trees (76.3 % if well identified trees with a problem of area are included). This result can be explained by the shape of olive trees. These fruit trees are pruned in their centre in order to let the light get into the middle of the trees. The crown shape has no more the appearance of a catchment basin on the inverted DEM but is closer to a torus which explains the over-segmentation and misplacement. For the two last stands analysed, the open pine stand and the closed pine stand, most of the trees have a small diameter (class radius [ 0 - 1 [ m) and could not be identified on the 1 m precision DEM. This led to the number of trees being sharply under estimated for both stands (- 43.6 % for open stand and -70 % for the closed stand). 5. DISCUSSION The terrain/non-terrain classification method gave very encouraging results for all the studied stands. Results concerning tree segmentation were much more ”stand-dependant”. The best results were obtained for the poplar plantation with high and regularly spaced trees (90 % of well identified trees). For natural pine stands the segmentation gave poor results. This was not surprising because of the complex shapes of the crowns and the irregular spatial distribution of trees. For these stands achieving an individual tree segmentation with the available data is unlikely. However interesting information concerning tree density, spatial organisation could be derived from the automatic tree crown delineation. The spatial resolution of the used DEM (1 m) allowed us to identify trees with a radius equal to that resolution. Better results are expected to be an outcome of DEM improvement. Some promising tests to obtain a 0.5m DEM from the same LiDAR data have already been performed. More information could also be derived from the complementarity of the DEM and HRI. Future studies could be undertaken to improve the Water- - 221 - International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXVI - 8/W2 Stand Olive trees Poplar trees Dense pine Sparse pine Young pine Average distance between trees (or holes) in m 7.8 9.5 (holes) 13.3 (holes) 18.2, 15.2(holes) 10.2 (holes) Size of optimal square in m % of correctly classified pixels square closest to average distance % of correctly classified pixels 11 25 20 10 91.8 98.7 87.9 84 8 10 10 20 90.6 86 85.7 83.8 30 88.2 10 68.8 Table 6: Accuracy of classification Nb Trees correctly segmented Trees correctly segmented with a problem of area Trees over-segmented Trees under-segmented Trees under- and over-segmented Trees omitted Extra segments Total number of trees in the stand Correct (segmentation+area) 20 4 5 4 5 0 0 38 Olive trees %of total number of trees of the stand 52.6 10.5 13.2 10.5 13.2 0 0 Nb 128 15 Poplars %of total number of trees of the stand 80.5 9.4 2 13 1 0 18 159 63.2 1.3 8.2 0.6 0 11.3 89.9 Table 7: Comparison between automated recognition and ground reference Figure 8: Figures (a) to (d) show the numbers of trees per radius classes calculated for the ground reference data and for the results of automatic delineation - 222 - International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXVI - 8/W2 shed algorithm using level sets. It would be interesting to study automatic identification of optimal square size for mixture modeling thresholding. Higher level knowledge could lead to better results by identifying tree types. Extra information can easily be extracted from the regions (tree crown diameter, height, ...) and could be used by an identification algorithm (ex : neural network) to create maps of regions automatically. References Andersen, H.-E., Reutebuch, S. E. & Schreuder, G. F. (2001) Automated individual tree measurement through morphological analysis of a lidar-based canopy surface model. In 1st International Precision Forestry Symposium pp. 11–22, Seattle, Washington. University of Washington, College of Forest Resources. Brandtberg, T. & Walter, F. (1998) An algorithm for delineation of individual tree crowns in high spatial resolution aerial images using curved edge segments at multiple scales. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry pp. 41–54 Natural Resources Canada, Canadian Forest Service, Victoria, B.C., Canada. Culvenor, D. S., Coops, N., Preston, R. & Tolhurst, K. G. (1998) A spatial clustering approach to automated tree crown delineation. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry pp. 67– 80 Natural Resources Canada, Canadian Forest Service, Victoria, B.C., Canada. H. U. (2003) Laser scanning of forest resources. ScandLaser Scientific Workshop on Airborne Laser scanning of Forests., , 43–51. Pollock, R. (1998) Individual tree recognition based on a synthetic tree crown image model. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry pp. 25–34 Natural Resources Canada, Canadian Forest Service, Victoria, B.C., Canada. Pyysalo, U. & Hyyppä, H. (2001) Reconstructing tree crowns from laser scanner data for feature extraction. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry Institute of Photogrammetry and Remote Sensing, Helsinki University of Technology. Vincent, L. & Soille, P. (1991) Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE PAMI, 1991, 13 (6), 583–598. Warner, T. A., Lee, J. Y. & McGraw, J. B. (1998) Delineation and identification of individual trees in the eastern deciduous forest. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry pp. 81–91 Natural Resources Canada, Canadian Forest Service, Victoria, B.C., Canada. Wulder, M. A. (2003) The current status of laser scanning of forests in canada and australia. ScandLaser Scientific Workshop on Airborne Laser scanning of Forests, , 21– 33. Dubé, P., Hay, G. J. & Marceau, D. J. (1998) Voronoı̈ diagrams, extended area stealing interpolation and tree crown recognition : a fuzzy approach. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry pp. 115–125 Natural Resources Canada, Canadian Forest Service, Victoria, B.C., Canada. Gougeon, F. A. (1998) Automatic individual tree crown delineation using a valley-following algorithm and a rulebase system. In Automated Interpretation of High Spatial Resolution Digital Imagery for Forestry pp. 11–23 Natural Resources Canada, Canadian Forest Service, Victoria, B.C., Canada. Leckie, D., Gougeon, F., Hill, D., Quinn, R., Armstrong, L. & Shreenan, R. (2003) Combined high-density lidar and multispectral imagery for individual tree crown analysis. Canadian Journal of Remote Sensing, 29, 633–649. Maltamo, M., Eerikainen, K., Pitkanen, J., Hyyppä, J. & Vehmas, M. (2004) Estimation of timber volume and stem density based on scanning laser altimetry and expected tree size distribution functions. Remote Sensing of Environment, 90, 319–330. Means, J. E., Acker, S. A., Fitt, B. J., Renslow, M., Emerson, L. & Hendrix, C. J. (2000) Predicting forest stand characteristics with airborne scanning lidar. Photogrammetric Engineering and Remote Sensing, 66, 1367– 1371. Naesset, E. & Bjerknes, K.-O. (2001) Estimating tree heights and number of stems in young forest stands using airborne laser scanner data. Remote Sensing of Environment, 78, 328–340. Nilsson, M., Brandtberg, T., Hagner, O., Holmgren, J., Persson, A., Steinvall, O., Stemer, H. & Sderman, O. - 223 -