A triangular form-based multiple flow algorithm to estimate overland flow distribution and accumulation on a digital elevation model Petter Pilesjö and Abdulghani Hasan GIS Centre, Department of Physical Geography and Ecosystems Sciences, Lund University Sölvegatan 12, SE-223 62 Lund, Sweden Running head: Triangular form-based multiple flow algorithm Corresponding Author: Petter Pilesjö Sölvegatan 12 223 62 LUND Keywords: digital terrain modelling, digital terrain analysis, hydrological modelling, surface flow estimation, flow routing algorithm. 1 Abstract In this study, we present a newly developed method for the estimation of surface flow paths on a digital elevation model (DEM). The objective is to use a form-based algorithm, analysing flow over single cells by dividing them into eight triangular facets and to estimate the surface flow paths on a raster DEM. For each cell on a gridded DEM, the triangular form-based multiple flow algorithm (TFM) was used to distribute flow to one or more of the eight neighbour cells, which determined the flow paths over the DEM. Because each of the eight facets covering a cell has a constant slope and aspect, the estimations of—for example—flow direction and divergence/convergence are more intuitive and less complicated compared to many traditional raster-based solutions. Experiments were undertaken by estimating the specific catchment area (SCA) over a number of mathematical surfaces, as well as on a real-world DEM. Comparisons were made between the derived SCA by the TFM algorithm with eight other algorithms reported in the literature. The results show that the TFM algorithm produced the closest outcomes to the theoretical values of the SCA compared with other algorithms derived more consistent outcomes and was less influenced by surface shapes. The real-world DEM test shows that the TFM was capable of modelling flow distribution without noticeable ‘artefacts’, and its ability of tracking flow paths makes it an appropriate platform for dynamic surface flow simulation. 2 1. Introduction Topography is critical for modelling distributed hydrological processes, and especially surface/overland flow. The key parameter in catchment topography that has to be estimated is flow distribution, which indicates how overland flow is distributed over the terrain. Slope direction controls the pathway of the overland flow and also substantially influences the sub-surface flow pattern. Many authors have discussed different flow routing algorithms and how they have been applied on digital elevation models (DEMs). The following paragraphs are based mainly on discussions by Zhou et al. (2011) and Pilesjö (2008). One can conclude that the use of a DEM has made it possible to estimate flow at each location over a surface. Based on the flow distribution estimation, the drainage pattern over a surface—as well as various hydrological parameters, such as catchment area and up-stream flow accumulation—can be modelled (Wilson et al., 2008). The most critical spatial data required for digital hydrological modelling are the surface elevations, which are typically modelled as discrete samples over the land surface. Although there have been numerous models developed for this purpose—such as the triangulated irregular network (TIN), and digital contour and hybrid structures (i.e., a grid with intermeshed break-lines) (Ackerman and Krauss, 2004)— the gridded DEM is the most commonly used data source for terrain analysis because of its simple data structure, ease of implementation, and rapidly growing applications of digital photogrammetry and remote sensing technology (Li et al., 2005). Grid DEM-based hydrological modelling algorithms have been developed since the early stages of geographical information systems (GIS) (Beven and Moore, 1994; Wilson and Gallant, 2000; Zhou et al., 2008). However, because a grid DEM itself is an approximation of the real-world continuous surface using regularly spaced samples, the implementation of terrain analysis models is inevitably affected by the DEM grid structure. In practice, numerous assumptions and optimisations about mass transportation 3 and movement on a specific local surface have to be made to establish a workable mathematical model (Holmgren, 1994), resulting in significantly different approaches and methodologies towards terrain modelling (Zhou and Liu, 2002). It has been recognised that such modelling algorithms may produce significantly inconsistent results related to the terrain complexity and DEM data property (Zhou and Liu, 2004; Zhou et al., 2006), which lead to the presence of some significant artificial patterns (known as ‘artefacts’) in the output. Generally speaking, on a grid DEM, the surface flow and local catchment area are approximated by applying one out of two common approaches, namely, the use of single flow direction (SFD) and/or multiple flow direction (MFD) algorithms, according to whether the algorithms consider flow divergences. The SFD algorithms (O’Callaghan and Mark, 1984; Fairfield and Leymarie, 1991; Lea, 1992; Orlandini et al. 2003) do not allow flow divergence and restrict the mass movement to only one downhill direction at a time, even though the flow may be proportionally distributed into two adjacent downstream cells (e.g., Tarboton, 1997). The MFD algorithms (Freeman, 1991; Quinn et al., 1991; Costal-Cabral and Burges, 1994; Qin et al., 2007; Seibert and McGlynn, 2007; Gallant et al., 2011) consider flow divergence and assume that mass (or flow) on a grid DEM can be transported to more than one flow direction. In many cases, SFD algorithms can produce satisfactory results over horizontally concave surfaces where convergence of flow is assumed. On plane or horizontally convex slopes, where parallel or divergent flows are more likely, it is often more appropriate to divide the flow into two or more directions. Combinations of the two types of algorithms are often preferred when modelling water flow over natural surfaces (Pilesjö et al., 1998). Studies show that a large part of the uncertainty in derived hydrological parameters can be explained by the grid data structure of the DEM (Zhou and Liu, 2004). When a natural continuous surface is represented by regularly distributed spot heights, it is inevitably difficult to determine the way that surface flow distributes over adjacent cells (Olsson and Pilesjö, 2002). On the other hand, flow 4 estimation over a surface with a constant slope and aspect, such as a facet in a TIN, would be considerably more consistent, and an SFD algorithm would be adequate without the complication of flow divergence. In this study, we are developing a new flow routing method, taking advantage of the SFD algorithms concerning consistency, the MFD algorithm concerning the flexibility/flow diversity, the TIN modelling regarding the continuity, and applying it to the grid DEM. The grid cells in the DEM (each of them treated as a ‘centre cell’ when estimating flow from that cell) are further partitioned into triangular facets, which have constant slopes and aspects. Based on these conclusions, the surface flow path from each facet can be uniquely tracked and distributed to other facets covering the centre cell, and/or to one or more of the eight cells surrounding the centre cell. A comparison between the proposed algorithm and other algorithms reported in the literature is made using a data-independent test method (Zhou and Liu, 2002), as well as a test on a real-world DEM. To summarise, the aim of this study is to create and evaluate a flow-algorithm that can simulate overland flow in a realistic way on a digital elevation model, and overcome the previously mentioned problems. How flat areas and man-made barriers are taken care of, discussed by Hasan et al. (2012a), is beyond the scope of this study. 2. Methodology A key challenge when estimating flow over grid-based DEMs is how to model the flow movement over each grid cell (Zhou et al., 2006). A typical assumption for this has been to assign a ‘flow package’ (or a package of water) to the centre of each grid cell. Based on this, the grid cell is treated as a point on a continuous surface, where subsequently, a unit of flow package is generated and flows to the next downhill point(s). This assumption makes it difficult to incorporate the form of the surface overlapping the cell into the flow routing. 5 In this study, we attempt to create a computational model that is capable of simulating the surface flow, with less impact than the grid data structure of a DEM. The realistic simulation of the flow pattern is considered valuable not only as a means to estimate drainage areas but also in estimating the amount of water in time and space (Pilesjö, 2008). The approach is to sub-partition the grid cell according to the surface form so that the flow direction over a local surface can be uniquely defined (Tarboton, 1997). A grid cell is further partitioned by creating eight local, triangular facets between the cell centre and the eight surrounding cell centres of the neighbouring cells. 2.1 Computation of slope and aspect for each triangular facet When a facet is defined, the slope and aspect values are constant for this surface. The coordinates of the three vertices of a triangle (compare to Figure 1) are known as C1(x1, y1, z1), C2(x2, y2, z2) and M(x3, y3, z3). The facet is formed as a plane as follows: z f ( x, y ) ax by c (1) where a, b and c can be derived as: ( y1 y3 )( z1 z 2 ) ( y1 y2 )( z1 z3 ) ( x1 x2 )( y1 y3 ) ( x1 x3 )( y1 y2 ) ( x x )( z z ) ( x1 x3 )( z1 z 2 ) b 1 2 1 3 ( x1 x2 )( y1 y3 ) ( x1 x3 )( y1 y2 ) c z1 ax1 by1 a (2) Let p and q denote the gradients in the W-E and N-S directions, respectively. According to Equation 1, for a triangular facet with known vertices, we have p fx f a x , and q fy f b y The slope () and aspect () of the facet can therefore be derived as: 6 (3) q p b a 180 arctan 90 180 arctan 90 p p a a arctan p 2 q 2 arctan a 2 b 2 (4) The aspect angle is calculated clockwise from North. 2.2 Flow routing within a cell The proposed triangular form-based multiple flow algorithm (TFM) combines the advantages of different flow distribution algorithms in a very simple but innovative way. The work is partly based on, and can be seen as a further development of, the TFN algorithm presented by Zhou et al. (2011). The major improvement considers form. The TFM algorithm is based on multiple flow distribution allowing overland flow to all lower cells surrounding a centre cell while TFN is a ‘single flow algorithm’. BY dividing cells into sub-cells the proposed TFM algorithm topographic form is treated in more detail, allowing flow to all neighbouring cells. It is developed to be consistent for all terrain types: convex, concave, plane terrain, as well as their combinations. Around the midpoint (M, see Figure 1) of the cell in question (the centre cell from where the flow is estimated), eight planar triangular facets are constructed with midpoints of two adjacent cells (C1 and C2). With the aid of these eight triangular facets, our current grid cell (centre cell) is divided into eight triangular facets (1-8 in Figure 1). The slope and slope direction (aspect) of each of these triangular facets can be calculated. The form of the current grid cell is represented by the combined surface of the eight triangular facets. The area of each facet is equal to 1/8 of the cell size, and represents the flow portion contributed by that facet. The overland flow of each triangular facet is to be routed out of the facet (towards other facet(s) or neighbouring cell(s)), or stays in the same triangular facet depending on slope direction. 7 In the following sections, we take triangular facet number one in Figure 1 as an example to explain the possible estimations of flow routing within the eight facets. Naturally, the same approach is applied when estimating the flow from another of the eight facets (2-8) to its neighbouring facets. The first step is to calculate the slope direction/aspect value for the facet to be analysed. As illustrated in Figure 2, depending on the aspect value, three flow routing alternatives are possible; water can be directly routed from a facet to a neighbouring cell. This alternative results in NO routing to other facets and is denoted as stay; ALL water on a facet can be routed to ONE other neighbouring facet and is denoted as move; water on a facet can be routed to a neighbouring facet AND a neighbouring cell, or to two neighbouring facets and is denoted as split. The three alternatives are described in further detail below: 1. If the aspect value of facet 1 is between 0 and 45 degrees (see Case 1, Figure 2a, where 0 ≤ ASP ≤ 45), then the amount of water (directly proportionate to the area of the facet) to be transported from this facet to a neighbouring cell will stay as it is (1/8 of a cell). 2. If the aspect value is between 90 and 180 degrees (see Case 3, Figure 2d, where 90 ≤ ASP ≤ 180), or between 225 and 270 degrees (see Case 3, Figure 2e, where 225 ≤ ASP ≤ 270), then all the water on the facet will be moved to one adjacent facet; that is, to facet number 2 and/or facet number 8. This means that the amount of water to be routed from facet 1 to a neighbouring cell in these cases will be 0, and this water (1/8) will instead be added to facet number 2 and 8, respectively. 3. If the aspect value is between 45 and 90 degrees (see Case 2, Figure 2b, where 45 < ASP < 90), or between 270 and 360 degrees (see Case 2, Figure 2c, where 270 < ASP < 360), then the water on facet 1 will be split, and partly routed into one neighbouring facets according to a vector split (e.g., Pilesjö, 2008). The amount of water to be later routed to a neighbouring cell will initially stay on the facet, while the other portion will be moved to a neighbouring facet. In the example given in Figure 8 2b, some water will stay (for further routing to the cell above the facet; see below), while some water will be moved to facet number 2. In the example given in Figure 2c, some of the water will be moved to facet number 8. If the aspect value is between 180 and 225 degrees (see Case 4, Figure 2f, where 180 < ASP < 225), then the water on facet 1 will be split between two adjacent facets; one portion will be added to the amount of water (proportional to the area) of facet number 2, while another portion will be added to facet number 8. The resulting amount of water of facet number 1 will be 0. Note, in the case of two facets sloping towards each other, i.e., contradictive slopes, no water is routed between the two facets, but the alternative stay is applied. Because water can be transported between more than two facets (from facet 1 to facet 2; to facet 3 to facet 4; and then to neighbouring cells), flow routing between facets continues until all water has reached the outflow facet(s) of the cell. Water is then transported to neighbouring cells. The result of this first step is the redistribution of water within a cell. It also estimates where (to which neighbour cells) water will route. 2.3 Flow distribution to neighbouring cells After doing the flow routing for each cell—which consists of eight individual facets—the routing of water to adjacent cell(s) takes place. In one or more of the eight facets, water is ‘waiting’ to be routed to neighbouring cell(s). If the cell represents a concave landform, we can have only one facet holding all the water (directly proportional to the area, 1, of one cell); if the cell represents a convex surface (compare to a pyramid), there might be water in all eight facets. Each facet has two neighbouring cells (e.g., see cells C1 and C2 for facet number one in Figure 1), and the next step is to distribute the accumulated water between these cells. Two different cases can then be found: 9 1. If one neighbouring cell is lower in elevation than the centre cell (where the facet is located) and the other cell is higher or equal to the centre cell, then the water accumulated in the facet will all be distributed to the lower cell. 2. If both neighbouring cells are lower in elevation than the centre cell, then the accumulated water in the facet will be proportionally distributed to both lower cells to slope (an x value of 1, Equation 5): fi tan i x tan 2 j 1 1. , for all ß > 0 (5) x j where i,j = flow directions (1…2) to lower cells, fi = flow proportion (0…1) in direction i, tan ßi = slope gradient between the centre cell and the cell in direction i, and x = variable exponent. 3. If both neighboring cells are higher or equal in elevation than the central cell and there are other lower elevation cells then the accumulated water in the facet will be proportionally distributed to all lower cells. 2.4 Accumulated flow estimation and specific catchment area Flow distribution to neighbouring cells is estimated for all cells except for the border cells in the DEM. Starting from cells with no incoming water—that is, peaks or cells at the edge of the DEM—flow accumulations (in the unit of cells) are estimated as going downhill in the catchments. The results are estimated as values of flow accumulation for every cell, with the exception of the border cells in the DEM. When comparing results between different flow routing algorithms, the specific catchment area (SCA) was used. The SCA can be estimated using the definition and method reported by Costa-Cabral and Burges (1994) as: 10 SCA TCA TCA g ( sin A cos A ) (6) where SCA is the specific catchment area defined as the upstream catchment area per unit contour line (m2/m); TCA is the total upstream catchment area at a given point (m2); is the flow width of the catchment outlet (m); A is the slope aspect at the centre of the grid cell (º), estimated by approximation of a second order trend surface and representing the general flow direction of that cell; and g is the grid size (m). 3. Experiment Two different methods have been used to test the proposed flow estimation algorithm (TFM). The first one is a data-independent test based on the method proposed by Zhou and Liu (2002), testing the results of flow estimation on a number of mathematical surfaces. This makes it possible to derive and compare quantitative measurements of the accuracy for different methods. We also apply the proposed algorithm to a real-world DEM. The spatial patterns of estimated flow and accumulated flow are then visually examined, to detect if there are significant artefacts. 3.1 Data-independent assessment Repeating the experiment carried out by Zhou et al. (2011), the following surfaces were used: ellipsoid (representing a convex slope), inverse ellipsoid (representing a concave slope), plane (representing a straight slope) and saddle (representing a convex/concave slope association). The four different mathematical surfaces are visualised below, in Figure 3. The generation of the surfaces, including the formula defining the elevation at each and every cell, can be found in Zhou et al. (2011). According to Zhou et al. (2011), the cell size for all surfaces is 5 metres, and the number of cells varies between 15 000 and 40 000. For the plane surface, the slope is set to 66º, and the aspect to 243º. 11 At each cell of the mathematical surfaces, the theoretical ‘true’ specific catchment area (SCA) was calculated according to the method described by Zhou and Liu (2002). When the ‘true’ SCA values, as well as the estimated flow algorithm dependant ones are known, the error at each cell can be computed as: Ei SCAi SCAt (7) where Ei denotes the error (or residual) at the ith cell using a selected algorithm, SCAt and SCAi denote the theoretical and estimated value (using the flow algorithm to be tested) of the SCA at the ith cell, respectively. The Root-Mean-Standard Error (RMSE), Mean Error (ME, denoted as E ) and Standard Deviation of the residuals (SD, denoted as ) were then computed according to Zhou et al. (2011) for the assessment and comparison of the selected algorithms: n E RMSE i 1 2 i (8) n n E E i 1 i (9) n E E n i 1 2 i (10) n 1 where n denotes the total number of grid cells for each mathematical surface DEM. The proposed TFM algorithm was applied on the four mathematical surfaces, and the estimated flow accumulation values for each cell were converted to SCA in the unit of grid cells according to Equation 6. Then, the estimated SCA values were compared with the ‘true’ values computed. Statistical analyses 12 of the differences (RMSE, E , and ) were carried out to estimate overall accuracy as well as spatial distribution of errors. 3.2 Selection of algorithms for comparison The proposed TFM algorithm was compared with eight commonly used, tested, and well documented flow algorithms. Three of these can be classified as grid based single flow algorithms (D8, D8-LTD, and D), three as grid based multiple flow algorithms (MD, FMFD, and QMFD), and two as (partly) vector-based algorithms (DEMON and TFN). The selected algorithms are as follows: Deterministic Eight-Node (D8) (O’Callaghan and Mark, 1984) Deterministic Eight-Node Least Transversal Deviation (D8-LTD) (Orlandini et al., 2003; Orlandini and Moretti, 2009) Deterministic infinite number of possible single flow directions (D) (Tarboton, 1997) Triangular Multiple Flow Direction (MD) (Seibert and McGlynn, 2007) Freeman Multiple Flow Direction (FMFD) (Freeman, 1991) Quinn Multiple Flow Direction (QMFD) (Quinn et al., 1991) Digital Elevation Model Networks (DEMON) (Costa-Cabral and Burges, 1994) Triangular Facet Network (TFN) (Zhou et al., 2011) Which algorithm to select for comparisons can always be discussed. However, the selected eight algorithms were judged to represent a wide variety of solutions, involving raster as well as vector-based algorithms, all widely available to the research community. We used the System for Automated GeoScientific Analysis, SAGA (2012) for testing D8, D, MD, FMFD and DEMON algorithms, the 13 original FORTRAN code available from the author’s website1 for the D8-LTD algorithm, and the original code for the TFN algorithm available directly from the authors. Software implementation for the QMFD algorithm was conducted using the C++ programming language from the Zhou et al. study (2011). ArcGIS2 GIS software was used for comparison and visualisation of the results. 3.3 Real-world testing To visually examine the result of the proposed TFM algorithm, it was applied to a real-world DEM. This 1 metre resolution DEM was interpolated form LiDAR data measured at the Stordalen mire and its catchment area. Stordalen is a peat land area in the Arctic region 10 km west of Abisko (68º 20' N, 19º 03' E), north of Sweden. The LiDAR data, as well as the interpolation procedure, are described in detail in Hasan et al. (2012b). Because the differences between the different flow algorithms are relatively small (see below), the main purpose of real-world testing is to identify possible artefacts created by the proposed TFM algorithm. However, to highlight differences, we have also included accumulated flow estimations based on the D8 algorithm. 4. Results The tests of the different flow algorithm can be divided into two parts: first, the quantitative test of the significant differences between estimated values and ‘true values’ of SCA over the mathematical surfaces, and second, the qualitative test examining spatial distribution of errors over the mathematical surfaces, as well as visual interpretation of the real-world modelling. 4.1 Quantitative accuracy assessment 1 2 http://www.idrologia.unimore.it/orlandini/download.html. ArcGIS 9.3, © Environmental Systems Research Institute, Inc., http://www.esri.com. 14 The significant differences between the nine different flow algorithms and the ‘true values’ of SCA for the four mathematical surfaces are presented in Table 1. For three out of four of the mathematical surfaces, the results show that the proposed TFM algorithm outperforms the other eight algorithms in terms of RMSE. It has the lowest RMSE values out of 5.88 m, 8.28 m, and 7.66 m, for the ellipsoid, the inverse ellipsoid, and the saddle, respectively. For the plane, the TFM algorithm has a higher (6.13 m) RMSE value than the TFN algorithm (1.86 m) and is also just higher than the DEMON algorithm (5.18 m). Regarding the systematic bias, represented by the Mean Error ( E ) value, the TFM algorithm shows better results than the all other algorithm for two of the four mathematical surfaces: the ellipsoid ( E = 0.99 m) and the saddle ( E = 0.61 m). For the inverse ellipsoid the differences between the TFM result ( E = 4.00 m) and the slightly better TFN ( E = -3.95 m) and DEMON ( E = 3.88 m) are marginal. For the plane, only the TFN algorithm shows a better result ( E = 1.85 m) than the TFM algorithm ( E = 3.12 m). If we assume normal distribution, then the Standard Deviation of the Residuals () indicates the representability of the Mean Error; the lower the Standard Deviation, the more narrow distribution of errors around the Mean Error. For the ellipsoid surface, the TFM algorithm has a Standard Deviation of 5.80 m, which is the third lowest of the eight algorithms. However, because the FMFD ( = 2.32 m) and the QMFD ( = 4.61 m) algorithm both have considerably higher mean errors (6.87 m and 6.64 m compared to -0.99 m) the deviation from zero (no error) is actually smaller for the TFM algorithm. This is also the case for the saddle, where only the FMFD algorithm has a lower Standard Deviation than TFM (6.95 m compared to 7.63 m), but a much higher mean error (13.19 m compared to 0.61 m). For the plane, the TFM standard deviation (5.27 nm) is higher than the deviations of DEMON (3.91 m) and 15 TFN (0.11 m) algorithms, and for the inverse ellipsoid the TFM algorithm shows the lowest standard deviation out of all the tested algorithms. The Mean Error and Standard Deviation also indicate possible skewness in the estimation of flow accumulation. If the standard deviation is less than the absolute value of the mean error, this indicates that at least 83% of the errors are either all positive or all negative. Such a systematic error is not desirable. Referring to Table 1, the absolute mean error never exceeded the standard deviation for three algorithms (DEMON, TFN, and TFM); exceeded it at one surface, for three algorithms (D8, D8-LTD, and D); exceeded it at two surfaces, for one algorithm (MD); and exceeded it at three out of four surfaces, for two algorithms (FMFD and QMFD). When comparing the frequency distribution of positive and negative errors for these eight algorithms, only the TFM algorithm did not exceed skewness of the proportion 80-20 (%) for any of the four surfaces. D8-LTD exceeded 80-20 for one surface, D8, D, MD, and TFN did it for two surfaces, DEMON for three, and FMFD and QMFD exceeded the 8020 proportion for all four surfaces. It is also worth noting that the skewness was one-sided (either over or underestimating SCA for all mathematical surfaces) for all tested algorithm but TFM. In Figure 4 we illustrate the results of one single flow algorithm (D8), one multiple flow algorithm (QMFD), and the proposed form-based algorithm (TFM) applied on the four mathematical surfaces. Differences in flow routing are obvious, especially for the saddle, where only the TFM algorithm logically routes 50% of the water in NW and SE directions. Regarding the quantitative comparison between the proposed triangular form-based multiple flow algorithm (TFM) and the eight alternative algorithms it clearly shows that the TFM algorithm produces the best over-all results. Even if it did not produce the best result in every test over every mathematical surface, it shows by far the best and most consistent results. Referring to Table 1 one can conclude that out of the 12 measurements of accuracy (RMSE, E , and for the four mathematical surfaces) the TFM 16 algorithm outperforms all other algorithms in six of these. The algorithms producing the second best result, the TFN algorithm, outperforms all other algorithms in three out of these 12 cases. It should also be noted that these three cases are all linked to estimation on the plane surface. Generally it can be concluded that the raster-based single flow algorithms (D8 and D8-LTD) produce the poorest results, with RMSE values between 50 and 224 m (average = 130.0 m). The raster-based multiple flow algorithms (D, MD, FMFD, and QMFD) produce better results, with RMSE between 7 and 66 m (average = 32.7 m). The (partly) vector-based algorithms (DEMON and TFN) produce even more reliable results, with RMSE between 2 and 56 (average = 14.7 m). The reason for this can be explained by the obvious over-simplification adopted by the single flow algorithms, in combination with the advantages in geometric precision connected to the vector solutions. The average RMSE value for the TFM algorithm is 10.0 m. 4.2 Spatial distribution of errors on the artificial surfaces The ‘true values’ of SCA, as well as the estimated values applying the proposed TFM algorithm for the four mathematical surfaces, are presented in Figure 5. Even if differences can be observed, it is clear that the modelled SCA values, to a high degree, coincide with the true values. The spatial distribution of residuals of derived SCA—with respect to the theoretical ‘true’ value of SCA on the four mathematical surfaces, for all of the nine tested algorithms—is shown in Figure 6. Logically, and based on the errors reported above (see Table 1, for example), the TFM, TFN and DEMON algorithms show relatively less errors compared to the other algorithms tested. A general difference between the raster-based algorithms (D8, D8-LTD, D, MD, FMFD, and QMFD) and the vectorbased algorithms (DEMON and TFN) can also be identified: the vector-based algorithms show a more random pattern of error distribution, while the errors in the raster-based algorithms seem to be more additive. However, for the TFM algorithm, this systematic bias seems to be relatively low. 17 The less-biased pattern of the TFN algorithm is explained by the way SCA values are calculated. By counting the number of flow lines passing each cell, in vector mode, the derived SCA values of the upper stream cells do not affect that of the down-stream cell (see Zhou et al., 2011). However, as will be further discussed, the TFN algorithm does not produce better overall results and is much more CPU demanding. 4.3 Real-world testing Even if visual differences are often difficult to detect when modelling flow over real-world surfaces, the derived SCA values between the TFM and D8 algorithm were estimated for a peat land area in northern Sweden. Figure 7 presents the visual comparison between the results of the two algorithms. The results show that the TFM has produced a more balanced simulation on the flow distribution on this terrain surface, with a more realistic spatial pattern on the different terrain types (straight slopes, converging slopes and diverging slopes). In comparison, the D8 algorithm shows a possible advantage in representing the drainage network, but it also demonstrates its mostly criticised weakness: an unacceptable 45-degree bias that results in significant artefacts, such as unrealistic parallel drainage lines. 5. Discussion Even if the proposed TFM algorithm takes a raster approach when estimating the overland flow pattern—although this has been recognised as one of the major sources of error for flow estimation (Zhou and Liu, 2004)—it seems superior to all other tested algorithms, including the (partly) vector based DEMON and TFN algorithms. 5.1 Accuracy comparison 18 The accuracy estimations have been carried out using both quantitative comparisons between estimated flow accumulation/SCA and ‘true values’ over mathematical surfaces, and visual interpretation of a natural surface in northern Sweden. The main purpose of the latter is to reveal possible artefacts, caused by over-simplifications and ‘illogical flow routing’. Common problems connected to the multiple flow algorithms include the limited possibilities in adjusting the overland flow to the terrain form. Logically, we assume the flow to convert over concave surfaces and divert over convex surfaces. This can be regulated by the x exponent in Equation 5, but this is rarely performed (see Pilesjö et al., 1998). However, this is taken care of in the proposed form-based algorithm. The logic in the flow routing is exemplified in Figure 4, where it can be observed how water is and should be divided over a ridge (the saddle). The results of visual analysis over both artificial and real-world surfaces show no obvious systematic bias or artificial spatial pattern for the TFM algorithm. The single flow algorithms have the capability of modelling concave landforms relatively well, but show illogical results for plane and convex landforms. The opposite is valid for most of the multiple flow algorithms; because flow is normally distributed to all neighbour cells with lower elevation this creates unrealistic flow accumulation values for concave surfaces. 5.2 Potential implementation The source code of the proposed TFM algorithm is available from the authors. In this code, also solutions how to tackle e.g., sinks, flat areas, and man-made barriers are implemented (Hasan et al., 2012a). Compared to vector based methods (e.g., the TFN algorithm), the computational time (CPU) of the proposed TFM algorithm is also fast, making it more useful. Most likely, the major hurdle for the implementation of the TFM algorithm, as well as other more advanced multiple flow algorithms, is the slow response from the research and industry communities. This, in combination with the question if 19 ‘better’ flow algorithms are really needed (and they not always are), will probable slow down the potential implementation. 5.3 Assessment methods The proposed TFM method has been evaluated and compared with eight other commonly used algorithms reported in the literature. Both quantitative and qualitative evaluations of flow accumulation/SCA, on mathematical surfaces (ellipsoid, inverse ellipsoid, plane, and saddle) as well as real-world data, have been carried out. Root mean standard errors, mean errors, and standard deviations of the residuals have been calculated and compared. Visual interpretations have been made. However, more thorough and rigorous evaluation methods can always be performed, not only on the quantitative errors of the estimated catchment area (such as the area estimation error) but also on the uncertainty of the flow path or their derivatives’ morphology (see Orlandini and Moretti, 2009). 6. Conclusion In this study we have tested a newly developed triangular form-based multiple flow algorithm (TFM). The algorithm is form-based in the sense that it divides the cell from where the flow routing takes place into eight triangular facets to estimate flow over the cell. The flow is then distributed to one or more of the eight neighbour cells, determining the flow paths over the DEM. Because each of the eight facets covering a cell has a constant slope and aspect, the estimations of, for example, flow direction and divergence/convergence are more intuitive and less complicated compared to many traditional rasterbased solutions. Experiments undertaken by estimating the specific catchment area (SCA) over a number of mathematical surfaces, as well as on a real-world DEM, clearly indicate an all-round advantage of estimating flow and flow accumulation in all forms of slopes when using the TFM algorithm compared to other algorithms. The TFM algorithm outperforms eight commonly used compared algorithms (D8, D8-LTD, D, MD, FMFD, QMFD, DEMON, and TFN), proven by both 20 quantitative measurements including RMSE, ME and SD, and qualitative evaluation including the visual analysis of spatial patterns of residuals on the artificial surfaces and SCA over a real-world DEM. It is also concluded that the fact that the TFM algorithm is raster-based in terms of flow distribution between different cells in the DEM, making it considerably faster than vector based alternatives (e.g., TFN), is highly desirable when processing larger DEM. Overall, the proposed TFM algorithm is considered both adequate in representing overland flow over a DEM, resulting in accurate estimations of flow accumulation and SCA, and efficient enough to have a high potential for broad implementation. 7. Acknowledgements The European Union funding programme Erasmus Mundus ‘External Cooperation Window’ (EMECW lot8) financed the second author of this study. The authors are grateful to Qiming Zhou, Hong Kong Baptist University, and Yumin Chen, Wuhan University, for fruitful cooperation and data exchange. 8. References Ackermann, F., K. Kraus, (2004), Reader commentary: grid based digital terrain models, Geoinformatics, 7(6), 28-31. Beven, K. J., and I. D. Moore, (eds.) (1994), Terrain Analysis and Distributed Modelling in Hydrology, John Wiley & Sons, Chichester. Costa-Cabral, M. C., and S. J. Burges (1994), Digital elevation model networks (DEMON): A model of flow over hillslopes for computation of contributing and dispersal areas, Water Resources Research, 30(6), 1681-1692. 21 Fairfield, J., and P. Leymarie (1991), Drainage network from grid digital elevation models, Water Resources Research, 27(5), 709-717. Freeman, T. G. (1991), Calculating catchment area with divergent flow based on a regular grid, Computers & Geosciences, 17(3), 413-422. Gallant, J.C., and Hutchinson, M.F., 2011. A differential equation for specific catchment area, Water Resources Research, 47(5), W05535, http://dx.doi.org/10.1029/2009WR008540 Hasan, A., P. Pilesjö and A. Persson (2012a) Drainage Area Estimation in Practice, how to tackle artefacts in real world data, In the proceeding Surface models for geosciences, Symposium GIS Ostrava 2012 Ostrava, Czech republic. Hasan, A., Pilesjö P. and Persson A. (2012b) On generating digital elevation models from LiDAR data – resolution versus accuracy and topographic wetness index indices in northern peatlands, Geodesy and Cartography, Taylor & Francis. Holmgren, P. (1994), Multiple flow direction algorithms for runoff modelling in grid based elevation models: An empirical evaluation, Hydrological Processes, 8, 327-334. Lea N. J. (1992), An aspect-driven kinematic routing algorithm, in Overland Flow: Hydraulics and Erosion Mechanics, edited by A. J. Parsons, and A. D. Abrahams, pp. 393-407, UCL Press, London. Li, Z., Q. Zhu, and C. Gold (2005), Digital Terrain Modelling: Principles and Methodology, CRC Press, Boca Raton. O’Callaghan, J. F. and D. M. Mark (1984), The extraction of drainage networks from digital elevation data, Computer Vision, Graphics, and Image Processing, 28, 323-344. 22 Olsson, L. and P. Pilesjö (2002), Approaches to spatially distributed hydrological modelling in a GIS environment, in Environmental Modelling with GIS and Remote Sensing, edited by A. Skidmore, Taylor & Frances, London. Orlandini, S., G. Moretti, M. Franchini, B. Aldighieri and B. Testa (2003), Path-based methods for the determination of non dispersive drainage directions in grid-based digital elevation models, Water Resources Research, 39(6), 1144, doi:10.1029/2002WR001639. Orlandini, S., and G. Moretti (2009), Determination of surface flow paths from gridded elevation data, Water Resources Research, 45, W03417, doi: 10.1029/2008WR007099. Pilesjö, P. (2008), An integrated raster-TIN surface flow algorithm, in Advances in Digital Terrain Analysis, edited by Q. Zhou, B. Lees, and G. Tang, pp. 237-255, Springer, Berlin. Pilesjö, P., Q. Zhou, and L. Harrie (1998), Estimating flow distribution over Digital Elevation Models using a Form-Based Algorithm, Geographic Information Science, 4, 44-51. Qin, C., A.-X. Zhu, T. Pei, B., Li, C. Zhou, and L. Yang (2007), An adaptive approach to selecting a flow-partition exponent for a multiple-flow-direction algorithm, International Journal of Geographical Information Science, 21(4), 443-458. Quinn, P., K. Beven, P. Chevallier, and O. Planchon (1991), The prediction of hillslope flow paths for distributed hydrological modelling using digital terrain models, Hydrological Processes, 5, 9-79. SAGA 2.0.5, © Saga User Group Association, http://www.saga-gis.org. Accessed 2012-04-02. Seibert, J. and B. L McGlynn (2007), A new triangular multiple flow direction algorithm for computing upslope areas from gridded digital elevation models, Water Resources Research, 43, W04501. 23 Tarboton, D. G. (1997), A new method for the determination of flow directions and upslope areas in grid digital elevation models, Water Resources Research, 32(2), 309-319. Wilson, J. P., and J. C. Gallant (eds.) (2000), Terrain Analysis: Principles and Applications, John Wiley & Sons, New York. Wilson, J. P., G. Aggett, Y. Deng, and C. S. Lam (2008), Water in the landscape: A review of contemporary flow routing algorithms, in Advances in Digital Terrain Analysis, Edited by Q. Zhou, B. Lees, and G. Tang, pp. 213-236, Springer, Berlin. Zhou, Q., and X. Liu (2002), Error assessment of grid-based flow routing algorithms used in hydrological models, International Journal of Geographical Information Science, 16(8), 819-842. Zhou, Q., and X. Liu (2004), Analysis of errors of derived slope and aspect related to DEM data properties, Computers and Geosciences, 30: 369-378. Zhou, Q., B. Lees, and G. Tang (eds.) (2008), Advances in Digital Terrain Analysis, Springer, Berlin, 462p. Zhou, Q., X. Liu, and Y. Sun (2006), Terrain complexity and uncertainties in grid-based digital terrain analysis, International Journal of Geographical Information Science, 20(10), 1137-1147. Zhou, Q., P. Pilesjö, and Y. Chen (2011), Estimating surface flow paths on a digital elevation model using a triangular facet network. Water Resources Research 47 (7), art. no. W07522. http://dx.doi.org/10.1029/2010WR009961. 24 Table 1. Accuracy comparison between the TFM algorithm and eight other selected algorithms on the four mathematical surfaces (unit = metres). Partly from Zhou et al. (2011). RMSE D8 D8-LTD D MD FMFD QMFD DEMON TFN TFM Ellipsoid 63.53 Inverse ellipsoid 224.31 Plane 154.53 Saddle 103.68 E RMSE -12.07 -86.22 -109.87 -2.06 62.37 97.21 207.08 174.98 108.67 50.08 103.66 172.01 E RMSE -14.04 -26.21 -42.54 11.52 96.19 21.51 -10.65 173.01 66.03 -21.43 26.43 62.75 -49.92 171.62 27.67 -5.71 18.69 62.45 38.02 27.07 14.52 -11.81 56.37 -18.55 62.75 -49.92 17.32 -4.41 8.46 53.23 38.02 16.74 7.25 6.87 34.94 14.11 29.62 25.26 14.91 13.19 2.32 31.97 15.47 6.95 8.08 6.64 41.24 17.09 42.26 31.61 15.76 13.11 4.61 37.53 28.06 8.75 17.21 4.47 11.21 3.88 5.18 3.41 55.64 6.67 16.62 10.52 3.91 55.23 8.42 1.97 10.25 -3.95 1.86 1.85 8.11 -0.92 8.18 9.46 0.11 8.06 5.88 -0.99 8.28 4.00 6.13 3.12 7.66 0.61 5.80 8.23 5.27 7.63 E RMSE E RMSE E RMSE E RMSE E RMSE E RMSE E 25 C1 C2 8 1 7 2 M 3 6 5 4 Figure 1. In a 3 by 3 cells window, the centre cell is divided into eight triangular facets (1-8). Each facet is formed from three points; one is the centre cell (M), and the two other are two adjacent cells (e.g., C1 and C2). 26 1 2 2 2. 3. 4. 3 3 4 5. 6. 7. Figure 2. An illustration of how water can be routed from one triangular facet to other facets. Different aspect values lead to different actions (stay, move and split). 27 Ellipsoid Inverse Ellipsoid Plane Saddle Figure 3. Illustrations of the four different mathematical surfaces used in the data-independent assessment of flow algorithms. 28 Ellipsoid Inverse ellipsoid Plane Saddle Elevation 99.3 100 99.3 100.4 107.2 114.2 99 101 103 99.6 100 100.7 98.6 100 98.6 93.1 100 107.2 98 100 102 100 100 99.8 97.3 98 97.3 85.9 93.1 100.4 97 99 100.4 99.8 98.8 100 100 100 101 D8 100 QMFD 4 4 12 12 29 21 42 21 26 12 19 34 29 37 13 17 13 55 TFM 4.4 4.4 4.9 21.1 21.1 31 18.1 12.8 18.1 38 46.1 31 38.4 10.6 50 7.3 7.3 35.4 Figure 4. Distribution of flow from a centre cell to the eight neighbouring cells using different flow algorithms over sub-surfaces (3 x 3 cells) of four mathematical surfaces (inverse ellipsoid, plane, ellipsoid, and saddle). 29 A1 C1 A2 B1 C2 D1 B2 D2 Figure 5. A comparison between the ‘true’ SCA values (A1, B1, C1, D1) and the ones modelled by applying the proposed TFM algorithm (A2, B2, C2, D2) for four different mathematical surfaces (ellipsoid, inverse ellipsoid, plane and saddle). Low SCA values (dark brown) to high SCA values (dark blue). 30 Ellipsoid Inverse Ellipsoid Plane Saddle D8 D8-LTD D > 1,000 100 - 100 10 - 100 1 - 10 -1 - 1 MD -10 - -1 -100 - -10 -1,000 - -100 < -1,000 FMFD QMFD DEMON DEMON 31 TFN TFM Figure 6. The spatial distribution of residuals of the derived SCA compared to the theoretical ‘true’ values over four mathematical surfaces. 32 D8 TFM Figure 7. A comparison between the D8 and TFM algorithms on estimated SCA over a real-world DEM. 33 Figure captions Figure 1. In a 3 by 3 cells window, the centre cell is divided into eight triangular facets (1-8). Each facet is formed from three points; one is the centre cell (M), and the two other are two adjacent cells (e.g., C1 and C2). Figure 2. An illustration of how water can be routed from one triangular facet to other facets. Different aspect values lead to different actions (stay, move and split). Figure 3. Illustrations of the four different mathematical surfaces used in the data-independent assessment of flow algorithms. Figure 4. Distribution of flow from a centre cell to the eight neighbouring cells using different flow algorithms over sub-surfaces (3 x 3 cells) of four mathematical surfaces (inverse ellipsoid, plane, ellipsoid, and saddle). Figure 5. A comparison between the ‘true’ SCA values (A1, B1, C1, D1) and the ones modelled by applying the proposed TFM algorithm (A2, B2, C2, D2) for four different mathematical surfaces (ellipsoid, inverse ellipsoid, plane and saddle). Figure 6. The spatial distribution of residuals of the derived SCA compared to the theoretical ‘true’ values over four mathematical surfaces. Figure 7. A comparison between the D8 and TFM algorithms on estimated SCA over a real-world DEM. 34