Image processing of multiphase images obtained via X-ray microtomography: A review Schlüter, S., A. Sheppard, K. Brown, and D. Wildenschild (2014), Image processing of multiphase images obtained via X-ray microtomography: A review, Water Resources Research, 50, 3615–3639. doi:10.1002/2014WR015256 10.1002/2014WR015256 American Geophysical Union Version of Record http://cdss.library.oregonstate.edu/sa-termsofuse PUBLICATIONS Water Resources Research REVIEW ARTICLE 10.1002/2014WR015256 Key Points: First survey of image processing methods for multiphase fluid images A novel protocol is suitable for various types of porous media Many routines come with a freely available open-source library Correspondence to: S. Schl€ uter, schluets@onid.orst.edu Citation: Schl€ uter, S., A. Sheppard, K. Brown, and D. Wildenschild (2014), Image processing of multiphase images obtained via X-ray microtomography: A review, Water Resour. Res., 50, 3615– 3639, doi:10.1002/2014WR015256. Received 4 JAN 2014 Accepted 2 APR 2014 Accepted article online 7 APR 2014 Published online 25 APR 2014 Image processing of multiphase images obtained via X-ray microtomography: A review Steffen Schl€ uter1,2,3, Adrian Sheppard2, Kendra Brown1, and Dorthe Wildenschild1 1 School of Chemical, Biological and Environmental Engineering, Oregon State University, Corvallis, Oregon, USA, Department of Applied Mathematics, Research School of Physics and Engineering, Australian National University, Canberra, Australian Capital Territory, Australia, 3Department of Soil Physics, Helmholtz-Centre for Environmental Research—UFZ, Halle, Germany 2 Abstract Easier access to X-ray microtomography (lCT) facilities has provided much new insight from high-resolution imaging for various problems in porous media research. Pore space analysis with respect to functional properties usually requires segmentation of the intensity data into different classes. Image segmentation is a nontrivial problem that may have a profound impact on all subsequent image analyses. This review deals with two issues that are neglected in most of the recent studies on image segmentation: (i) focus on multiclass segmentation and (ii) detailed descriptions as to why a specific method may fail together with strategies for preventing the failure by applying suitable image enhancement prior to segmentation. In this way, the presented algorithms become very robust and are less prone to operator bias. Three different test images are examined: a synthetic image with ground-truth information, a synchrotron image of precision beads with three different fluids residing in the pore space, and a lCT image of a soil sample containing macropores, rocks, organic matter, and the soil matrix. Image blur is identified as the major cause for poor segmentation results. Other impairments of the raw data like noise, ring artifacts, and intensity variation can be removed with current image enhancement methods. Bayesian Markov random field segmentation, watershed segmentation, and converging active contours are well suited for multiclass segmentation, yet with different success to correct for partial volume effects and conserve small image features simultaneously. 1. Introduction The last decade has seen a tremendous progress in X-ray tomography and imaging techniques providing new means to analyze a multitude of research problems in porous media research. In the scope of water resources research, applications range from soil-water-root interactions and mechanical and hydraulical properties of rocks to pore-scale modeling of multiphase flow and continue to appear in related fields of research (see reviews by Blunt et al. [2013], Cnudde and Boone [2013], Wildenschild and Sheppard [2013], and Anderson and Hopmans [2013]). Progress in image progressing has kept a comparable pace in terms of new developments in image enhancement, image analysis, and hardware architectures [e.g., Ketcham and Carlson, 2001; Sheppard et al., 2004; Kaestner et al., 2008; Porter and Wildenschild, 2010; Tuller et al., 2013]. Since X-ray tomography is becoming a standard technique available to an increasing number of research groups in water resources research, more and more scientists have a need for information on how to process their data. Not everyone new to the field has the resources to develop their own image processing toolbox, tailored for the research question at hand, or the budget to take advantage of powerful image processing software that often has a rather comprehensive scope. A relief in this regard are software toolboxes which are freely available to the scientific community like IMAGEJ [Ferreira and Rasband, 2012], ITK [Ibanez et al., 2005], QUANTIM [Vogel et al., 2010], BLOB3D [Ketcham, 2005], or SCIKIT-IMAGE [van der Walt et al., 2014], just to name a few. Their multiphase segmentation capabilities are somewhat limited and may require substantial operator input. The software used in this study is described in the Appendix A. However, comparing the performance of different image processing methods on the same set of test images often leads to very different results. A notorious example is image segmentation of a gray value image into objects and background [Sezgin and Sankur, 2004; Iassonov et al., 2009; Baveye et al., 2010]. Yet, these comparative studies often merely list the performance of several segmentation methods with respect € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3615 Water Resources Research 10.1002/2014WR015256 to a certain quality measure or highlight the user-dependency of the segmentation result, but lack in useful information as to why a specific method fails under certain circumstances and how this may be avoided by suitable preprocessing. Another shortcoming is that many recent review papers on image segmentation with a focus on soil images deal with binary segmentation only [Baveye et al., 2010; Wang et al., 2011; Houston et al., 2013a] and do not provide solutions to multiclass segmentation problems. This review paper has two main objectives. First, we survey various segmentation methods with respect to multiclass segmentation. We focus on methods that operate on a single image, i.e., coupled images scanned at different X-ray energy levels [Rogasik et al., 1999; Costanza-Robinson et al., 2008; Armstrong et al., 2012] or in a wet and dry state [Culligan et al., 2004; Wildenschild et al., 2005] are not discussed here. We refer the reader to Brown et al. [2014] where we demonstrate that a single-energy method outperforms a three-energy method and discuss the potential shortcomings of either approach. All of the surveyed methods are locally adaptive, i.e., in addition to global histogram information they consider some neighborhood statistic for class assignment. In particular, we will examine hysteresis segmentation [Vogel and Kretzschmar, 1996], indicator kriging [Oh and Lindquist, 1999], converging active contours [Sheppard et al., 2004], watershed segmentation [Vincent and Soille, 1991], and Bayesian Markov random field segmentation [Kulkarni et al., 2012]. As quality measures, we will use misclassification error, volume fraction, specific interfacial area, and a connectivity measure. Second, we point out that the performance of image segmentation cannot be examined independently of image enhancement prior to classification. To do so, we compare the impact of different denoising methods on the segmentation results. We have chosen standard noise removal methods that were reviewed for application in lCT data of porous media before [Kaestner et al., 2008; Tuller et al., 2013]. Moreover, we apply efficient algorithms for image artifact removal, such as intensity bias [Iassonov and Tuller, 2010] and ring artifacts [Sijbers and Postnov, 2004]. Finally, we illustrate the impairment of proper threshold detection that is due to low contrast and imbalanced histograms, and present methods to correct it. The performance of different segmentation methods is evaluated by means of three test images. We start with a synthetic test image of a partially saturated packing of spheres, where the volume fractions and interfacial areas of the wetting, nonwetting, and solid phase are known exactly. The true image is superimposed with ring artifacts, blur, and noise, and the success of different combinations of denoising and segmentation in recovering the morphological properties of the true image is compared. Subsequently, the most suitable combinations are applied to two real images of quite different scopes. The first is a synchrotron image of a three-fluid medium impaired by intensity variation and noise [Brown et al., 2014]; the second is a mCT image of a soil with macropores, organic matter, and rocks impaired by noise and blur [Houston et al., 2013b]. The paper is organized as follows: in section 2, we provide the details for each image processing method, while image enhancement and segmentation results are compared in terms of visual appearance and morphology measures in section 3. In section 4, we discuss the results and provide recommendations for best practices, and our findings are summarized in section 5. 2. Methods 2.1. Artifact Removal Due to shortcomings in the image acquisition process, the base signal of an X-ray scan is often superimposed by different kinds of image artifacts [e.g., Ketcham and Carlson, 2001; Wildenschild et al., 2002]. The most frequent impairments are image noise due to a low count of incoming radiation at the detector, and image blur due to movement, hardware constraints, or suboptimal image reconstruction. As discussed below, there are powerful denoising methods that efficiently remove noise in homogeneous locations and at the same time conserve edges between objects. Other image artifacts which are less trivial to remove a posteriori are ring artifacts, due to defective diodes in the detector panel, or beam hardening of polychromatic beams, which manifests itself in the reconstructed image as streakings around high-attenuation objects and intensity variation with distance to the sample center. Note that there are means to avoid some of these artifacts already during image acquisition or image reconstruction like a slightly altering detector panel position during scanning and wedge calibration [Ketcham and Carlson, 2001]. Here we focus on methods that can be directly applied to the reconstructed volume. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3616 Water Resources Research 10.1002/2014WR015256 Ring artifacts can be removed separately for each image slice z after transforming the image from Cartesian coordinates x 5 (x, y) into polar coordinates x5ðr; uÞ, where x is the location vector, (x, y) is the horizontal and vertical coordinate, and ðr; uÞ are radius and angle. In this way, the rings appear as vertical lines that can be removed with a moving window W of width w R, where R is the radius of the sample. The window detects average variations in gray value median ~IðrÞ along r and normalizes them subsequently [Sijbers and Postnov, 2004]. Only homogeneous rows within W contribute to the median ~IðrÞ at column r, where the homogeneity threshold H has to be set by the user. This method has some shortcomings since objects aligned to a certain radius may also be removed. To our knowledge, the removal of streaking artifacts due to beam hardening is an unresolved problem. Intensity bias, on the other hand, can be removed rather easily given that it is not superimposed by changing attenuation coefficients due to variable fluid saturation or matrix porosity. To do so, requires an iterative procedure [Iassonov and Tuller, 2010]: (i) the image is segmented into the class with the highest attenuation and background by simple, histogram-based thresholding. (ii) The mean gray value within the highest density class is stored as a function of radius. (iii) A smooth function is fitted to the data: IðrÞ5a1b cos ð2pr=RÞ1c exp ðr=RÞ (1) where a, b, and c are fitting parameters. (iv) The smooth function is used to normalize the data. Steps (i–iv) are repeated until convergence is achieved after two to four iterations. 2.2. Denoising 2.2.1. Median Filter A good noise removal algorithm should exert significant smoothing in homogeneous regions (i.e., zones with low-intensity gradient rI(x)) and minimal modification of edges (i.e., high rI(x) zones), where r is the @ @ @ differential operator with respect to three dimensions r5 @x 1 @y 1 @z . The simplest method for nonlinear denoising is a median filter (MD) with a cubic kernel of diameter d: ^I MD ðxÞ5I0 ðxÞ Md ðxÞ (2) where * denotes convolution, I0 is the raw image, and ^I is the denoised result. The gray value that divides the set of d3 sorted gray values within Md into equal halves is assigned to the current voxel at location x [Gonzalez and Woods, 2002]. Note that this routine is usually applied in one loop and is rather slow for large kernel sizes, mainly due to sorting. However, a tremendous increase in speed is achieved by applying lookup tables and a moving median, i.e., for a kernel shift of one position only a small amount of d2 gray values has to be replaced in a table [Huang et al., 1979]. 2.2.2. Anisotropic Diffusion Filter Another popular, nonlinear denoising method is the anisotropic diffusion (AD) filter [Perona and Malik, 1990; Catte et al., 1992]. The rationale of this method is that the Gaussian distribution is the solution to the diffusion equation with a constant diffusion coefficient D. In the same way, applying the diffusion equation with a nonlinear diffusion coefficient amounts to smoothing with a Gaussian kernel of strongly varying standard deviation. Obviously, D should depend on the local intensity gradient rI(x). Hence, anisotropic diffusion calls for a numerical solution of the following partial differential equation (PDE): ^I 0 5I0 (3) @^I AD 5r½DðjrðGr ^I AD ÞjÞr^I AD @t (4) where t is numerical time, ^I AD is short for ^I AD ðx; tÞ, and the gradient of smoothed intensity values, convolved by a Gaussian Gr of standard deviation r, serve as an edge detector. The simplest implementation is: € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3617 Water Resources Research 10.1002/2014WR015256 ( DðjrðGr ^IÞjÞ5 1; jrðGr ^IÞj j 0; jrðGr ^IÞj > j (5) where j is a diffusion stop criterion. The number of iterations is another important parameter that has to be set manually, because the solution would eventually converge to uniform intensity. 2.2.3. Total Variation Filter Another PDE-based approach is total variation (TV) denoising [Rudin et al., 1992]. The rationale behind this method is to minimize the intensity variation in the image by means of the following cost function: 9 > > > ð = 2 0 ^I TV 5 argmin jr^IðxÞjdx 1 k jI ðxÞ2^IðxÞj dx > > ^I > > > ; :|fflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflffl} |fflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflffl}> 8 > > > <ð regularization (6) fidelity where k is a scale parameter that controls the trade-off between regularization, i.e., smoothing, and fidelity to the raw data I0 . The solution is achieved with the following set of coupled PDE’s: ^I 0 5I0 ! @^I TV r^I TV 5r 1k I0 2^I TV 1K @t jr^I TV j @K 5aðI0 2^I TV Þ @t (7) (8) (9) where ^I TV is short for ^I TV ðx; tÞ. The time step control a can be made adaptive to @K/@t. The number of iterations, used as a stopping criterion, is less crucial as compared to ^I AD , because the solution does not converge to uniform intensity due to the fidelity term used in equation (6). 2.2.4. Nonlocal Means Filter Unlike the previous methods, the nonlocal means filter (NL) is a linear filter, i.e., the gray value at the current location is the average of gray values at other locations, assigned with some suitable weighting factors, w. However, in contrast to standard linear filters (Gaussian filter, mean filter, etc.), it does not use a small-sized kernel, but potentially the entire image as a search window. The rationale is to compare the neighborhoods of all voxels y 僆 I with the neighbors of the current voxel at location x [Buades et al., 2005]: ^I NL ðxÞ5 X wðx; yÞI0 ðyÞ (10) y2I Thus, the similarity of a whole neighborhood with fixed size determines the weight w(x, y) with which a distant voxel will influence the new value of the current voxel. More specifically, the weights are significant only if a Gaussian kernel Gr with standard deviation r around y looks like the corresponding Gaussian kernel around x: 0 ð 1 2 0 0 ðnÞ jI ðx1nÞ2I ðy1nÞj dn G r B C 1 C wðx; yÞ5 exp B @2 A h2 ZðxÞ (11) where n scans the neighborhood, h acts as a filtering parameter that can be adapted to the level of image noise and Z(x) is the normalizing factor. Note that the computational cost for the neighborhood search in the entire image can become excessive, so restricting the search to a certain window size (y 僆 S in equation (10)) is required [Buades et al., 2008]. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3618 Water Resources Research 10.1002/2014WR015256 2.3. Edge Enhancement A notorious problem in image processing is partial volume effects due to image blur. That is, image edges do not manifest themselves as crisp intensity steps, but rather as gradual intensity changes spanning several voxels. A standard method to sharpen the image, i.e., to enhance the intensity gradient locally, is unsharp masking [Sheppard et al., 2004]: 0 ^I UM ðxÞ5I0 ðxÞ2 wðGr I ðxÞÞ 12w (12) where r should roughly match the half width of blurry edges and w defines the degree of edge enhancement, where [0.1, 0.9] is a suitable range. In the context of the last section, this corresponds to the inverse diffusion equation. Evidently, unsharp masking will also enhance noise, so the image should be denoised first. Alternative edge enhancement methods like a difference of Gaussians or a Laplacian of Gaussian filter [Gonzalez and Woods, 2002; Russ, 2006] have a very similar concept and are not further discussed here. 2.4. Image Segmentation 2.4.1. Histogram Bias Correction The frequency distribution of gray values can have an unfavorable shape for threshold detection. Typical examples are low contrast, imbalanced class proportions, class skewnesses, or class variances. Here we list three methods that can partially remove these histogram traits and thus facilitate more reliable threshold estimates: 1. Gradient mask: Partial volume effects due to blurred phase edges can cause long tailings in the histogram, which lead to skewed distributions for the lowest and highest intensity class. Partial volume voxels can be identified through high-gradient intensities and treated by different strategies [Panda and Rosenfeld, 1978]. One is to calculate the average gray value of partial volume voxels and use it as an optimal threshold [Schl€ uter et al., 2010], another is to mask them out and only calculate the histogram for low-gradient regions. The mask is generated by unimodal thresholding [Rosin, 2001] of the histogram of intensity gradients. 2. Histogram equilization: Image contrast is often enhanced by linear or nonlinear intensity rescaling, sometimes also denoted as histogram stretching [Gonzalez and Woods, 2002; Russ, 2006]. An alternative approach to contrast enhancement is contrast-limited adaptive histogram equalization (CLAHE) [Pizer et al., 1987]. Image contrast can be defined as the slope of the cumulative density function of gray values. Limiting this contrast corresponds to clipping the histogram at a certain cutoff. The histogram area thus removed is uniformly distributed over the range of gray values that occur in the image. In principle, this algorithm operates on the histogram of a certain search window to obtain a locally adaptive contrast enhancement. The method can be generalized such that any transition between global and local contrast enhancement is achieved [Stark, 2000]. In this study, histogram clipping is only applied to the global histogram to improve threshold detection and is not mapped to the corresponding image. 3. ROI dilations: Some images exhibit unimodal histograms due to very imbalanced class proportions, i.e., a very small volume fraction of a certain phase and a very large volume fraction of the background. A balanced frequency distribution can be obtained with a new, semiautomatic algorithm: (i) pick a threshold manually that detects the class with lowest volume fraction as the region of interest (ROI) and binarize the image. (ii) Dilate the thus obtained mask. (iii) Multiply the original image with the mask in order to compute the ROI histogram. (iv) If it is not yet clearly bimodal (multimodal) return to step (ii). 2.4.2. Global Thresholding Image segmentation is a crucial step in image processing and affects all subsequent image analyses. In this context, it is common to refer to global thresholding as approaches where classes are assigned to voxels by histogram evaluation only, without considering how the gray values are spatially arranged in the corresponding image. A multitude of different thresholding methods exist today which have been reviewed by various authors [Sahoo et al., 1988; Pal and Pal, 1993; Trier and Jain, 1995; Sezgin and Sankur, 2004]. The € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3619 Water Resources Research 10.1002/2014WR015256 general conclusion, if any, is that none of the reviewed methods excel at all segmentation problems. A comprehensive survey by Sezgin and Sankur [2004] compared 40 different thresholding methods, most of them global, and classified them according to fundamental principles. Five out of those methods were chosen for implementation in this study as well as an additional one: 1. G1—Maximum Variance: This is a classic method based on discriminant analysis [Otsu, 1975]. Consider the histogram as an estimator of the probability of a certain gray value and n – 1 thresholds to divide the histogram into n classes {C0, C1,. . .,Cn}. The total variance r2T of the population of gray values can be divided into the sum of withinclass variances r2W and the between-class variance r2B of class means. The objective is to find the set of n – 1 thresholds ft0 ; t1 ; . . . ; tn21 g that maximize this between-class variance. 2. G2—Minimum Error: Minimum Error Thresholding assumes the histogram to be composed of normal distributions for each class [Kittler and Illingworth, 1986]. The Gaussian modes that are fitted to the histogram usually overlap at certain gray values. As a consequence, assigning those voxels to only one class will deliberately lead to a certain misclassification error for the other class. The objective is to set the thresholds ft0 ; t1 ; . . . ; tn21 g such that the misclassification error is minimal. 3. G3—Maximum Entropy: This method exists in various modifications. The classic approach, which is implemented here, relies on Shannon Entropy as a measure of the information content of a signal [Kapur et al., 1985]. Assume a threshold to be close to the maximum or minimum gray value. Objects will barely appear in the output image and will be almost completely surrounded by background. Hence, the information content of the resulting image is low. A set of thresholds ft0 ; t1 ; . . . ; tn21 g can be adjusted such that the image has the richest detail, i.e., the information transfer is optimal. This is achieved by maximizing the sum of class entropies. 4. G4—Fuzzy C-means: This method combines the classic k-means algorithm [Ridler and Calvard, 1978] with fuzzy set theory [Jawahar et al., 1997]. Membership functions M0, M1, . . . Mn are assigned to each gray value depending on the distance to each class mean l0, l1, . . . ln and a fuzziness index s. The optimal set of thresholds ft0 ; t1 ; . . . ; tn21 g is detected at the intersections of adjacent M. 5. G5—Shape: This method detects the thresholds ft0 ; t1 ; . . . ; tn21 g at the local minimum between two adjacent histogram peaks [Tsai, 1995]. If the number of peaks np exceeds the predefined number of classes n, iterative Gaussian smoothing is first applied to the histogram, until np 5 n. If np < n, the missing thresholds are set at the location of maximum histogram curvature instead. 6. G6—Average: It can be shown that some methods are optimal under certain conditions [Kurita et al., 1992], e.g., equal class probabilities, equal class variances, etc. However, these conditions are hardly ever met in a real image and all methods will be biased to some degree. Assuming that the bias of different methods may partly cancel out due to the different criteria that they optimize, an averaged threshold over all methods may lie closer to the true, unknown optimum. Since some methods may fail completely, outliers have to be removed, i.e., only thresholds within ðt k 2rtk ; t k 1rtk Þ contribute to the final value, where k 5 0,. . ., n – 1. Note that in the first three methods the computational cost for an exhaustive search over all sets of thresholds in a n-dimensional search space increases exponentially with increasing number of classes n. Therefore, the methods are implemented in an efficient way by using lookup tables to store each term of a specific objective function for every possible pair of class boundaries, and employ these tables to calculate the objective function for an arbitrary number of classes [Liao et al., 2001]. For the fuzzy c-means, the exhaustive search is replaced by an iterative search [Jawahar et al., 1997]. The search space for the shape method is one-dimensional irrespective of the number of classes. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3620 Water Resources Research 10.1002/2014WR015256 2.4.3. Local Segmentation In contrast to global, histogram-based thresholding, locally adaptive segmentation methods also account for some kind of neighborhood statistic for class assignment in order to smooth object boundaries, avoid noise objects, or compensate for local intensity changes. Due to the added flexibility, local segmentation methods often result in more satisfying segmentation results [Iassonov et al., 2009; Wang et al., 2011]. Five different local segmentation methods, which have all been successfully applied to porous media images in the past, will be used in this study: 1. L1—Hysteresis: Hysteresis segmentation, which is sometimes also denoted as bilevel segmentation or region growing, was introduced as a method to improve edge continuity in gradient images [Canny, 1986]. In the same way, it improves the class assignment of partial volume voxels in soil images [Vogel and Kretzschmar, 1996; Schl€ uter et al., 2010]. Two thresholds have to be set by the user, a lower threshold that identifies voxels, which definitely belong to a low-intensity class, and an upper threshold for which the uncertainty of class assignment is highest. A voxel in the intermediate gray value range is only assigned to the low-intensity class, if a neighbor voxel already belongs to the low-intensity class. In other words, low-intensity voxels serve as seed regions for iterations of conditional dilations. Unassigned voxels which cannot be accessed by this region growing process are assigned to the high-intensity class. Hysteresis thresholding is not a multiclass segmentation method in its strictest sense as the procedure has to be repeated n – 1 times for n classes. 2. L2—Indicator kriging: In this geostatistical method, spatial correlation is used as a local assignment criterion [Oh and Lindquist, 1999; Houston et al., 2013b]. Again, two thresholds are specified by the user to define two a priori classes. The upper threshold of the unclassified range is extended toward gray values that definitely belong to a high-intensity class. The class assignment at an unclassified location depends on the weighted average of indicator values in its neighborhood, where the weights are obtained by kriging. Indicator kriging has to be repeated n – 1 times for n classes as well. 3. L3—Bayesian Markov Random Field: ^ The rationale of this probabilistic segmentation method is to find a spatial arrangement of class labels C with minimum boundary surface that at the same time honors the gray value data in the best possible way [Berthod et al., 1996; Kulkarni et al., 2012]. This is a combinatorial optimization problem which is solved in the framework of Markov random fields (MRF), i.e., by only evaluating the interaction between direct neighbors: 9 8 > > > > > > ð ð qffiffiffiffiffiffiffiffiffiffi 0 = < I 2l c x ^ argmin 1 b cðc ; c Þ 2pr2c 1 C5 x y 2 > > 2rc ^ X P > > C > ; :|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl} |fflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflffl} > (13) class boundaries class statistics with ( cðcx ; cy Þ5 21; cx 5cy 11; cx 6¼ cy (14) where X is the population of all voxels, P is the population of all pairs of neighboring voxels x and y, cx is the class label at x, lc and r2c are class mean and variance, and b is a homogeneity parameter that determines the weight of the penalty term for class boundaries. Class updating is achieved in a deterministic order denoted with iterative conditional modes (ICM) [Besag, 1986], where sufficient convergence is usually achieved after three to five loops. 4. L4—Watershed: The watershed algorithm uses lines of highest gradient to demarcate class borders locally [Beucher and Lantuejoul, 1979; Vincent and Soille, 1991; Roerdink and Meijster, 2000]. The image is preclassified with simple € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3621 Water Resources Research 10.1002/2014WR015256 thresholding to set markers for the immersion process. High-gradient zones are subsequently set to unclassified again, as they will be assigned by the watershed algorithm. These high-gradient zones are either identified by edge detection (Sobel, Canny, etc.) or by a small cubic kernel that detects neighborhoods with nonuniform class assignment. The latter method produced the better results for the images examined in this study. The unclassified zones are filled from different ends using the intensity gradient as a pseudoheight until the watershed line is reached. Finally, the voxels located directly on the separation line and on plateaus are filled with the most representative class in their neighborhood. 5. L5—Converging Active Contours: This approach is a combination of the watershed method and active contour methods, which uses gradient and intensity information simultaneously [Sheppard et al., 2004]. The method is initialized by identifying seed regions for each class; as in the methods above, this means that each voxel is either assigned to one of the classes or left unassigned, to be classified during the main segmentation step. Like the watershed method described above, this initial classification is normally done by thresholding in both intensity and gradient space; each class has an upper and a lower threshold along with an upper gradient threshold. The main classification algorithm proceeds by simultaneously growing the boundaries of these seed regions toward each other. The speed at which the boundaries advance varies spatially and temporally, depending on the local gradient and optionally on the distance of the local gray value to its class mean. The algorithm ends when all boundaries have converged. The final separation line assignment problem described above that affects some watershed methods is not an issue because every nonseed voxel is traversed by one class boundary before any others. The advancement of the boundaries is implemented efficiently using the fast marching algorithm; despite this, the method is quite computationally intensive and requires parallel implementations to operate on large 3-D images. 2.5. Postprocessing Denoising and boundary refinement can also be applied to the class image C as postprocessing instead of smoothing the raw image I0 . Among the most popular postprocessing methods for binary images are morphological operators like erosion and dilation [Serra, 1982]. However, in multiclass images they would correspond to a minimum and maximum filter, which are not suited for this purpose at all. A median filter, on the other hand, removes segmentation noise and rugged class boundaries rather well. It does however, have the unwanted feature that the results happen to depend on the order in which the classes are num^ MA , which assigns the most representative class among bered. This is avoided by applying a majority filter C all neighbors in a cubic kernel to the central voxel. It is common to enforce two criteria to prevent unwanted features where more than two phases meet: (i) the most representative class exceeds a certain volume fraction and (ii) the volume fraction of the most representative class exceeds the volume fraction of the old class at the central voxel by a certain percentage. An alternative postprocessing method is size-dependent object removal. For this method, each object of each class has to be labeled. Volumes are then determined by voxel counting, and objects smaller than a user-dependent threshold are filled with the class that completely surrounds the object. Consequently, rugged surfaces of big objects are conserved. Small deleted objects at the boundary between two materials are filled by simultaneous dilation of both materials. 2.6. Structural Analysis The list of tools with which complex structures like porous media can be analyzed is virtually endless. This study is constrained to only four very simple, but meaningful metrics. If ground-truth information is available from a true image I, a misclassification error ME can be determined: Nx 1 X ME5 dðci ; ^c i Þ; Nx i51 ( dðci ; ^c i Þ5 1; ci 6¼ ^c i 0; ci 5^c i (15) where ci and ^c i are the true and current class at location i and Nx is the number of all voxels. Bulk volumes V and surface areas a are examined for each individual phase. Specific interfacial areas between two phases (a and b) are obtained by the following relation: € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3622 Water Resources Research 10.1002/2014WR015256 1 aab 5 ðaa 1ab 2ac Þ 2 (16) where c is the union of all other classes. Both V and a are determined with Minkowski functionals [Vogel et al., 2010]. In this way, surface area estimates are directly obtained from the segmented voxel image and problems associated with aligning isosurfaces of each individual phase are avoided. Finally, a dimensionless connectivity indicator for a specific phase is calculated [Renard and Allard, 2013]: Ca 5 Nl 1 X n2 Na2 i51 i (17) where each cluster of phase a has a label li and a size ni, Nl is the number of clusters and Nn is the number of all phase a voxels. Cluster labeling is typically achieved with a fast method by Hoshen and Kopelman [1976]. Ca has the edge over the popular Euler number that it is bounded by (0, 1] and less sensitive to noise. 3. Results 3.1. Synthetic Test Image 3.1.1. Image Generation Synthetic images provide ground-truth data against which different image enhancement and segmentation methods can be compared. The test image in this study is generated to resemble a partially saturated sand packing. To this end, a volume of 512 3 512 3 128 voxels is filled by nonoverlapping spheres with a radius of r 5 50 voxels placed at random locations. If a new sphere does not fit at a given location, the radius decreases in steps until the overlap vanishes. The location is completely abandoned when r 5 20 is reached. The procedure stops after 250 spheres have successfully been placed resulting in a porosity of 0.412. Subsequently, an opening transform [Serra, 1982] is applied to the pore space with a spherical structure element of r 5 15. The pore space which has been removed by this opening is considered to be filled with the wetting phase (w) and the remaining, larger pore bodies are assigned to the nonwetting phase (n). This does not necessarily produce a physically realistic fluid distribution, but evokes fluid configurations that are similar enough to serve as a test scenario. The resulting volume fractions are Vw 5 0.279 and Vn 5 0.133. Gray values of 50, 125, and 205 are assigned to nonwetting, wetting and solid phase, respectively, resulting in the true image I. An axial image slice of I is depicted in Figure 1a. Subsequently, the image is superimposed with ring artifacts, blur, and noise in order to obtain a more realistic, raw image I0 (Figure 1b). The ring artifacts are additive with random height and random radius. The rings manifest themselves as a random gray value offset in the range [2100,100], where the resulting gray value is limited at [0,255]. Subsequently, blurring is achieved with a cubic mean kernel of five voxels on a side. Finally, uncorrelated Gaussian noise is added with a signal-to-noise ratio of rb/rn 5 2, where rb and rn are the standard deviations of gray values in the blurred image and the noise model, respectively. 3.1.2. Denoising and Edge Enhancement Four different denoising methods (Figures 1d–1g) are applied to the raw image I0 in Figure 1b in order to remove noise followed by an edge enhancement step to mitigate partial volume effects caused by blur. The median filter was applied one time with a cubic convolution kernel with a side length of d 5 7 voxels. The anisotropic diffusion filter was applied with j 5 15, r 5 1.0, and t 5 1.5 (nine iterations). The total variation filter was applied with k 5 2 stopping after t 5 0.12 (30 iterations). For the nonlocal means filter, a cubic kernel of d 5 9 was used for neighborhood search with a Gaussian convolution kernel of r 5 2 in a reduced search window S of d 5 23 voxels. The two criteria for the majority filter are set to an absolute majority of 0.33 and a higher occurrence than the class of the central voxel of 0.05. The settings of each denoising method have been carefully chosen to achieve strong denoising, deliberately accepting some degree of edge smoothing. Subsequently, an unsharp mask filter for edge enhancement with a Gaussian kernel of r 5 1.5 and a weighting factor of w 5 0.7 was applied to each denoising result. Finally, the results of the unsharp mask after anisotropic diffusion ^I AD1UM and total variation € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3623 Water Resources Research a 10.1002/2014WR015256 0.09 b 0.07 frequency c (b) (d) (e) (f) (g) 0.08 0.06 0.05 0.04 0.03 0.02 0.01 0 0 d e 25 50 f 75 100 125 150 grey value 175 200 225 250 g Figure 1. (a) True image I with spheres and size-dependent distribution of wetting and nonwetting phase, (b) raw image I0 exhibiting ring artifacts, blur and noise, (c) histograms before and after image enhancement. (bottom row) Image enhancement results with different denoising methods in combination with unsharp masking: (d) ^I MD1UM , (e) ^I AD1UM , (f) ^I TV1UM , and (g) ^I NL1UM . denoising ^I TV1UM had to be cleaned by a median filter with a d 5 5 kernel to smooth rough surfaces that were enhanced during unsharp masking. The histograms in Figure 1c and visual appearances in Figures 1d–1g indicate different success in noise removal and edge preservation. The edges in the median image ^I ME1UM appear slightly more blurry than in the images for the other three denoising methods. Moreover, we observe that the nonlocal means filter ^I NL1UM is superior in removing ring artifacts during the denoising step alone and does not require the additional cleanup of rough edges by a median filter mentioned above. 3.1.3. Ring Artifact Removal The ring artifact routine of Sijbers and Postnov [2004] is applied slice wise with a window size of w 5 27 pixels and a homogeneity threshold H adapted to the noise level in Figures 1d–1g. The outcome is depicted for the enhanced image ^I TV1UM after TV denoising and unsharp mask edge enhancement (Figure 2c). Obviously, the ring artifact removal works well for some rings, but is incomplete for others even after carefully testing different w and H. For instance, the white ring within the green window exhibits a varying magnitude after denoising (Figure 2a). Since the high artifact magnitude in the pore space contributes less than a b c Figure 2. (a) Test image after a combination of TV denoising and unsharp masking ^I TV1UM (corresponds to Figure 1f). Yellow frame marks sharp boundaries before ring artifact removal and the green ring corresponds to the window in polar coordinates. (b) Same image in polar coordinates after line removal. Green rectangle depicts the moving window W. (c) Same image after ring artifact removal. The yellow frame highlights blur due to the back transform into Cartesian coordinates. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3624 0.98 175 0.97 150 0.96 125 0.95 100 0.94 75 0.93 tl0-th0 l h t1-t1 tmax [0,1] 50 25 0.92 0.91 0.9 b INL+UM ground truth tmax [0,1] 0.01 0.001 0.0001 25 50 100 150 200 t0 - threshold dark-to-medium 0.07 50 75 100 125 150 gray value 175 200 225 0.012 tl[0,1]-th[0,1] INL+UM G1 G2 G3 G4 G5 G6 0.05 0.04 0.03 c d 0.01 frequency 0.06 frequency tl0-th0 l h t1-t1 0.1 frequency a B 200 10.1002/2014WR015256 1 B 1 0.99 t1 - threshold medium-to-bright 225 σ2(t0,t1) / 〈σ2(t0,t1)〉max - between-class var. Water Resources Research 0.008 0.006 0.004 0.02 0.002 0.01 0 0 25 50 75 100 125 150 gray value 175 200 225 25 50 75 100 125 150 gray value 175 200 225 Figure 3. (a) normalized objective function of Otsu’s method (G1) with the thresholds detected at the maximum between-class variance and the transition regions stopping at the 0.992 percentile. (b) Histogram of ^I NL1UM after ring artifact removal with thresholds t0max and t1max and transition ranges (½t0l ; t0h and ½t1l ; t1h ) detected with Otsu’s method (G1). (c) Threshold l ; th ½0; 1 are obtained accordingly. (d) Same averaging pairs for five different methods (G1–G5) and the corresponding averages after outlier removal (G6). The transition regions ½t½0;1 method after edge masking and histogram clipping. half to the window height in Figure 2b, it remains undetected by the median ~IðrÞ. Stretching the window over more than one slice as suggested by Ketcham [2006] does not improve the results. In addition, the back transform from polar into Cartesian coordinates introduces additional blur for high radii (yellow frame). 3.1.4. Global Thresholding In Figure 3a, the well-known Otsu method (G1) is applied as an example to the histogram of ^I NL1UM after ring artifact removal. Results for the other denoising methods are similar (not shown). The exhaustive search for each pair of thresholds detects the optimum at tmax 5 {95; 166}. Percentiles of this objective criterion can be used to obtain fuzzy threshold ranges [Oh and Lindquist, 1999], i.e., t1 is held constant and t0 decreases (or increases) until in this case r2b =hr2b imax 50:992 is reached at t0l (or t0h ). The procedure is repeated for t1 where t0 kept is constant. Applying these thresholds and associated ranges to the underlying histogram (Figure 3b) reveals a bias of Otsu’s method toward the class with highest volume fraction. Note that the true optimum, which is estimated by the intersection points of the individual gray value frequencies within in each class of the true image, is at t true 5f91; 162g. Some other thresholding methods are also afflicted by bias, yet in different directions (Figure 3c). Fuzzy c-means (G3) show the same bias due to imbalanced class probabilities. The shape analysis (G5) performs well since the local histogram minima happen to coincide with the intersection points in Figure 3b. The minimum error method (G2) is biased due to imbalanced skewness, i.e., only the wetting phase exhibits two long tailings due to partial volume effects in both directions, which increases the probability of being assigned to that class. The maximum entropy method (G4) fails completely in identifying meaningful thresholds. Figure 3c also depicts the arithmetic mean of thresholds and associated ranges after outlier removal (G6). This average set of thresholds at t 5 {88; 165} is closer to the true optimum. Note that the specific percentiles for each thresholding method have to be set individually, as each objective criterion exhibits very different ranges, i.e., some are bounded and yet others are logarithmic. A large part of the bias in Figure 3c can be removed easily. First, the impact of imbalanced € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3625 Water Resources Research a 10.1002/2014WR015256 b c e f nonwetting wetting solid d Figure 4. Segmentation results for different local segmentation methods applied on the image after nonlocal means denoising and unsharp masking ^I NL1UM : (a) original image, (b) global thresholding, (c) indicator kriging, (d) Bayesian MRF, (e) watershed, and (f) converging active contours. Differently colored frames highlight failures of various methods. skewness is mitigated by masking out partial volume voxels at phase edges [Panda and Rosenfeld, 1978]. Second, the impact of imbalanced class probabilities is reduced by histogram clipping [Pizer et al., 1987]. In this way, the skewness of the nonwetting and solid phase is also efficiently removed. The combination of both methods leads to a well-balanced histogram for which each of the surveyed thresholding methods ends up at similar values (Figure 3d). The average after outlier removal at t 5 {90; 162} matches the true optimum almost perfectly. 3.1.5. Local Segmentation The average set of thresholds (Figure 3d) are determined for each denoising method individually and then used for global thresholding as an initial step in all local segmentation methods. Note that a priori segmentation with Bayesian Markov random field (L3) and watershed segmentation (L4) requires single thresholds tkmax only, hysteresis segmentation (L1) uses the range ½tkl ; tkmax , and indicator kriging (L2) and converging active contours (L5) use ½tkl ; tkh , with k 5 {0; 1}. In addition, converging active contours requires a gradient threshold, which is determined automatically by unimodal thresholding of the gradient histogram. The results of each local segmentation method on ^I NL1UM are depicted in Figure 4. The segmentation result for hysteresis segmentation (L1) is left out, because it is indistinguishable from the outcome of global thresholding (G6). All segmentation results are free of ring artifacts and noise and exhibit smooth object boundaries which can be mainly credited to successful preprocessing. However, some segmentation results perform better than others in the way they recover image features impaired by blur. Incorrectly identified, apparent wetting films that are due to partial volume voxels at class boundaries between the nonwetting and solid phases (yellow frame) can be mainly attributed to image blur. Indicator kriging (L2) and hysteresis segmentation (L1) cannot cope with this problem, since these are iterative methods, i.e., the segmentation between wetting and nonwetting phase is independent of the segmentation between wetting and solid phase. In principle, Bayesian MRF segmentation (L3) has a mechanism to remove these films. To do so, the homogeneity factor b has to be set high, thus also removing true features of similar size. A moderate value of b 5 0.5 is used here. Yet, no matter how high b is set only direct neighbors are evaluated in the MRF paradigm and since the films are thicker than two voxels they cannot be penalized. Another fundamental shortcoming of Bayesian MRF segmentation that has previously gone unnoticed is its tendency toward bias in the class statistics (equation (13)). The histogram of the wetting phase exhibits long tailing toward both € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3626 Water Resources Research 1000 0.01 100 0.001 10 frequency 0.1 0.0001 class statistics term - eq. 13 10.1002/2014WR015256 1 0 40 preprocessed thresholds 80 120 160 gray value non-wetting wetting 200 240 modified wet. solid Figure 5. First term of the objective function for MRF segmentation (equation (13)) as a function of gray value for each class. In the modified class statistics, rw is replaced by rw/2 for the wetting phase. directions in contrast to only one tail for the other classes (Figure 3c). Consequently, rw is much higher than rn and the probability of a voxel in the intermediate intensity range to be assigned to the wetting phase increases accordingly, resulting in even thicker apparent wetting films. This bias is mitigated here by a quick and rather inelegant fix to use rw/2 for the class statistics instead. In this way, the gray values at which the penalty functions for adjacent classes cross each other coincide much better with the previously detected thresholds (Figure 5). The watershed method (L4) successfully removes apparent wetting films in the yellow frame (Figure 4e). However, true wetting films are also being removed (white frame). Segmentation with converging active contours (L5—Figure 4f) exhibit excessive wetting film removal as well. Finally, no method is capable of restoring the small wetting phase bridge (green frame) in Figure 4, because its intensity is highly smoothed due to image blur. 3.1.6. Structural Analysis The shortcomings of the different segmentation methods are corroborated by structural properties of the nonwetting phase listed in Table 1. The misclassification error ME is roughly 3–4% for all methods including global thresholding. However, the lowest ME does not guarantee the best recovery of morphological properties. The bulk volume Vn of the nonwetting phase is slightly underestimated by all segmentation methods, except for watershed segmentation and converging active contours which show a tendency for overestimation, due to the way partial volume voxels are treated. Surprisingly, simple thresholding and hysteresis thresholding match the true an value best, whereas indicator kriging and Bayesian MRF underestimate it and watershed segmentation and converging active contours overestimate it as a consequence of partial volume voxel treatment. More importantly, the specific interface between fluids awn only diverges from the total nonwetting surface an if false wetting films are successfully suppressed. Simple thresholding, hysteresis thresholding, indicator kriging, and Bayesian MRF segmentation all fail in this respect. The only exception is ^ MA , simply because the Bayesian MRF segmentation on the raw data if combined with postprocessing C majority filter removes partial volume voxels. Watershed segmentation and converging active contours underestimate awn because a lot of true wetting films are removed as well. The connectivity index of the true image Cn is matched well by most segmentation methods, because the bubbles remain rather isolated no matter how wetting films are treated. Oversegmentation of the nonwetting phase with watershed segmentation and converging active contours may lead to lower Cn, however, because (i) the denominator Na in equation (17) increases and (ii) the increase in bubble volume is evenly distributed among all clusters in the numerator. The differences in structural properties due to different denoising methods are also listed in Table 1. Obviously, no denoising at all produced the worst results in all respects. Moreover, the structural properties do not vary much among the different denoising methods. Surprisingly, a simple majority filter ^ MA , applied on the segmented raw data without any preprocessing results in the best agreement with the C € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3627 Water Resources Research 10.1002/2014WR015256 Table 1. Structural Properties of Segmented Images for Different Combinations of Denoising and Segmentation Methods: Unsharp Mask (UM), Median (MD), Anisotropic Diffusion (AD), Total Variation (TV), Nonlocal Means (NL), Majority (MA)a ME (–) Vn (–) an (pix21) anw (pix21) Cn (–) 0.000 0.133 0.0169 0.0140 0.355 0.140 0.028 0.029 0.027 0.027 0.024 0.141 0.128 0.124 0.127 0.130 0.129 0.1005 0.0168 0.0165 0.0169 0.0172 0.0169 0.0954 0.0167 0.0165 0.0169 0.0170 0.0168 0.312 0.355 0.355 0.355 0.354 0.355 0.107 0.027 0.029 0.027 0.027 0.024 0.131 0.127 0.123 0.127 0.129 0.128 0.0722 0.0167 0.0164 0.0169 0.0171 0.0167 0.0695 0.0167 0.0164 0.0169 0.0170 0.0167 0.346 0.355 0.355 0.355 0.354 0.355 0.037 0.033 0.037 0.032 0.033 0.029 0.131 0.128 0.122 0.125 0.127 0.131 0.0202 0.0166 0.0162 0.0165 0.0166 0.0167 0.0201 0.0166 0.0162 0.0165 0.0166 0.0167 0.386 0.351 0.353 0.352 0.352 0.352 0.049 0.027 0.033 0.028 0.028 0.038 0.133 0.124 0.119 0.124 0.127 0.134 0.0282 0.0163 0.0160 0.0165 0.0166 0.0170 0.0272 0.0163 0.0160 0.0165 0.0165 0.0140 0.385 0.356 0.357 0.356 0.356 0.356 0.041 0.029 0.025 0.029 0.032 0.037 L5—Converging Active Contours 0.046 I0 ^I MD1UM 0.042 ^I AD1UM 0.042 ^I TV1UM 0.042 ^I NL1UM 0.043 ^ MA 0.043 C 0.136 0.135 0.134 0.135 0.137 0.136 0.0184 0.0172 0.0170 0.0174 0.0175 0.0174 0.0112 0.0110 0.0117 0.0111 0.0102 0.0101 0.358 0.355 0.355 0.355 0.322 0.359 0.136 0.136 0.137 0.136 0.138 0.136 0.0241 0.0173 0.0172 0.0174 0.0175 0.0175 0.0178 0.0108 0.0107 0.0110 0.0108 0.0105 0.390 0.324 0.324 0.324 0.324 0.360 Denoising True Image G6—Global Thresholding I0 ^I MD1UM ^I AD1UM ^I TV1UM ^I NL1UM ^ MA C L1—Hysteresis I0 ^I MD1UM ^I AD1UM ^I TV1UM ^I NL1UM ^ MA C L2—Indicator Kriging I0 ^I MD1UM ^I AD1UM ^I TV1UM ^I NL1UM ^ MA C L3—Bayesian MRF (b 5 0.5) I0 ^I MD1UM ^I AD1UM ^I TV1UM ^I NL1UM ^ MA C L4—Watershed I0 ^I MD1UM ^I AD1UM ^I TV1UM ^I NL1UM ^ MA C a ME is misclassification error, Vn is volume fraction of nonwetting phase, An is surface area density of nonwetting phase, Anw is surface area density between nonwetting and wetting phase, and Cn is connectivity index of nonwetting phase. true bulk properties. This is because any kind of preprocessing, i.e., denoising and edge enhancement, removes structural information to some degree. 3.1.7. Final Workflow At this point, some preliminary conclusions about a suitable image processing protocol can already be drawn in order to set the workflow for the remaining images: 1. The variability in structural properties among the different denoising methods was rather small. Qualitatively, all filters reduce noise in homogeneous areas well. In addition, only the nonlocal means filter sufficiently denoises along edges while smoothing across the edges is inhibited. In this way, an additional treatment of rugged surfaces after edge enhancement is not necessary. Thus, in the remainder of the paper, only the nonlocal means filter will be used. 2. A combination of methods for histogram bias correction causes a better agreement between different threshold detection methods. The outlier-corrected average of five different threshold detection methods reproduced ground-truth information well and will be used for global thresholding. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3628 Water Resources Research 10.1002/2014WR015256 Figure 6. Workflow diagram for multiclass segmentation of the remaining multifluid image and soil image. Headlines denote image processing steps and gray boxes the specific methods. 3. Hysteresis thresholding and indicator kriging are not capable of multiclass segmentation, which leads to misclassifaction errors where more than two phases meet locally. Therefore, only converging active contours, watershed and Bayesian MRF segmentation will be applied to the remaining images. Simple thresholding will also be applied for sake of comparison. 4. Segmentation of the raw data with a subsequent majority filter performed slightly better in terms of structural properties than denoising the intensity data in advance. Both approaches will be compared with each other. These findings translate into the workflow diagram shown in Figure 6. Some steps are optional, e.g., the need for edge enhancement depends on the sharpness of the raw image, ROI dilations are only necessary, if the volume fraction of the phase of interest is very low, etc. 3.2. Synchrotron Image of Three-Fluid Phases in a Porous Medium 3.2.1. Image Enhancement A sample of sintered glass beads with a porosity of roughly 32% has been scanned at a resolution of 9.24 lm. The pore space was partially saturated with air (14%), oil (39%), and water (47%). The cylindrical region of interest has a diameter of 665 voxels and height of 210 voxels. The image is free of ring artifacts and is sharp, without noticeable blur (Figure 7a). The reconstruction method caused a slight decrease in mean intensity for large radii which is evident in the denoised image after nonlocal means denoising (Figure 7b). This radial intensity variation can be removed almost completely (Figure 7c) with intensity bias correction [Iassonov and Tuller, 2010]. This has a considerable impact on the histogram (Figure 7d). The frequency distributions for each class turns from broad and multipeaked into a narrow band. For the same reasons as discussed for the synthetic test image, edges are masked out and the histogram is clipped into a wellbalanced frequency distribution, so that five different global threshold detection methods (G1–G5) yield very similar sets of thresholds. Again, the average after outlier removal (G6) is used for final segmentation using local methods. Note that edge enhancement with unsharp mask filtering has not been applied, as there are hardly any partial volume voxels in this image. 3.2.2. Image Segmentation Simple thresholding (G6), Bayesian MRF segmentation (L3), watershed segmentation (L4), and converging active contours (L5) are applied to the multifluid image, either on the preprocessed ^I NL or on the raw I0 followed by a majority filter. A small subset of the segmentation results are depicted in Figure 8. Since the exact arrangement of interfaces is unknown, it is difficult to judge objectively what combination of methods performs best. The edge between beads and air is roughly two voxels thick. In the smooth ^I NL image, the edge is assigned to oil films, water films or both (Figures 8d, 8f, 8h, and 8j), whereas in the noisy image edge voxels are equally assigned to all four classes but subsequently assigned to air or beads since they constitute the most representative class in the neighborhood of an edge (Figures 8c, 8e, 8g, and 8i). It is almost impossible to conclusively determine whether a fluid film thicker than the image resolution really covers the beads entirely. However, small isolated water voxels between solid voxels and oil film are highly unlikely, and should be suppressed. Global thresholding without postprocessing has no mechanism to € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3629 Water Resources Research a b 0.045 0.035 c 0.008 (a) (b) (c) 0.04 d clip e 0.007 G1 G2 0.006 frequency 0.03 frequency 10.1002/2014WR015256 0.025 0.02 0.015 G3 0.005 G4 0.004 G5 0.003 G6 0.002 0.01 0.001 0.005 0 0 0 50 100 150 200 250 25 50 75 100 gray value 125 150 175 200 225 gray value Figure 7. Image enhancement of multifluid image: (a) raw image I0 , (b) nonlocal means filter ^I NL on I0 , (c) beam hardening removal on ^I NL . The yellow frame outlines the subset in Figure 8. (d) Histograms of Figures 7a–7c. (e) Global threshold detection after histogram clipping. achieve this. Bayesian MRF segmentation of ^I NL only succeeded in doing so with a relatively high homogeneity parameter (b 5 10). In turn, watershed segmentation fails to detect an oil film at the left side of the pendular water ring, either because of a lacking seed voxel for oil or because the underlying gradient image is not sharp enough to evoke two distinct edges within such a short distance. Converging active contours result in subjectively plausible results for both I0 and ^I NL . 3.2.3. Image Analysis The qualitative analysis illustrated in Figure 8 is corroborated by the results with respect to structural analysis in Table 2. The bulk volumes of air and oil, Va and Vo, remain largely unaffected by the choice of global a MRF c watershed CAC e g i f h j raw solid oil air NL means water b d Figure 8. Segmentation results for multifluid image: (a) raw image I0 , (b) nonlocal means filter ^I NL , global thresholding on (c) I0 with postprocessing or (d) on ^I NL , Bayesian MRF segmentation (e) with b 5 0.1 on I0 with postprocessing or (f) with b 5 10 on ^I NL , watershed segmentation (g) on I0 with postprocessing or (h) on ^I NL , converging active contours (i) on I0 with postprocessing or (j) on ^I NL . € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3630 Water Resources Research 10.1002/2014WR015256 Table 2. Volume Fractions V and Surface Area Densities a of Air (a) and Oil (o) for the Segmented Multifluid Imagea Denoising Va (–) aa (mm21) Vo (–) ao (mm21) aao (mm21) G6—Global Thresholding ^ MA C ^I NL 0.047 0.046 0.400 0.545 0.125 0.125 1.621 2.281 0.120 0.384 0.047 0.046 0.046 0.046 0.482 0.505 0.459 0.397 0.125 0.124 0.125 0.125 1.840 2.214 2.063 1.686 0.215 0.369 0.327 0.149 0.048 0.048 L5—Converging Active Contours ^ MA 0.047 C ^I NL 0.048 0.403 0.406 0.127 0.127 1.646 1.631 0.107 0.109 L3—Bayesian MRF ^ MA ðb5 0:1Þ C ^I NL ðb5 0:1Þ ^I NL ðb51Þ ^I NL ðb510Þ L4—Watershed ^ MA C ^I NL 0.421 0.407 0.121 1.642 0.130 0.123 1.650 0.119 a ^ is either applied prior to segmentation with a nonlocal means filter I NL or as postprocessing with a majority filter Denoising ^ MA . C segmentation methods. The surface area densities ao, aa, and aao, however, vary considerably among different segmentation methods. This is because small image objects like films, meniscii, and ganglia exhibit large surface-to-volume ratios and at the same time are associated with the highest uncertainty in terms of ^ MA leads to a general reduction in surface areas and to very similar class assignment. A majority filter C results for all segmentation methods. The surface areas increase with a decreasing boundary penalty factor b for Bayesian MRF segmentation on ^I NL and are highest for global thresholding on ^I NL , where there is no ^ MA , penalty term at all. The ao and aa values for b 5 10 are very similar to all postprocessed class images C to the outcome of converging active contours on ^I NL , and to the result after watershed segmentation on ^I NL . Yet, the specific surface area between air and oil (aao 5 0.149) is as much as 27% higher than the average of other methods (aao 5 [0.107 – 0.130]). Eventually, a decision as to which image is closer to reality needs to be made. But without an analytical solution or other ground-truth information such a decision will suffer from a certain level of subjectivity. 3.3. lCT Image of Soil 3.3.1. Image Enhancement The soil image corresponds to image 2 in Houston et al. [2013b] with a size of 256 3 256 3 256 voxels and a resolution of 32 lm. The sample consists of macropores (Vp 12%), organic matter (Vo 13%), soil matrix (Vm 72%), and dense particles like rocks (Vr 3%). The raw data I0 (Figure 9a) is again denoised with a nonlocal means filter ^I NL (Figure 9b). Afterward, edges are enhanced by an unsharp mask ^I NL1UM with r 5 1 and w 5 0.5 so that intensity values of partial volume voxels are forced closer to their respective class means (Figure 9c). The gradient mask in Figure 9d detects partial volume voxels at phase edges which are excluded from subsequent threshold detection. Each of these methods has a favorable impact on the intensity histogram (Figure 9e) in that valleys between the class peaks are much more pronounced. The combination of all methods, i.e., edge masking on ^I NL1UM , together with histogram clipping leads to a modified histogram for which all five threshold detection methods (G1–G5) lead to similar results (Figure 9e). The average after outlier removal (G6) is again used for all locally adaptive segmentation methods. 3.3.2. Image Segmentation Only some of the segmentation methods introduced in this paper are applied to the soil image. Global ^ MA (Figure 10a) shall serve as a refthresholding of the raw data I0 in combination with a majority filter C erence to which the other methods can be compared. Evidently, simple thresholding already leads to rather satisfying results if it is accompanied by suitable postprocessing. However, the segmented image clearly suffers from false organic coatings around macropores which can be attributed to incorrect assignment of partial volume voxels (violet frame). This misclassification of boundary voxels can be avoided with watershed segmentation (Figure 10b) and converging active contours (Figure 10c) both applied to denoised and edge-enhanced image ^I NL1UM . In addition, even thin macropores are correctly detected (green frame). Bayesian MRF segmentation is applied to ^I NL1UM with b 5 0.1, b 5 1, and b 5 10, € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3631 Water Resources Research 10.1002/2014WR015256 0.025 a c e a b 0.02 c 0.015 0.01 frequency d clip 0.005 0 0 40 80 120 160 200 240 gray value d f 0.01 0.008 0.006 frequency b combined G1 G2 G3 G4 G5 G6 0.004 0.002 0 0 40 80 120 160 200 240 gray value Figure 9. Image enhancement of a soil image: (a) raw image I0 [Houston et al., 2013b], (b) ^I NL after nonlocal means denoising, (c) ^I NL1UM after unsharp mask, (d) gradient mask on ^I NL1UM , (e) histogram of Figures 9b–9d and histogram clipping of Figures 9b. (f) Histogram after combined postprocessing (b 1 c 1 d 1 clip) together with the corresponding thresholds obtained by various global thresholding methods. respectively (Figures 10d–10f). We observe that the segmented images look very different for different b. If the penalty term for class boundaries (second term of equation (13)) has a low weight (b 5 0.1), the image looks very similar to (Figure 10a), i.e., all image objects are well preserved, yet all macropores exhibit false coatings of organic matter. In turn, if the homogeneity parameter is set very high (b 5 10), partial volume effects are suppressed and so are small image objects in general, like the thin macropore in the green frame. A moderate value (b 5 1) leads to an unsatisfactory trade-off between the two problems. 3.3.3. Image Analysis The qualitative interpretation is again corroborated by structural properties of macropores and organic matter summarized in Table 3. Bulk volume and surface area vary much more among different segmentation methods as compared to the multifluid image. For instance, the surface area density aop between organic matter and macropores decreases by a factor of 2.5 if the homogeneity parameter for Bayesian MRF segmentation is increased from b 5 0.1 (Figure 10d) to b 5 10 (Figure 10f), and is even smaller for watershed segmentation (Figure 10b) and converging active contours (Figure 10c). In addition, the connectivity indicator C is also sensitive to the choice of segmentation method, because small objects close to the image resolution have a high impact on the continuity of a phase. Therefore, macropore connectivity increases from Cp 5 (0.72 – 0.80) to Cp 5 (0.83 – 0.84) if thin macropores that connect larger pore bodies are correctly identified, e.g., the green frame in Figure 10. At the same time, the connectivity of organic matter decreases drastically from Cp 5 (0.94 – 0.96) to Cp 5 (0.64 – 0.74) if false organic coatings of macropores (violet frame) are successfully suppressed. 4. Discussion 4.1. Image Enhancement We have demonstrated how essential a suitable combination of image enhancement methods can be for subsequent image segmentation. The correction of intensity bias due to beam hardening was adequately € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3632 Water Resources Research a 10.1002/2014WR015256 b c e f pore soil matrix rock organic matter d Figure 10. Image segmentation of the soil image: (a) global thresholding on I0 with postprocessing, (b) watershed segmentation on ^I NL1UM , (c) converging active contours on ^I NL1UM , (d–f) Bayesian MRF segmentation on ^I NL1UM with b 5 {0.1, 1, 10}, respectively. corrected by the method of Iassonov and Tuller [2010], since the examined sample was cylindrical and fairly homogeneous. If the intensity bias had a more complex shape, which cannot be readily described as function of radius, the bias model could have been obtained by interpolation instead. For instance, the approach of Yanowitz and Bruckstein [1989] to use the Laplace equation to interpolate a threshold surface between slowly varying gray values along phase boundaries can be adapted to interpolate a bias surface between objects of the highest intensity class. Even better results can be expected by using the Poisson equation for this purpose [Perez et al., 2003]. The ring artifact removal routine [Sijbers and Postnov, 2004] succeeded in removing most of the rings. However, the ring artifact removal is incomplete if the artifact magnitude is not constant along rotation angle /. Moreover, the back transform from polar into Cartesian Table 3. Volume Fractions V, Surface Area Densities a, and Connectivity Indices C of Pores p and Organic Residues o for the Soil Imagea Denoising Vp (–) G6—Global Thresholding ^ MA 0.082 C ^I NL 0.096 L3—Bayesian MRF ^ MA ðb5 0:1Þ 0.085 C ^I NL ðb5 0:1Þ 0.093 ^I NL ðb51Þ 0.086 ^I NL ðb510Þ 0.087 L4—Watershed ^ MA 0.110 C ^I NL 0.117 L5—Converging Active Contours ^ MA 0.088 C ^I NL 0.129 € SCHLUTER ET AL. Vo (–) ao (mm21) aop (mm21) Cp (–) Co (–) 0.213 0.190 2.74 3.30 0.91 1.29 0.721 0.777 0.953 0.940 0.208 0.197 0.193 0.176 2.63 2.97 2.13 1.46 0.91 1.12 0.71 0.44 0.734 0.787 0.795 0.822 0.953 0.950 0.961 0.963 0.137 0.127 1.05 0.84 0.30 0.25 0.802 0.834 0.652 0.644 0.135 0.116 1.24 0.66 0.40 0.24 0.730 0.825 0.744 0.681 a Denoising is either applied prior to segmentation with a nonlocal means filter ^I NL or as postprocessing with a majority filter ^ MA . C C 2014. American Geophysical Union. All Rights Reserved. V 3633 Water Resources Research 10.1002/2014WR015256 coordinates introduces additional blur that increases with radius r. Thus, the performance is always somewhat worse as compared to line removal directly applied on the sinograms when this is possible [Ketcham, 2006]. Also, some methods operate on rings in Cartesian space directly [Freundlich, 1987]. Finally, Fourier and wavelet filters generally lead to improved ring removal [Raven, 1998; M€ unch et al., 2009]. The surveyed denoising methods were efficient in removing noise while keeping the blurring of edges at a minimum, given that the associated parameters are set adequately. In fact, image noise today has become a secondary issue for successful image analysis. Instead, image blur has been identified as more of a pitfall for the success of the various segmentation methods in this study. Edge enhancement with unsharp masks partly mitigates image blur, but is not capable of removing partial volume effects completely. Surely, future advances in X-ray tomography hardware and reconstruction software will lead to steady improvements in image quality, so that sharper images can be acquired on a routine basis. One of our significant findings was that even though image enhancement is often indispensable for robust threshold detection, it does not necessarily imply that the segmentation itself also has to be applied to the enhanced image. Instead, segmenting the raw data with the thus obtained thresholds may lead to fewer misclassification errors if suitable postprocessing is applied to the segmented images. This is because any image enhancement inevitably destroys some structural information in the raw data. For the images examined in this study, a majority filter on the segmented raw data produced good results, mostly because the objects had rather smooth, convex boundaries. In turn, a majority filter can be a less desirable option if true objects are thin, concave, have rough surfaces or acute angles. It is up to the user to always compare and decide which is the most favorable option. 4.2. Global Thresholding We have demonstrated that every histogram-based thresholding method relies on certain assumptions about the histogram shape. This introduces some bias if the class modes have different variance, skewness, or proportion. Many bias correction techniques for the standard methods used here have been suggested. For instance, minimum error thresholding can be corrected for imbalanced overlap [Cho et al., 1989] or can be used with Poisson distributions for each class instead of Gaussian distributions [Pal and Bhandari, 1993]. Shannon entropy can be replaced by Tsallis entropy for maximum entropy thresholding, allowing for an additional degree of freedom that can be used to tune the results [de Albuquerque et al., 2004]. Fuzzy cmeans can be corrected for imbalanced class proportions [Jawahar et al., 1997]. The list is virtually endless. However, we found it more useful to (i) put some effort into suitable image enhancement prior to thresholding, (ii) alleviate the impact of bias by histogram clipping, and (iii) use the average after outlier removal to determine thresholds for subsequent segmentation. Even with this preprocessing methodology there might be soil images that still exhibit unimodal histograms due to a lot of unresolved porosity or very imbalanced class proportions [Wang et al., 2011; Baveye et al., 2010]. The second problem can be avoided with a semiautomatic algorithm based on ROI dilations (Figure 11). For instance, small rocks constitute only 3% to volume in the soil image and are hard to distinguish as an individual class in the histogram. The threshold for the region of interest (ROI) is set to the class mean (t 5 ls 5 203). Consecutive dilations of the ROI mask lead to a clearly bimodal histogram for which a threshold between the soil and rock class can be easily identified. Note that the thus obtained local histogram minimum at 182 is very close to t 5 185 in Figure 9f, but much easier to identify. An alternative to deal with unimodal histograms, which is, however, restricted to two-class segmentation, is to estimate a threshold from the mean gray value within edge regions [Panda and Rosenfeld, 1978; Schl€ uter et al., 2010]. The problem of too much unresolved porosity or too gradual intensity changes is more severe and puts the entire concept of segmentation into question. In this case, some morphology analysis can be deployed to the intensity data directly, including distance transforms [Jang and Hong, 2001], isosurfaces [McClure et al., 2007], skeletonization [Chung and Sapiro, 2000], or tortuosity [Gommes et al., 2009]. 4.3. Local Segmentation We have corroborated the importance of using locally adaptive methods truly capable of multiclass segmentation instead of applying iterations of binarizations [Tuller et al., 2013]. Methods that fulfill this criterion are Bayesian MRF segmentation [Berthod et al., 1996; Kulkarni et al., 2012], watershed segmentation [Beucher and Lantuejoul, 1979; Vincent and Soille, 1991; Roerdink and Meijster, 2000], and converging active contours [Sheppard et al., 2004]. Bayesian MRF segmentation was originally developed for image classification in the € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3634 Water Resources Research 10.1002/2014WR015256 0.0014 no mask t=203 1. iter 2. iter 3. iter 4. iter 5. iter 6. iter 0.0012 frequency 0.001 0.0008 0.0006 0.0004 0.0002 0 0 40 80 120 160 gray value 200 240 Figure 11. Iterative algorithm to identify a truely bimodal histogram at high gray values in the soil image. presence of additive noise and is based on the assumption that individual class modes follow a Gaussian distribution. This causes failure of the method for denoised images with pronounced partial volume effects due to image blur, which leads to either one-sided or two-sided tailings in the histogram. As a consequence, special care has to be taken to correct for this histogram bias. In addition, the results depend heavily on the homogeneity parameter b. Erroneous assignment of partial volume voxels could only be suppressed with a high penalty on class boundaries (b 5 10), which at the same time removed true image features of similar size. A promising improvement of the Bayesian MRF segmentation, especially if applied to fluid images, would be to replace the unspecified penalty term (equation (14)) with real surface tensions [Knight et al., 1990; Silverstein and Fort, 2000]. In fact, any noninvasive laboratory technique which provides independent measurements about a structural property of the same sample can potentially help to condition segmentation parameters. This has been recently demonstrated for three-phase segmentation of limestone via hysteresis thresholding, where the region growing parameters were conditioned by independent porosity measurements [Mangane et al., 2013]. For the watershed segmentation in its current implementation, a benefit is that it only requires simple thresholding and a gradient image. However, the lack of additional parameters also hampers flexibility. Therefore, excessive elimination of wetting films in the synthetic image could not be avoided. In contrast, the somewhat similar converging active contours method honors local gradient and intensity information at the same time. A reported drawback of this method is its sensitivity to seed region detection and the adjustment of parameters for the speed function [Iassonov et al., 2009]. In this paper, we demonstrated strategies to define the seed regions automatically. In addition, the parameters for the speed function can be obtained directly by a careful evaluation of the gradient histogram (Figure 12). The frequency of local gradients is almost always unimodal, and therefore, difficult to threshold with standard methods. Yet, every unimodal histogram exhibits some characteristic features such as a point of maximum curvature [Tsai, 1995] or the point of maximum distance from an auxiliary line between the histogram mode and the maximum bin [Rosin, 2001] (Figure 12, inset). In addition, some gradient histograms may exhibit secondary shoulders if the intensity data comprise fairly constant edge heights. Picking one of these points as a gradient threshold would result in a fully objective segmentation method. In this study, we always used the gradient cutoff detection with Rosin’s method. In turn, adjusting this cutoff adds some flexibility to the speed function in the same way as b scales the penalty term in the Bayesian MRF method. For instance, we achieved a very good reproduction of the true interfacial area between fluids awn in the synthetic test image by setting the gradient cutoff to a higher value. As expected, objects that are close in size to the image resolution were associated with the highest uncertainty in all test images. Five voxels in diameter has been suggested as a rough estimate for an object size limit for which image analysis is reliable [Lehmann et al., 2006; Vogel et al., 2010]. Features like the thin oil € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3635 Water Resources Research 10.1002/2014WR015256 0.1 0.1 frequency 0.08 frequency 0.01 0.06 0.04 0.02 0 0.001 50 100 150 gray value 200 Sobel max. deviation max. curvature local minimum 0.0001 0 25 50 75 100 125 150 gray value 175 200 225 250 Figure 12. Gradient histogram computed on a Sobel image of ^I NL1UM for the synthetic image. Each unimodal histogram has a unique point of maximum curvature and maximum distance from the auxiliary line depicted in the inset. In addition, the shoulder in the histogram produces a local minimum, which can be detected as well. films and pendular rings at grain contacts in the synchrotron image clearly do not meet this criterion. Brown et al. [2014] compared surface area densities of smooth fluid interfaces for the same kind of three-fluid samples when improving the image resolution from 10.6 to 5.3 lm and observed an increase in the range of 5– 16%, depending on the specific fluid pair. An improvement in resolution usually comes at the cost of a smaller field of view as far as industrial scanners are concerned, as well as increased noise unless the scan time is increased as well. For heterogeneous media, especially, there is a trade-off between a representative image size and sufficient resolution to capture important details. However, the scale window can also be extended toward smaller objects by scanning the same sample at different resolutions, even with different imaging techniques, and merging the images with image registration [Latham et al., 2008], statistical fusion of images [Mohebi et al., 2009] or scale fusion applied to structure analysis data only [Vogel et al., 2010; Schl€ uter et al., 2011]. Finally, for this study, we have merely compared the outcome of different local segmentation methods with each other. Of course, several methods can also be combined. For instance, Bayesian MRF segmentation with surface tensions could be used for postprocessing the outcome of converging active contours. If necessary, postprocessing with a majority filter could also be applied in addition to denoising prior to segmentation. After all, the choice of a suitable protocol always depends on characteristics of the raw data. 5. Conclusions We have surveyed recent advances in image enhancement and image segmentation of multiphase X-ray microtomography data. Image enhancement methods included ring artifact removal, intensity bias correction, edge enhancement, image denoising, and contrast enhancement. Image segmentation methods comprised six global segmentation and five different locally adaptive segmentation methods. Some general conclusions can be drawn from our findings: 1. Image blur is the major cause of poor segmentation results in this study, since image noise and other image artifacts could be removed with current image processing methods. 2. A lot of uncertainty in threshold detection can be removed by suitable preprocessing and correction of histograms for bias in the class statistics prior to thresholding. 3. Bayesian Markov random field segmentation, watershed segmentation and converging active contours are suited for multiclass segmentation. The converging active contour method has potentially the highest flexibility to correct for partial volume effects and simultaneously conserve small image features. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3636 Water Resources Research 10.1002/2014WR015256 Appendix A Most image processing steps presented in this study can be done with freely available software. The only exceptions are the nonlocal means filter for denoising and the watershed method for segmentation, which were carR FIRE (http://www.vsg3d.com/avizo/fire). Moreover, the converging active contours method ried out with AVIZOV for segmentation was performed with MANGO (http://physics.anu.edu.au/appmaths/capabilities/mango.php). The conversion between Cartesian and polar coordinates for ring artifact removal was performed with the freely available Polar Transformer (http://rsbweb.nih.gov/ij/plugins/polar-transformer.html) plug-in for IMAGEJ. All other image processing steps are either built on or directly implemented in the QUANTIM (http:// www.quantim.ufz.de) open-source image processing library. Acknowledgments We thank Alasdair Houston (SIMBIOS, Abertay, UK) for providing the lCT soil image and three anonymous reviewers for their helpful comments. S. Schl€ uter is grateful to the Alexander-vonHumboldt Foundation for granting a Feodor-Lynen scholarship. A. Sheppard acknowledges the support of an Australian Research Council Future Fellowship. € SCHLUTER ET AL. References Anderson, S., and J. Hopmans (Eds.) (2013), Soil–Water–Root Processes: Advances in Tomography and Imaging, vol. 61, SSSA Spec. Publ., Soil Sci. Soc. of Am., Madison, WI, doi:10.2136/sssaspecpub61. Armstrong, R., M. Porter, and D. Wildenschild (2012), Linking pore-scale interfacial curvature to column-scale capillary pressure, Adv. Water Resour., 46, 55–61. Baveye, P. C., et al. (2010), Observer-dependent variability of the thresholding step in the quantitative analysis of soil images and X-ray microtomography data, Geoderma, 157(1–2), 51–63, doi:10.1016/j.geoderma.2010.03.015. Berthod, M., Z. Kato, S. Yu, and J. Zerubia (1996), Bayesian image classification using Markov random fields, Image Vision Comput., 14(4), 285–295. Besag, J. (1986), On the statistical analysis of dirty pictures, J. R. Stat. Soc., Ser. B, 48, 259–302. Beucher, S., and C. Lantuejoul (1979), Use of watersheds in contour detection, paper presented at International Workshop Image Processing, Real-Time Edge and Motion Detection/Estimation, CCETT, Rennes, France. Blunt, M. J., B. Bijeljic, H. Dong, O. Gharbi, S. Iglauer, P. Mostaghimi, A. Paluszny, and C. Pentland (2013), Pore-scale imaging and modelling, Adv. Water Resour., 51, 197–216, doi:10.1016/j.advwatres.2012.03.003. Brown, K., S. Schl€ uter, A. Sheppard, and D. Wildenschild (2014), On the challenges of measuring interfacial characteristics of three-phase fluid flow with x-ray microtomography, J. Microsc., 253(3), 171–182, doi:10.1111/jmi.12106. Buades, A., B. Coll, and J. M. Morel (2005), A non-local algorithm for image denoising, in Computer Society Conference on Computer Vision and Pattern Recognition, 2005. CVPR 2005, vol. 2, pp. 60–65, IEEE Comput. Soc., San Diego, CA, doi:10.1109/CVPR.2005.38. Buades, A., B. Coll, and J.-M. Morel (2008), Nonlocal Image and Movie Denoising, Int. J. Comput. Vision, 76(2), 123–139, doi:10.1007/s11263007-0052-1. Canny, J. (1986), A computational approach to edge detection, IEEE Trans. Pattern Anal. Mach. Intel., 8(6), 679–698, doi:10.1109/ TPAMI.1986.4767851. Catt e, F., P. Lions, J. Morel, and T. Coll (1992), Image selective smoothing and edge detection by nonlinear diffusion, SIAM J. Numer. Anal., 29(1), 182–193, doi:10.1137/0729012. Cho, S., R. Haralick, and S. Yi (1989), Improvement of kittler and illingworth’s minimum error thresholding, Pattern Recognition, 22(5), 609– 617, doi:10.1016/0031–3203(89)90029-0. Chung, D. H., and G. Sapiro (2000), Segmentation-free skeletonization of gray-scale images via PDEs, in Proceedings 2000 International Conference on Image Processing, IEEE International Conference on Image Processing, vol. 2, pp. 927–930, Vancouver, BC, Canada. doi: 10.1109/ICIP.2000.899868. Cnudde, V., M. N. Boone (2013), High-resolution X-ray computed tomography in geosciences: A review of the current technology and applications, Earth-Science Reviews, 123, 1–17, doi:10.1016/j.earscirev.2013.04.003. Costanza-Robinson, M. S., K. H. Harrold, and R. M. Lieb-Lappen (2008), X-ray microtomography determination of air-water interfacial area-water saturation relationships in sandy porous media, Environ. Sci. Technol., 42(8), 2949–2956, doi:10.1021/ es072080d. Culligan, K., D. Wildenschild, B. Christensen, W. Gray, M. Rivers, and A. Tompson (2004), Interfacial area measurements for unsaturated flow through a porous medium, Water Resour. Res., 40, W12413, doi:10.1029/2004WR003278. de Albuquerque, M. P., I. Esquef, A. G. Mello, and M. P. de Albuquerque (2004), Image thresholding using Tsallis entropy, Pattern Recognition Lett., 25(9), 1059–1065, doi:10.1016/j.patrec.2004.03.003. Ferreira, T., and W. Rasband (2012), ImageJ User Guide—IJ 1.46r, National Institute of Health, Bethesda, MD. Freundlich, D. A. (1987), Ring artifact correction for computerized tomography, Patent [4,670,840], U.S. Patent and Trademark Off., Washington, D. C. Gommes, C., A. Bons, S. Blacher, J. Dunsmuir, and A. H. Tsou (2009), Practical methods for measuring the tortuosity of porous materials from binary or gray-tone tomographic reconstructions, AIChE J., 55(8), 2000–2012, doi:10.1002/aic.11812. Gonzalez, R., and R. Woods (2002), Digital Image Processing, pp. 793, Prentice Hall, Upper Saddle Creek, N. J. Hoshen, J., and R. Kopelman (1976), Percolation and cluster distribution. I. Cluster multiple labeling technique and critical concentration algorithm, Phys. Rev. B Solid State, 14, 3438–3445, doi:10.1103/PhysRevB.14.3438,5. Houston, A., S. Schmidt, A. Tarquis, W. Otten, P. Baveye, and S. Hapca (2013a), Effect of scanning and image reconstruction settings in X-ray computed microtomography on quality and segmentation of 3D soil images, Geoderma, 207, 154–165, doi:10.1016/ j.geoderma.2013.05.017. Houston, A., W. Otten, P. Baveye, and S. Hapca (2013b), Adaptive-window indicator kriging: A thresholding method for computed tomography images of porous media, Comput. Geosci., 54, 239–248, doi:10.1016/j.cageo.2012.11.016. Huang, T., G. Yang, and G. Tang (1979), A fast two-dimensional median filtering algorithm, IEEE Trans. Acoust. Speech Signal Process., 27(1), 13–18, doi:10.1109/TASSP.1979.1163188. Iassonov, P., and M. Tuller (2010), Application of segmentation for correction of intensity bias in X-ray computed tomography images, Vadose Zone J., 9(1), 187–191, doi:10.2136/vzj2009.0042. Iassonov, P., T. Gebrenegus, and M. Tuller (2009), Segmentation of X-ray CT images of porous materials: A crucial step for characterization and quantitative analysis of pore structures, Water Resour. Res., 45, W09415, doi:10.1029/2009WR008087. C 2014. American Geophysical Union. All Rights Reserved. V 3637 Water Resources Research 10.1002/2014WR015256 Ibanez, L., W. Schroeder, L. Ng, and J. Cates (2005), The ITK Software Guide, 2nd ed., Kitware Inc. Jang, J.-H., and K.-S. Hong (2001), A pseudo-distance map for the segmentation-free skeletonization of gray-scale images, in Proceedings of Eighth IEEE International Conference on Computer Vision, ICCV 2001, vol. 2, pp. 18–23, Vancouver, BC, Canada, doi:10.1109/ICCV.2001.937586. Jawahar, C., P. Biswas, and A. Ray (1997), Investigations on fuzzy thresholding based on fuzzy clustering, Pattern Recognition, 30(10), 1605– 1613, doi:10.1016/S0031–3203(97)00004-6. Kaestner, A., E. Lehmann, and M. Stampanoni (2008), Imaging and image processing in porous media research, Adv. Water Resour., 31(9), 1174–1187, doi:10.1016/j.advwatres.2008.01.022. Kapur, J., P. Sahoo, and A. Wong (1985), A new method for gray-level picture thresholding using the entropy of the histogram, Comput. Vision Graphics Image Process., 29(3), 273–285. Ketcham, R., and W. Carlson (2001), Acquisition, optimization and interpretation of X-ray computed tomographic imagery: Applications to the geosciences, Comput. Geosci., 27(4), 381–400. Ketcham, R. A. (2005), Computational methods for quantitative analysis of three-dimensional features in geological specimens, Geosphere, 1(1), 32–41. Ketcham, R. A. (2006), New algorithms for ring artifact removal, Proc. SPIE, 6318, 63180O7, doi:10.1117/12.680939. Kittler, J., and J. Illingworth (1986), Minimum error thresholding, Pattern Recognition, 19(1), 41–47, doi:10.1016/0031–3203(86)90030-0. Knight, R., A. Chapman, and M. Knoll (1990), Numerical modeling of microscopic fluid distribution in porous media, J. Appl. Phys., 68(3), 994–1001. Kulkarni, R., M. Tuller, W. Fink, and D. Wildenschild (2012), Three-dimensional multiphase segmentation of X-Ray CT data of porous materials using a Bayesian Markov random field framework, Vadose Zone J., 11(1), pp. 74–85, doi:10.2136/vzj2011.0082. Kurita, T., N. Otsu, and N. Abdelmalek (1992), Maximum likelihood thresholding based on population mixture models, Pattern Recognition, 25(10), 1231–1240, doi:10.1016/0031-3203(92)90024-D. Latham, S., T. Varslot, and A. Sheppard (2008), Image registration: Enhancing and calibrating X-ray micro-CT imaging, paper presented at Proceedings of the Society of Core Analysis, SCA2008-35, pp. 1–12, Abu Dhabi, UAE. Lehmann, P., P. Wyss, A. Flisch, E. Lehmann, P. Vontobel, M. Krafczyk, A. Kaestner, F. Beckmann, A. Gygi, and H. Fl€ uhler (2006), Tomographical imaging and mathematical description of porous media used for the prediction of fluid distribution, Vadose Zone J., 5, 80–97. Liao, P.-S., T.-S. Chen, and P.-C. Chung (2001), A fast algorithm for multilevel thresholding, J. Inf. Sci. Eng., 17(5), 713–728. Mangane, P. O., P. Gouze, and L. Luquot (2013), Permeability impairment of a limestone reservoir triggered by heterogeneous dissolution and particles migration during CO2-rich injection, Geophys. Res. Lett., 40, 4614–4619, doi:10.1002/grl.50595. McClure, J., D. Adalsteinsson, C. Pan, W. Gray, and C. Miller (2007), Approximation of interfacial properties in multiphase porous medium systems, Adv. Water Resour., 30(3), 354–365, doi:10.1016/j.advwatres.2006.06.010. Mohebi, A., P. Fieguth, and M. Ioannidis (2009), Statistical fusion of two-scale images of porous media, Adv. Water Resour., 32, 1567–1579. M€ unch, B., P. Trtik, F. Marone, and M. Stampanoni (2009), Stripe and ring artifact removal with combined wavelet—Fourier filtering, Opt. Express, 17(10), 8567–8591, doi:10.1364/OE.17.008567. Oh, W., and B. Lindquist (1999), Image thresholding by indicator kriging, IEEE Trans. Pattern Anal. Mach. Intel., 21(7), 590–602. Otsu, N. (1975), A threshold selection method from gray-level histograms, Automatica, 11(285-296), 23–27. Pal, N. R., and D. Bhandari (1993), Image thresholding: Some new techniques, Signal Process., 33(2), 139–158, doi:10.1016/0165– 1684(93)90107-L. Pal, N. R., and S. K. Pal (1993), A review on image segmentation techniques, Pattern Recognition, 26(9), 1277–1294, doi:10.1016/0031– 3203(93)90135-J. Panda, D., and A. Rosenfeld (1978), Image segmentation by pixel classification in (gray level, edge value) space, IEEE Trans. Comput., C-27(9), 875–879, doi:10.1109/TC.1978.1675208. P erez, P., M. Gangnet, and A. Blake (2003), Poisson image editing, ACM Trans. Graphics, 22(3), 313–318. Perona, P., and J. Malik (1990), Scale-space and edge detection using anisotropic diffusion, IEEE Trans. Pattern Anal. Mach. Intel., 12(7), 629– 639. Pizer, S. M., E. P. Amburn, J. D. Austin, R. Cromartie, A. Geselowitz, T. Greer, B. T. H. Romeny, J. B. Zimmerman, and K. Zuiderveld (1987), Adaptive histogram equalization and its variations, Comput. Vision Graphics Image Process., 39(3), 355–368, doi:10.1016/S0734189X(87)80186-X. Porter, M., and D. Wildenschild (2010), Image analysis algorithms for estimating porous media multiphase flow variables from computed microtomography data: A validation study, Comput. Geosci., 14(1), 15–30. Raven, C. (1998), Numerical removal of ring artifacts in microtomography, Rev. Sci. Instrum., 69(8), 2978–2980, doi:10.1063/1.1149043. Renard, P., and D. Allard (2013), Connectivity metrics for subsurface flow and transport, Adv. Water Resour., 51, 168–196, doi:10.1016/ j.advwatres.2011.12.001. Ridler, T., and S. Calvard (1978), Picture thresholding using an iterative selection method, IEEE Trans. Syst. Man Cybern., 8(8), 630–632, doi: 10.1109/TSMC.1978.4310039. Roerdink, J. B., and A. Meijster (2000), The watershed transform: Definitions, algorithms and parallelization strategies, Fundam. Inf., 41(1), 187–228. Rogasik, H., J. Crawford, O. Wendroth, I. Young, M. Joschko, and K. Ritz (1999), Discrimination of soil phases by dual energy X-ray tomography, Soil Sci. Soc. Am. J., 63, 741–751. Rosin, P. (2001), Unimodal thresholding, Pattern Recognition, 34(11), 2083–2096. Rudin, L., S. Osher, and E. Fatemi (1992), Nonlinear total variation based noise removal algorithms, Physica D, 60, 259–268, doi:10.1016/ 0167-2789(92)90242-F. Russ, J. C. (2006), The Image Processing Handbook, CRC Press, Boca Raton, FL. Sahoo, P., S. Soltani, and A. Wong (1988), A survey of thresholding techniques, Comput. Vision Graphics Image Process., 41(11), 233–260. Schl€ uter, S., U. Weller, and H.-J. Vogel (2010), Segmentation of X-ray microtomography images of soil using gradient masks, Comput. Geosci., 36(10), 1246–1251, doi:10.1016/j.cageo.2010.02.007. Schl€ uter, S., U. Weller, and H.-J. Vogel (2011), Soil-structure development including seasonal dynamics in a long-term fertilization experiment, J. Plant Nutr. Soil Sci., 174(3), 395–403, doi:10.1002/jpln.201000103. Serra, J. (1982), Mathematical Morphology and Image Analysis, Academic, San Diego, Calif. Sezgin, M., and B. Sankur (2004), Survey over image thresholding techniques and quantitative performance evaluation, J. Electron. Imaging, 13(1), 146–165. Sheppard, A., R. Sok, and H. Averdunk (2004), Techniques for image enhancement and segmentation of tomographic images of porous materials, Physica A, 339(1), 145–151. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3638 Water Resources Research 10.1002/2014WR015256 Sijbers, J., and A. Postnov (2004), Reduction of ring artefacts in high resolution micro-CT reconstructions, Phys. Medicine Biol., 49(14), N247– N253. Silverstein, D., and T. Fort (2000), Prediction of air-water interfacial area in wet unsaturated porous media, Langmuir, 16(2), 829–834. Stark, J. (2000), Adaptive image contrast enhancement using generalizations of histogram equalization, IEEE Trans. Image Process., 9(5), 889–896, doi:10.1109/83.841534. Trier, O., and A. Jain (1995), Goal-directed evaluation of binarization methods, IEEE Trans. Pattern Anal. Mach. Intel., 17(12), 1191–1201, doi: 10.1109/34.476511. Tsai, D.-M. (1995), A fast thresholding selection procedure for multimodal and unimodal histograms, Pattern Recognition Lett., 16(6), 653– 666, doi:10.1016/0167-8655(95)80011-H. Tuller, M., R. Kulkarni, and W. Fink (2013), Segmentation of X-ray CT data of porous materials: A review of global and locally adaptive algorithms, in Soil–Water–Root Processes: Advances in Tomography and Imaging, SSSA Special Publications, vol. 61, edited by S. Anderson and J. Hopmans, pp. 157–182, Soil Sci. Soc. of Am., Madison, WI. van der Walt, S., J. L. Sch€ onberger, J. Nunez-Iglesias, F. Boulogne, J. D. Warner, N. Yager, E. Gouillart, T. Yu (2014), scikit-image: Image processing in Python, PeerJ PrePrints, 2, e336v2, doi:10.7287/peerj.preprints.336v2. Vincent, L., and P. Soille (1991), Watersheds in digital spaces: An efficient algorithm based on immersion simulations, IEEE Trans. Pattern Anal. Mach. Intel., 13(6), 583–598, doi:10.1109/34.87344. Vogel, H.-J., and A. Kretzschmar (1996), Topological characterization of pore space in soil sample preparation and digital image-processing, Geoderma, 73(1), 23–38. Vogel, H.-J., U. Weller, and S. Schl€ uter (2010), Quantification of soil structure based on Minkowski functions, Comput. Geosci., 36(10), 1236– 1245, doi:10.1016/j.cageo.2010.03.007. Wang, W., A. Kravchenko, A. Smucker, and M. Rivers (2011), Comparison of image segmentation methods in simulated 2D and 3D microtomographic images of soil aggregates, Geoderma, 162(3–4), 231–241, doi:10.1016/j.geoderma.2011.01.006. Wildenschild, D., and A. P. Sheppard (2013), X-ray imaging and analysis techniques for quantifying pore-scale structure and processes in subsurface porous medium systems, Adv. Water Resour., 51, 217–246, doi:10.1016/j.advwatres.2012.07.018. Wildenschild, D., J. Hopmans, C. Vaz, M. Rivers, D. Rikard, and B. Christensen (2002), Using X-ray computed tomography in hydrology: Systems, resolutions, and limitations, J. Hydrol., 267(3), 285–297. Wildenschild, D., J. Hopmans, M. Rivers, and A. Kent (2005), Quantitative analysis of flow processes in a sand using synchrotron-based X-ray microtomography, Vadose Zone J., 4(1), 112–126. Yanowitz, S., and A. Bruckstein (1989), A New Method for Image Segmentation, Comput. Vision Graphics Image Process., 46(1), 82–95. € SCHLUTER ET AL. C 2014. American Geophysical Union. All Rights Reserved. V 3639