– 391 – by E. Th¨

advertisement
– 391 –
A Bayesian approach to inferring vascular tree structure from 2D imagery
by
E. Thönnes, A. Bhalerao, W.S. Kendall and R.G. Wilson
October 2001
DEPARTMENT OF STATISTICS
UNIVERSITY OF WARWICK
A BAYESIAN APPROACH TO INFERRING VASCULAR TREE STRUCTURE FROM
2D IMAGERY
Elke Thönnes , Abhir Bhalerao , Wilfrid Kendall , and Roland Wilson
Departments of Computer Science and Statistics
University of Warwick, UK
elke|wsk @stats.warwick.ac.uk
abhir|rgw @dcs.warwick.ac.uk
ABSTRACT
We describe a method for inferring tree-like vascular structures from 2D imagery. A Markov Chain Monte Carlo
(MCMC) algorithm is employed to sample from the posterior distribution given local feature estimates, derived from
likelihood maximisation for a Gaussian intensity profile. A
multiresolution scheme, in which coarse scale estimates are
used to initialise the algorithm for finer scales, has been implemented and used to model retinal images. Results are
presented to show the effectiveness of the method.
1. INTRODUCTION
The problem of inferring vascular structure from image data
is an important one, especially in the area of surgical planning, which requires both efficient computation and effective use of prior knowledge. Previous work in the area has
tended to focus on the modelling of specific vascular features [1] or to use approaches such as adaptive thresholding
[4].
The aim of the work described here is to formulate a
general method for the inference, which can be applied in
two or three dimensions and makes effective use of prior
knowledge, yet which is sufficiently general to be applied
to a wide range of problems. The common statistical methods for such medical image analysis have typically used
likelihood techniques, such as Expectation-Maximisation
(EM) [6, 5]. Although EM methods can be efficient computationally, they have only limited scope for incorporating prior knowledge. A more powerful way of including
prior information is to use a Bayesian method, such as maximum a posteriori (MAP) estimation. The principal difficulty with Bayesian techniques is computational: they normally require the use of Markov chain Monte Carlo algorithms, which may run for hundreds of thousands of iterations to yield reliable results [3]. This has restricted their
This work was supported by the UK EPSRC.
use in applications involving large data sets, such as medical
images.
The method we have adopted combines local likelihood
maximisation using a Gaussian model of the spatial intensity profile, on a multiresolution grid and global structure
determination using a Bayesian technique derived from a
general model of vasculature as a collection of tree structures. By approaching the problem in this way, we combine
the computational efficiency of likelihood techniques with
the power and generality of a Bayesian approach. After a
brief description of the estimation algorithms, we present
results of two dimensional structure inference on real data.
The paper is concluded with some observations on the technique.
2. A STATISTICAL MODEL OF VASCULAR
STRUCTURE
To draw inferences about global structure, we employ a
Bayesian formalism: the data are modelled as a random
tree-like structure and we then use an MCMC algorithm [2]
to sample from the posterior distribution, which is conditioned on the data. The sampling distribution is an approximate equilibrium of a random process whose configuration
space is the space of tree-like structures and whose equilibrium is designed to be the target conditional distribution.
As well as gaining information about the global structure,
variation in the posterior samples enables us to quantify uncertainties about the image interpretation.
The prior distribution defines the global structure as a forest of a random number of trees. Each such tree is binary:
branches divide only into two sub-branches at any given
position. A physical realisation of such a tree needs each
vertex to be located in space. Unfortunately, the simplistic approach of displacing each vertex from its parent by
a Gaussian displacement of zero mean (a “random-walk”
tree) leads to a tangled local structure (left hand figure 1).
We therefore introduce a correlation by allowing the mean
displacement to be a small linear multiple of the displacement of the parent vertex from the grandparent vertex (an
“AR(1)” tree) (right hand of figure 1). To model intensity,
with each vertex in the tree we associate a Gaussian kernel that represents the spatial grey level profile of the corresponding vessel segment.
of
The posterior distribution for a random number
trees is given by
+*$, . - - / 02123 5 4 6 "!$#&%(')
78 9;:<=
Fig. 1. Illustration of a random-walk tree and an AR(1) tree
(1)
>?
,
where is the image. The distribution of , penalises the number of trees; in the examples, a Poisson
distribution was used. This ensures that a ‘minimal’ expla are the probabilities
nation of the data is found.
%A@ %
%(B process and C D4 EGF
defining the degree of the branching
is the degree of the vertex 4 . , H- - / 0I1J3 54 is the
distribution of the parameters of the vertex, - . In the examples, this is an autoregressive (AR(1)) process. The parameters - represent the positions of vertices, K , and the
amplitude and width parameters of the edges, while 8 L:&
is the likelihood function. The observation model is based
on the approximation of linear structures by a sum of Gaussian kernels:
NM
=Q
;
K 9"!$#PO
KSR
K parent +* ETK &V U)
(2)
where is a magnitude scale factor and K =V the mean
O
and covariance
parameters of the kernel. Rather than sample
directly on the covariance parameters, they are sampled in
a normalised form, in terms of the orientation and the standard deviations of their principal components. The orientation is defined by the line joining a vertex with its parent.
The likelihood is then computed based on a normal model
of the observations within a region
K;
M
A
K WEYX KA
graft) and the chances of each move are balanced against
its opposite, it is straightforward to compute the required
equilibrium distribution and to design the move probabilities to give the required posterior as equilibrium, using the
Metropolis-Hastings technique (MH) [3]. The MH method
iterates in two steps: the first proposes a move from the collection of moves and the second accepts or rejects the move
so as to ensure that the equilibrium coincides with the desired posterior. The randomised decision whether to accept
or reject a move depends on whether the resulting new tree
will be a more adequate representation from the posterior
than the current tree.
The efficiency of the algorithm depends crucially on the
proposed moves. As all moves influence the likelihood only
locally, the likelihood evaluation can be implemented efficiently. Global structure that can be inferred with high
certainty locally will lead to a tree structure that is stable
over time, while low local certainty results in a volatile treestructure that alternates between different explanations for
the global structure. A summary of the moves currently implemented for the algorithm is shown in figure 2.
Each iteration of the sampler consists of the random selection of one such move and its random acceptance or rejection, according to posterior probability.
3. EXPERIMENTS
(3)
where X K; respresents the noise in the observations, which
again is assumed to be normally distributed, with zero mean
and independent at each pixel. The simulation, whose configuration at any one time is a collection of random trees,
is designed to have moves which take it from one configuration to the next. In order to arrive at a Maximum a Posteriori
(MAP) estimate from a given image, it is necessary to sample from the above posterior distribution. This is done by
a simulation involving a number of different moves, affecting the state of the forest at each iteration. As long as each
move has an ‘opposite’ (e.g. add versus delete, split versus
Figure 3 illustrates results of the ML model estimation. We
have used part of a 2D retinal angiographic image size 404
7
404 pixels (fig. 3(a)) for our 2D experiments. To improve the efficiency of the estimation, we start from a set of
maximum likelihood local feature estimates, based on the
Gaussian kernel of (3), within a multiresolution framework,
combined with a piecewise linear model of the background
intensity to account for the obvious large scale variation in
intensity across the images such as fig. 3(a). In other words,
each region in a quadtree tessellation of the image is modelled as the sum of a linear background term and a Gaussian
intensity profile, whose parameters are estimated using an
structure. After the first 200 iterations, the scale is halved
and the appropriate local feature estimates are used to guide
the sampler; after 3000, the scale is halved again, as it is
after 6000 iterations, at which point, the highest spatial resolution is reached. This approach has been found to speed
convergence to the equilibrium distribution, while avoiding
becoming trapped in local modes, in a similar manner to
many coarse-fine algorithms. Note that one iteration consists of the generation and acceptance of a single proposal
(for ‘editing’ the tree). In other words, 100000 iterations
is comparable, in terms of computation, to a single scan
through the image. It has been noted from experiments that
equilibrium is reached in approximately 50000 iterations, a
comparatively low burden computationally.
4. CONCLUSIONS
Fig. 2. Moves used in MCMC simulation on trees.
iterative, EM-type algorithm to maximise likelihood
O
M
KA Q
K2R
K &V M
K R
B S
Q
K =V (4)
K M
V U M
M
KA
KA Q
K R
5. REFERENCES
K; Q
K R
K &V KSR
K K R
K KA Q
M
K; Q
K R
K Some encouraging preliminary results have been achieved
using the approach described in section 2, demonstrating
its potential for modelling vascular structure globally in a
computationally efficient way. Fine-tuning the algorithm
will lead to significant improvements. These will include,
for example, the use of the local estimates to produce initial
configurations for the MCMC algorithm. Such improvements are currently being implemented. The work is also
being tested on other types of data and extended to three
dimensions.
K &V (5)
K2R
K &V (6)
where the data K; are windowed with a cosine window,
whose size is twice the block width at a given scale, to
give the data KA used in the estimator. The index denotes iteration number; typically 4-5 iterations are sufficient to give accurate estimates. Figures 3(b)-(c) show
reconstructions using the 2D Gaussians in each block (at
corresponding block sizes) based on the ML feature estimates at block sizes of 7 and F 7 F respectively.
Clearly, at lower spatial resolutions, the model cannot easily describe the presence of multiple vessels within the window, such as occur at bifurcations, and the resulting lowamplitude, isotropic Gaussians are locally the ‘best’ description of these regions. However, these blocks can be
modelled accurately at higher spatial resolutions. The second set of images shows how the local estimates from different window scales is used in a coarse-fine stochastic simulation, in order to get a Bayesian estimate of the forest
[1] P. Datlinger A. Pinz, S. Bernogger and A. Kruger. Mapping the human retina. IEEE Trans. Medical Imaging,
17(1):606–619, 1998.
[2] S.P. Brooks. The Markov Chain Monte Carlo Method
and its Application. The Statistician, 47:69 – 100, 1998.
[3] W. R. Gilks, S. Richardson, and D. J. Spiegelhalter.
Markov Chain Monte Carlo in Practice. Chapman &
Hall, 1996.
[4] M. E. Martinez-Perez, A. D. Hughes, A. V. Stanton,
A. S. Thom, A. A. Bharath, and K. H. Parker. Segmentation of retinal blood vessels based on the second
directional derivative and region growing. In Proc. of
IEEE ICIP-99, pages 173–176, Kobe, Japan, 1999.
[5] W. M. Wells, R. Kikinis, W. E. L. Grimson, and
R. Jolesz. Adaptive segmentation of MRI data. IEEE
Trans. Medical Imaging, 15:429–442, 1996.
[6] D. L. Wilson and J. A. Noble. An adaptive segmentation algorithm for extracting arteries and aneurysms
from time-of-flight MRA data. IEEE Trans. Medical
Imaging, 18(10):938–945, 1999.
(a)
(a)
(b)
(b)
(c)
(c)
Fig. 3. (a) 2D retinal angiogram size 404 7 404 pixels.
(b)
model parameters estimates
Reconstruction
for block sizesof ofdata64from
and (c) 16.
Fig. 4. Estimates from the tree-based sampler, at (a) 200, (b)
1000 and (c) 50000 iterations, showing how use is made of
the local multiresolution feature estimates, in a coarse-fine
approach to the MCMC algorithm. After 50000 iterations,
few changes occur.
Other University of Warwick Department of Statistics
Research Reports authored or co–authored by W.S. Kendall.
161: The Euclidean diffusion of shape.
162: Probability, convexity, and harmonic maps with small image I: Uniqueness and fine existence.
172: A spatial Markov property for nearest–neighbour Markov point processes.
181: Convexity and the hemisphere.
202: A remark on the proof of Itô’s formula for C 2 functions of continuous
semimartingales.
203: Computer algebra and stochastic calculus.
212: Convex geometry and nonconfluent Γ-martingales I: Tightness and
strict convexity.
213: The Propeller: a counterexample to a conjectured criterion for the
existence of certain convex functions.
214: Convex Geometry and nonconfluent Γ-martingales II: Well–posedness
and Γ-martingale convergence.
216: (with E. Hsu) Limiting angle of Brownian motion in certain two–
dimensional Cartan–Hadamard manifolds.
217: Symbolic Itô calculus: an introduction.
218: (with H. Huang) Correction note to “Martingales on manifolds and
harmonic maps.”
222: (with O.E. Barndorff-Nielsen and P.E. Jupp) Stochastic calculus, statistical asymptotics, Taylor strings and phyla.
223: Symbolic Itô calculus: an overview.
231: The radial part of a Γ-martingale and a non-implosion theorem.
236: Computer algebra in probability and statistics.
237: Computer algebra and yoke geometry I: When is an expression a tensor?
238: Itovsn3: doing stochastic calculus with Mathematica.
239: On the empty cells of Poisson histograms.
244: (with M. Cranston and P. March) The radial part of Brownian motion
II: Its life and times on the cut locus.
247: Brownian motion and computer algebra (Text of talk to BAAS Science
Festival ’92, Southampton Wednesday 26 August 1992, with screenshots of illustrative animations).
257: Brownian motion and partial differential equations: from the heat equation to harmonic maps (Special invited lecture, 49th session of the ISI,
Firenze).
260: Probability, convexity, and harmonic maps II: Smoothness via probabilistic gradient inequalities.
261: (with G. Ben Arous and M. Cranston) Coupling constructions for hypoelliptic diffusions: Two examples.
280: (with M. Cranston and Yu. Kifer) Gromov’s hyperbolicity and Picard’s
little theorem for harmonic maps.
292: Perfect Simulation for the Area-Interaction Point Process.
293: (with A.J. Baddeley and M.N.M. van Lieshout) Quermass-interaction
processes.
295: On some weighted Boolean models.
296: A diffusion model for Bookstein triangle shape.
301: COMPUTER ALGEBRA: an encyclopaedia article.
308: Perfect Simulation for Spatial Point Processes.
319: Geometry, statistics, and shape.
321: From Stochastic Parallel Transport to Harmonic Maps.
323: (with E. Thönnes) Perfect Simulation in Stochastic Geometry.
325: (with J.M. Corcuera) Riemannian barycentres and geodesic convexity.
327: Symbolic Itô calculus in AXIOM: an ongoing story.
328: Itovsn3 in AXIOM: modules, algebras and stochastic differentials.
331: (with K. Burdzy) Efficient Markovian couplings: examples and counterexamples.
333: Stochastic calculus in Mathematica: software and examples.
341: Stationary countable dense random sets.
347: (with J. Møller) Perfect Metropolis-Hastings simulation of locally stable
point processes.
348: (with J. Møller) Perfect implementation of a Metropolis-Hastings simulation of Markov point processes
349: (with Y. Cai) Perfect simulation for correlated Poisson random variables conditioned to be positive.
350: (with Y. Cai) Perfect implementation of simulation for conditioned
Boolean Model via correlated Poisson random variables.
353: (with C.J. Price) Zeros of Brownian Polynomials.
371: (with G. Montana) Small sets and Markov transition densities.
382: (with A. Brix) Simulation of cluster point processes without edge effects.
391: (with E. Thönnes, A. Bhalerao, R.G. Wilson) A Bayesian approach to
inferring vascular tree structure from 2D imagery.
392: (with A. Bhalerao, E. Thönnes, R.G. Wilson) Inferring vascular tree
structure from 2D and 3D imagery.
Also see the following related preprints
317: E. Thönnes: Perfect Simulation of some point processes for the impatient user.
334: M.N.M. van Lieshout and E. Thönnes: A Comparative Study on the
Power of van Lieshout and Baddeley’s J-function.
359: E. Thönnes: A Primer on Perfect Simulation.
366: J. Lund and E. Thönnes: Perfect Simulation for point processes given
noisy observations.
If you want copies of any of these reports then please email your requests to
the secretary using statistics@warwick.ac.uk (mail address: the Department of Statistics, University of Warwick, Coventry CV4 7AL, UK).
Download