Decomposed Contour Prior for Shape Recognition

advertisement
Decomposed Contour Prior for Shape Recognition
Zhi Yang, Yu Kong and Yun Fu
Department of CSE, SUNY at Buffalo, NY 14260, USA
{zhiyang, yukong7, yunfu}@buffalo.edu
Abstract
GolfSwing
Embedding Force
In this paper, we address the problem of representing
objects using contours for the purpose of recognition.
We propose a novel segmentation method for the integration of a novel contour matching energy into level
set based segmentation schemes. The contour matching
energy is represented by major components of Elliptic
Fourier shape descriptors and serves as a shape prior
to guide the curve evolution. The contours in training
dataset serve as templates and are utilized to match an
unknown image to infer its category. Our method is
evaluated on the UCF sports dataset and Caltech 101
dataset. Experiments show that our method achieves
promising recognition accuracy and is robust to noisy
low-level features and background clutter.
1. Introduction
Image segmentation and shape recognition in computer vision are usually driven both by low-level cues
such as intensity and texture, and by prior knowledge
about objects. Currently, modeling the interaction between data-driven and model-based processes has become the focus in image segmentation. In this paper, we
consider prior knowledge given by the shape of human
or object and focus on how to exploit such knowledge
for shape recognition.
Level set based contour representations have become
a popular framework for image segmentation [14, 5].
They allow us to elegantly model topological changes
of the implicitly represented boundary. Level set segmentation schemes can be formulated to utilize various
low-level image features such as intensity, color, texture, etc. However, as shown in [5], low-level features
can be misleading due to noise or background clutter.
To overcome this problem, some studies have investigated the integration of shape prior knowledge into
level set method. Leventon et al.[10] utilized shape
and pose information derived by principle component
Original
Picture
Region
Forces
Decomposed
Curvature
SKS
Converged
Contour
SwingBench
SKS
Contour
Converging
Skate
Boarding
SKS
Figure 1. Flowchart of our method.
analysis(PCA) on a given contour. Etyngier et al.[6]
proposed a method to learn shape using a non-linear
method. Unlike [10], the algorithm in [6] extracts shape
prior using manifold learning, and in particular, a diffusion map is constructed using Nyström extension. Cremers [3] proposed another non-linear shape prior using
autoregressive (AR) model to approximate linear combination. Jiang et al.[7] presented a TPS-RPM based
method to measure similarity between two points sets.
In this paper, we propose novel shape constraints to
guide contour evolution. The flowchart of our method
is shown in Fig.1. An object or a human is represented
by a set of Fourier coefficients. We introduce a new
curve energy function based on curvature to measure the
shape similarity between the target curve and an evolving curve. The curve energy is smoothly integrated with
region energy to guide segmentation. This makes the
segmentation process robust to noisy low-level features.
We use the obtained contours as templates for matching
and recognize human actions or objects from videos or
images.
2
2.1
Our Method
Contour Representation
The shape of an object can be effectively represented
by Fourier series on its contour [9]. In our work, we
to be segmented. We define the shape similarity of two
objects by the curvatures on corresponding segments.
Let C(s) be a curve parameterized by s, and Cψ≡0 (s)
be the target curve, and Cevolve (s) be the evolving curve.
Then the curvature of the target curve κψ and the evolving curve κC can be given by
κψ = O ·
OCevolve (s)
OCψ≡0 (s)
, κC = O ·
kOCψ≡0 (s)k
kOCevolve (s)k
κ
Figure 2. [Best viewed in color.] Contour
evolution with respect to target contours.
adopt the Elliptic Fourier descriptor (EFD) parameterized by s to represent object contour:
∞
2πns
2πn
2πns
∂x X 2nπ
=
an sin(
)+
bn cos(
)
−
∂s
T
T
T
T
n=1
∞
X
∂y
2nπ
2πns
2πn
2πns
−
=
cn sin(
)+
dn cos(
)
∂s n=1
T
T
T
T
where x and y are 2D coordinates on an image.
A contour in our work is described by the first N order Fourier coefficients Fig.2. These coefficients characterize the shape of an object and are utilized as the
shape prior to guide curve evolution. This type of shape
prior plays an important role in curve evolution and thus
facilitates the recognition task.
Motivated by [11], we use convex keypoints for initialization. The keypoints are defined as multi-scale organization of curvature zero crossing points of a planar
curve
∂2y
∂2x
=
0,
= 0,
∂s2
∂s2
The condition shows that a convex keypoint is an inflection point if the curve is continuous. With alignment of
such a set of convex points, our level-set based method
can start.
2.2
Contour Evolution
Our method is based on a region-based level set
scheme introduced by Chan and Vese [14]. Their
method generates a segmentation of an input image with
two gray value constants. However, their method relies
on low-level features and may fail to converge to the desired segmentation due to unfavorable background clutter. To cope with such degraded low-level information,
we propose a novel curvature matching energy function
Ecurve to guide contour evolution.
Our curvature matching energy function measures
shape similarity between the target object and the object
We use the ratio of curvature κ(s) = κCψ to denote the
shape similarity of a local segment s. Then the shape
matching energy is defined as
I
Ecurve = µ
|OH(ψ(C(s))κ(s)|ds,
(1)
ϕ(s)≡0
where H(z) is the Heaviside function: H(z) = 1 if
z > 0, otherwise H(z) = 0. ψ(C(s)) is used to find
corresponding segments between the target curve and a
evolving curve.
Shape energy Ecurve encodes the shape information
of an object and can provide the shape prior for segmentation. In this work, we incorporate this shape energy into the region energy to guide curve evolution.
Therefore, the total energy is expressed as
I
E = Ecurve + Eregion = µ
|OH(ψ(C(s))κ(s)|ds
ϕ(s)≡0
Z
+λ1
|µ0 (x, y) − c1 |2 H(ψ(x, y))dxdy
Ω1
Z
+λ2
|µ0 (x, y) − c2 |2 (1 − H(ψ(x, y)))dxdy
Ω2
(2)
where Eregion is the inhomogeneity energy [14], µ, λ1
and λ2 are parameters, Ω1 denotes the region enclosed
within Cϕ(s)≡0 (s), Ω2 is the region outside Ω1 .
2.3
Discussion
The advantage of the energy function in Eq.(2) is that
it can remove background noise caused by local shape
irregularities (see Fig.3).
Let Ê be an estimator. Suppose evolving contour
X is estimated by an observed contour vector Y and
Ê[X 2 ] < +∞ . We define X, Y1 ,..., Yn as random
process with finite second moments; all on the same
probability space. A guiding curve Y can be decomposed into harmonics of Fourier Series. Here we assume Ê[Yi ] = 0 for all i >= 1 and Ê[Yi Yj ] = 0 for
i 6= j, then
Êcurve [X|Y1 , Y2 , ..., Yn ] = Ê[X] +
n
X
Ê[X − E[X]|Yi ]
i=1
(3)
Our Method with Contour Curvature Decomposition
Figure 3. [Best viewed in color.] Contour
evolution to the target contours.
D−S
0.50
0.00
0.10
0.10
0.00
0.10
0.00
0.10
0.10
0.00
0.00
0.00
0.00
G−S−B
0.00
0.70
0.00
0.00
0.20
0.00
0.00
0.00
0.00
0.00
0.00
0.10
0.00
G−S−F
0.10
0.10
0.40
0.10
0.00
0.00
0.00
0.10
0.00
0.00
0.00
0.10
0.10
G−S−S
0.00
0.00
0.00
0.80
0.00
0.10
0.10
0.00
0.00
0.00
0.00
0.00
0.00
K−F
0.00
0.10
0.00
0.00
0.60
0.10
0.00
0.00
0.00
0.20
0.00
0.00
0.00
K−S
0.20
0.00
0.00
0.00
0.10
0.70
0.00
0.00
0.00
0.00
0.00
0.00
0.00
L
0.10
0.10
0.10
0.00
0.00
0.20
0.50
0.00
0.00
0.00
0.00
0.00
0.00
R−H
0.10
0.00
0.00
0.10
0.00
0.10
0.00
0.60
0.00
0.10
0.00
0.00
0.00
R−S
0.00
0.00
0.00
0.00
0.00
0.20
0.00
0.00
0.70
0.10
0.00
0.00
0.00
S−B−F
0.00
0.10
0.00
0.00
0.00
0.00
0.10
0.00
0.10
0.70
0.00
0.00
0.00
S−B
0.20
0.00
0.00
0.00
0.00
0.00
0.10
0.00
0.10
0.00
0.50
0.10
0.00
S−SA
0.00
0.10
0.00
0.10
0.10
0.10
0.00
0.00
0.10
0.00
0.00
0.50
0.00
W−F
0.00
0.00
0.10
0.00
0.00
0.00
0.00
0.00
0.00
0.20
0.00
0.00
0.70
K−F
K−S
L
R−H
R−S
S−B−F
S−B
S−SA
W−F
Evolving error is given by:
D−S
êcurve = X − Êcurve [X|Y1 , Y2 , ..., Yn ]
(4)
Recall that local shape irregularities are often represented by short period components in frequency domain. According to Eq.(2) and Eq.(4), these irregularities can be reduced during energy optimization.
2.4
Shape Matching
Many effective methods have been proposed to identify shape, such as Signed Distance Map (SDM) [10],
Curvature Scale Space (CSS) [8], etc. In our work, we
use the SKS algorithm [8] to compare two shapes. SKS
algorithm is a robust 2-d shape recognition algorithm
which uses the evidence accumulation.
Let feature vector v = (κ1 , κ2 , ..., κn ) be a vector of
scaled curvatures with respect to a reference point. The
similarity of two shapes can be computed by
sk = e
||v−vk ||
δ2
.
(5)
We use sk as the shape similarity and find the best
match for an unknown image from training dataset.
Figure 5. Confusion matrix of our method.
The confusion matrix of our method is displayed
in Figure 5. Our method achieves 60.77% recognition accuracy. Comparison results with other methods are shown in Table 1. Results indicate that our
method achieves the best overall accuracy over other
methods in comparison. The underlying reason is that
our method reduces local irregularity which essentially
is high-frequency noise. It decomposes the contour curvature into components and select the top N components. Thus high-frequency noise can be reduced.
To further investigate the noise reduction in our
method, we compare our method with Zhu and Chan’s
method [2]. We show intermediate results (the number of iterations is set to 100) in Figure 4. Our method
can efficiently remove background noise due to heterogeneity of local regions. The proposed method ignores high-frequency components which are sensitive
to background noise and thus achieves robust segmentation results. Therefore, our method achieves better
recognition accuracy.
3.2
3
Results on Caltech 101 Dataset
Experiment
We evaluate the proposed method, namely decomposed contour curvature prior (DCCP) with embedded
contour curvature prior (ECCP) on UCF sports dataset
and Caltech 101 dataset. We conduct two experiments
to evaluate our methods. First, we compare level set
based methods with DCCP and ECCP on UCF databset. Then segmentation results are compared on Caltech
101 object dataset.
3.1
G−S−B G−S−F G−S−S
Results on UCF-Sports Dataset
On this dataset, we compare the recognition accuracy of our method with 6 benchmark level set based
methods [12, 15, 14, 1, 13, 16]. Since these methods do
not have shape prior, to conduct a fair comparison, we
embed contour curvature prior ECCP into these methods using the embedding strategy described in [4].
We compare segmentation results with the method in
[2]. Results are shown in Figure 6. It can be seen that
our method is not sensitive to local heterogeneity of local regions. Our method selects components which are
insensitive to background noise while Zhu and Chan’s
method considers all high-frequency components which
would suffer from local irregularities.
4
Conclusion
We have proposed a decomposed contour curvature
prior for level set method for segmentation and shape
recognition. Our method ignores high-frequency components and can reduce background noise such as heterogeneity of local regions and irregularities. We use
segmentation results to conduct shape recognition task.
Results on two datasets show that our method is insensitive to background noise and achieves promising results
in recognition.
Table 1. Recognition results (%) of our method and methods in [12, 15, 14, 1, 13, 16].
Methods
Bernard’s [12] Casselles’s [15] Chan’s [14] Lankton’s [13] Li’s [1] Shi’s [16]
Ours
Accuracy
44.61
46.92
50.77
46.92
53.85
50.77
60.77
Figure 4. Contour evolution of our method (top row) and the method in [2] (bottom row).
Figure 6. Segmentation results of our
method (top row) and the method in [2]
(bottom row) on Caltech 101 dataset.
References
[1] J. G. Z. D. C. Li, C.Y. Kao. Minimization of region-scalable
fitting energy for image segmentation. IEEE Transactions on
Image Processing, 17(10):1940–1949, 2008.
[2] T. Chan and W. Zhu. Level set based shape prior segmentation.
In CVPR 05, pages 1164–1170, Washington, DC, USA, 2005.
IEEE Computer Society.
[3] D. Cremers. Nonlinear dynamical shape priors for level set
segmentation. In Computer Vision and Pattern Recognition,
2007. IEEE Conference on, pages 1–7, 2007.
[4] D. Cremers, T. Kohlberger, and C. Schnörr. Nonlinear shape
statistics in mumford-shah based segmentation. In Proceedings
of the 7th European Conference on Computer Vision-Part II,
ECCV ’02, pages 93–108, 2002. Springer-Verlag.
[5] D. Cremers, M. Rousson, and R. Deriche. A review of statistical approaches to level set segmentation: Integrating color,
texture, motion and shape. International Journal of Computer
Vision, 72(2):195–215, 2007.
[6] P. Etyngier, F. Sgonne, and R. Keriven. Active-contour-based
image segmentation using machine learning techniques. In
MICCAI (1)’07, pages 891–899, 2007.
[7] T. Jiang, F. Jurie, and C. Schmid. Learning shape prior models for object matching. In Conference on Computer Vision &
Pattern Recognition, jun 2009.
[8] K. Krish, S. B. Heinrich, W. E. Snyder, H. Cakir, and S. Khorram. Global registration of overlapping images using accumulative image features. Pattern Recognition Letters, 31(2):112–
118, 2010.
[9] F. P. Kuhl and C. R. Giardina. Elliptic fourier features of a
closed contour. Computer Graphics and Image Processing,
0
18(3):236–258, 1982.
[10] O. F. M. E. Leventon, W. E. L. Grimson. Statistical shape
influence in geodesic active contours. volume 1, pages 316–
323, 2000.
[11] M. S. D. M.A. Charmi, M.A. Mezghich and F. Ghorbel.
Geometric shape prior to region-based active contours using
fourier-based shape alignment. In 2010 IEEE International
Conference on: Imaging Systems and Techniques (IST), pages
478 – 481, 2010.
[12] P. T. M. U. O. Bernard, D. Friboulet. Variational b-spline
level-set: A linear filtering approach for fast deformable
model evolution. In IEEE Transactions on Image Processing,
18(06):1179–1191, 2009.
[13] A. T. S. Lankton. Localizing region-based active contours.
IEEE Transactions on Image Processing, 17(11):2029–2039,
2008.
[14] T.F.Chan and L.A.Vese. Active contours without edges. IEEE
Transactions on Image Processing, 10(2):266–277, 2001.
[15] R. K. V. Caselles and G. Sapiro. Geodesic active contours.
International Journal of Computer Vision, 22:61–79, 1997.
[16] W. C. K. Yonggang Shi. A real-time algorithm for the approximation of level-set-based curve evolution. In IEEE Transactions on Image Processing, 22(05):645–656, 2008.
Download