CSCE643: Computer Vision
Mean-Shift Object Tracking
Jinxiang Chai
Many slides from Yaron Ukrainitz, Bernard Sarel, and Robert Collins

Appearance-based Tracking
(Slide from Robert Collins)

Review: Lucas-Kanade Tracking/Registration
• Key idea #1: Formulate tracking/registration as a nonlinear least-squares minimization problem:
  $\arg\min_p \sum_x \left[ I(w(x;p)) - H(x) \right]^2$
• Key idea #2: Solve the problem with Gauss-Newton optimization.

Review: Gauss-Newton Optimization
• Linearize the warped image around the current parameters p:
  $\arg\min_{\Delta p} \sum_x \left[ I(w(x;p)) + \nabla I \,\frac{\partial w}{\partial p}\,\Delta p - H(x) \right]^2$
• Rearrange into a linear least-squares problem in $\Delta p$:
  $\arg\min_{\Delta p} \sum_x \left[ \nabla I \,\frac{\partial w}{\partial p}\,\Delta p - \big(H(x) - I(w(x;p))\big) \right]^2$
  which has the form $\|A\,\Delta p - b\|^2$.
• Closed-form solution, $\Delta p = (A^T A)^{-1} A^T b$:
  $\Delta p = \left( \sum_x \left[\nabla I \,\frac{\partial w}{\partial p}\right]^T \left[\nabla I \,\frac{\partial w}{\partial p}\right] \right)^{-1} \sum_x \left[\nabla I \,\frac{\partial w}{\partial p}\right]^T \big(H(x) - I(w(x;p))\big)$

Lucas-Kanade Registration
Initialize p = p0. Iterate:
1. Warp I with w(x;p) to compute I(w(x;p))
2. Compute the error image H(x) - I(w(x;p))
3. Warp the gradient $\nabla I$ with w(x;p)
4. Evaluate the Jacobian $\partial w / \partial p$ at (x;p)
5. Compute $\Delta p$ using a linear system solver
6. Update the parameters: $p \leftarrow p + \Delta p$

Object Tracking
• Can we apply Lucas-Kanade techniques to non-rigid objects (e.g., a walking person)?

Motivation of Mean-Shift Tracking
• To track non-rigid objects (e.g., a walking person), it is hard to specify an explicit 2D parametric motion model.
• Appearances of non-rigid objects can sometimes be modeled with color distributions.

Mean-Shift Tracking
• The mean-shift algorithm is an efficient approach to tracking objects whose appearance is defined by histograms.
• Not limited to color only: edge orientation, texture, or motion histograms can be used as well.
(Slide from Robert Collins)

Mean-Shift Tracking: Demos

Mean Shift [Che95, FH75, Sil86]
• An algorithm that iteratively shifts a data point to the average of the data points in its neighborhood.
• Similar to clustering.
• Useful for clustering, mode seeking, probability density estimation, tracking, etc.
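To make this definition concrete, here is a minimal illustrative sketch (not from the slides) of the basic mean-shift iteration with a flat window: the point is repeatedly moved to the center of mass of the samples inside its window. The function name, window radius, and convergence threshold are assumptions made for the example.

```python
import numpy as np

def mean_shift_point(x, data, h, max_iters=100, tol=1e-5):
    """Iteratively shift the point x to the average of the data points
    that fall inside a window of radius h around it (flat kernel)."""
    for _ in range(max_iters):
        dists = np.linalg.norm(data - x, axis=1)   # distance to every sample
        neighbors = data[dists <= h]               # points inside the window
        if len(neighbors) == 0:
            break
        new_x = neighbors.mean(axis=0)             # center of mass of the window
        if np.linalg.norm(new_x - x) < tol:        # stopped moving: reached a mode
            return new_x
        x = new_x
    return x

# Toy example: two clusters; a start point near the second cluster drifts to its mode
rng = np.random.default_rng(0)
data = np.vstack([rng.normal(0.0, 1.0, (200, 2)), rng.normal(5.0, 1.0, (200, 2))])
mode = mean_shift_point(np.array([4.0, 4.0]), data, h=1.5)
```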
Mean Shift: References
• Y. Cheng. Mean shift, mode seeking, and clustering. IEEE Trans. on Pattern Analysis and Machine Intelligence, 17(8):790–799, 1995.
• K. Fukunaga and L. D. Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Trans. on Information Theory, 21:32–40, 1975.
• B. W. Silverman. Density Estimation for Statistics and Data Analysis. Chapman and Hall, 1986.

Mean Shift Theory

Intuitive Description
• Think of a distribution of identical billiard balls. Objective: find the densest region.
• Place a region of interest, compute the center of mass of the balls inside it, and shift the region along the mean-shift vector (from the region's center to the center of mass).
• Repeat until the region stops moving; it then sits on the densest region.

What is Mean Shift?
A tool for finding modes in a set of data samples, where the samples manifest an underlying probability density function (PDF) in $R^N$.
• The PDF lives in a feature space: color space, scale space, or actually any feature space you can conceive.
• Non-parametric density estimation: a discrete PDF representation built from the data.
• Non-parametric density GRADIENT estimation (mean shift): PDF analysis without reconstructing the full PDF.

Non-Parametric Density Estimation
• Assumption: the data points are sampled from an underlying PDF.
• Data point density implies PDF value: regions with many samples correspond to high values of the assumed underlying PDF.

Parametric Density Estimation
• Assumption: the data points are sampled from an underlying PDF of known form, e.g. a mixture of Gaussians,
  $\mathrm{PDF}(x) = \sum_i c_i \exp\!\left(-\frac{(x-\mu_i)^2}{2\sigma_i^2}\right)$,
  whose parameters are estimated from the real data samples.

Kernel Density Estimation: Parzen Windows
• Estimate the PDF as a function of a finite number of data points $x_1, \ldots, x_n$:
  $P(x) = \frac{1}{n}\sum_{i=1}^{n} K(x - x_i)$
• In practice one uses kernels of the form
  $K(\mathbf{x}) = c \prod_{i=1}^{d} k(x_i)$  (the same 1-D function on each dimension), or
  $K(\mathbf{x}) = c\,k(\|\mathbf{x}\|)$  (a function of the vector length only).

Kernel Density Estimation: Various Kernels
• Epanechnikov kernel: $K_E(\mathbf{x}) = c\,(1 - \|\mathbf{x}\|^2)$ if $\|\mathbf{x}\| \le 1$, and 0 otherwise.
• Uniform kernel: $K_U(\mathbf{x}) = c$ if $\|\mathbf{x}\| \le 1$, and 0 otherwise.
• Normal kernel: $K_N(\mathbf{x}) = c\,\exp\!\left(-\tfrac{1}{2}\|\mathbf{x}\|^2\right)$.

Kernel Density Estimation: Gradient
• Give up estimating the PDF itself; estimate ONLY its gradient.
• Use the radially symmetric kernel form $K(x - x_i) = c\,k\!\left(\left\|\frac{x - x_i}{h}\right\|^2\right)$, where h is the window size.
• Writing $g(x) = -k'(x)$ and $g_i = g\!\left(\left\|\frac{x - x_i}{h}\right\|^2\right)$, we get (with constant factors absorbed into c):
  $\nabla P(x) = \frac{c}{n}\sum_{i=1}^{n}\nabla k_i = \frac{c}{n}\left[\sum_{i=1}^{n} g_i\right]\left[\frac{\sum_{i=1}^{n} x_i\, g_i}{\sum_{i=1}^{n} g_i} - x\right]$

Computing the Mean Shift
• The first bracket is yet another kernel density estimate (using the profile g instead of k).
• The second bracket is the mean-shift vector: the difference between the weighted mean of the samples in the window and the window center x.
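Before turning to the procedure itself, here is a small illustrative sketch (not from the slides) of the Parzen-window estimate with the Epanechnikov profile. The function names are assumptions, and the normalization constant c is dropped since only the locations of the modes matter for mean shift.

```python
import numpy as np

def epanechnikov_profile(r2):
    """Profile k(r^2) of the Epanechnikov kernel: 1 - r^2 inside the unit ball, 0 outside."""
    return np.where(r2 <= 1.0, 1.0 - r2, 0.0)

def parzen_density(x, data, h):
    """Parzen-window estimate P(x) = (1/n) * sum_i K((x - x_i)/h) with a radially
    symmetric kernel K(u) = c * k(||u||^2); the constant c is omitted."""
    r2 = np.sum(((x - data) / h) ** 2, axis=1)   # ||(x - x_i)/h||^2 for every sample
    return epanechnikov_profile(r2).mean()

# Example: evaluate the density estimate at one query point
data = np.random.randn(500, 2)
value = parzen_density(np.array([0.0, 0.0]), data, h=0.5)
```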
Simple Mean-Shift Procedure
• Compute the mean-shift vector
  $m(x) = \frac{\sum_{i=1}^{n} x_i\, g\!\left(\left\|\frac{x - x_i}{h}\right\|^2\right)}{\sum_{i=1}^{n} g\!\left(\left\|\frac{x - x_i}{h}\right\|^2\right)} - x$,  with $g(x) = -k'(x)$.
• Translate the kernel window by m(x), and repeat until convergence.

Mean-Shift Mode Detection
• What happens if we reach a saddle point? Perturb the mode position and check whether we return to it.
• Updated mean-shift procedure:
  1. Find all modes using the simple mean-shift procedure.
  2. Prune modes by perturbing them (to detect saddle points and plateaus).
  3. Prune nearby modes: keep the highest mode in the window.

Mean Shift: Strengths and Weaknesses
Strengths:
• An application-independent tool, suitable for real data analysis.
• Does not assume any prior shape (e.g., elliptical) for the data clusters.
• Can handle arbitrary feature spaces.
• Only ONE parameter to choose, and h (the window size) has a physical meaning, unlike the K in K-means.
Weaknesses:
• Selecting the window size (bandwidth) is not trivial.
• An inappropriate window size can cause modes to be merged, or can generate additional "shallow" modes.
• Remedy: use an adaptive window size.

Mean Shift Applications

Mean-Shift Object Tracking
• D. Comaniciu, V. Ramesh, and P. Meer. Real-time tracking of non-rigid objects using mean shift. In IEEE Proc. on Computer Vision and Pattern Recognition, pages 673–678, 2000. (Best paper award)
• Journal version: Kernel-Based Object Tracking, PAMI, 2003.

Non-Rigid Object Tracking
• Applications: real-time surveillance, driver assistance, object-based video compression.

Mean-Shift Object Tracking: General Framework, Target Representation
• Choose a reference model in the current frame.
• Choose a feature space.
• Represent the model in the chosen feature space.

Mean-Shift Object Tracking: General Framework, Target Localization
• Start from the position of the model in the current frame.
• Search in the model's neighborhood in the next frame.
• Find the best candidate by maximizing a similarity function.
• Repeat the same process for the next pair of frames.

Mean-Shift Object Tracking: Target Representation
• Choose a reference target model and a feature space (e.g., a quantized color space).
• Represent the model by its PDF (histogram) in the chosen feature space.
[Figure: histogram of probability vs. quantized color bins 1..m]
(Kernel-Based Object Tracking, Comaniciu, Ramesh, Meer)

Mean-Shift Object Tracking: PDF Representation
• Target model (centered at 0): $q = \{q_u\}_{u=1..m}$, with $\sum_{u=1}^{m} q_u = 1$.
• Target candidate (centered at y): $p(y) = \{p_u(y)\}_{u=1..m}$, with $\sum_{u=1}^{m} p_u(y) = 1$.
• Similarity function: $f(y) = f[p(y), q]$.

Mean-Shift Object Tracking: Smoothness of the Similarity Function
• Problem: if the target is represented by color information only, spatial information is lost; f is not smooth in y (large similarity variations for adjacent locations), so gradient-based optimization is not robust.
• Solution: mask the target with an isotropic kernel in the spatial domain; f(y) then becomes a smooth function of y.

Mean-Shift Object Tracking: Finding the PDF of the Target Model
• Target pixel locations: $\{x_i\}_{i=1..n}$ (model centered at 0, candidate centered at y).
• k(x): a differentiable, isotropic, convex, monotonically decreasing kernel profile, so that peripheral pixels, the ones most affected by occlusion and background interference, receive smaller weights.
• $b(x_i)$: the color bin index (1..m) of pixel $x_i$.
• Probability of feature u in the model:
  $q_u = C \sum_{i=1}^{n} k(\|x_i\|^2)\,\delta[b(x_i) - u]$
• Probability of feature u in the candidate centered at y:
  $p_u(y) = C_h \sum_{i=1}^{n} k\!\left(\left\|\frac{y - x_i}{h}\right\|^2\right)\delta[b(x_i) - u]$
• C and $C_h$ are normalization factors; the kernel values act as pixel weights.
[Figure: model and candidate histograms over quantized color bins 1..m]
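As an illustration of the formula for $q_u$, here is a minimal sketch. It assumes the Epanechnikov profile, pixel coordinates already normalized so the patch fits inside the unit circle, zero-based bin indices, and hypothetical function and argument names; it is not the authors' implementation.

```python
import numpy as np

def target_model_histogram(pixel_xy, bin_index, m):
    """Kernel-weighted color histogram of the target model,
    q_u = C * sum_i k(||x_i||^2) * delta[b(x_i) - u],
    with the Epanechnikov profile k(r) = 1 - r for r <= 1.

    pixel_xy  : (n, 2) pixel coordinates relative to the patch center,
                divided by the patch half-size (unit-circle normalization)
    bin_index : (n,) color bin b(x_i) in 0..m-1 for each pixel
    m         : number of color bins
    """
    r2 = np.sum(pixel_xy ** 2, axis=1)                  # ||x_i||^2
    k = np.where(r2 <= 1.0, 1.0 - r2, 0.0)              # Epanechnikov profile weights
    q = np.bincount(bin_index, weights=k, minlength=m)  # sum the weights per color bin
    return q / q.sum()                                  # normalize so sum_u q_u = 1
```

The candidate histogram $p_u(y)$ is computed the same way, with the coordinates taken relative to the candidate center y and scaled by the bandwidth h.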
Mean-Shift Object Tracking: Similarity Function
• Target model: $q = (q_1, \ldots, q_m)$; target candidate: $p(y) = (p_1(y), \ldots, p_m(y))$.
• Which similarity function $f(y) = f[p(y), q]$ should we use? The Bhattacharyya coefficient:
  $f(y) = \sum_{u=1}^{m} \sqrt{p_u(y)\,q_u}$
• Geometric interpretation: $f(y) = \cos\theta_y$, the cosine of the angle between the m-dimensional unit vectors $\left(\sqrt{p_1(y)}, \ldots, \sqrt{p_m(y)}\right)$ and $\left(\sqrt{q_1}, \ldots, \sqrt{q_m}\right)$.

Mean-Shift Object Tracking: Target Localization Algorithm
• Start from the position of the model (q) in the current frame.
• Search in the model's neighborhood in the next frame, evaluating candidates p(y).
• Find the best candidate by maximizing the similarity function:
  $\arg\max_y f[p(y), q]$

Mean-Shift Object Tracking: Function Optimization
• $\arg\max_y f[p(y), q]$, or equivalently $\arg\min_y \big(1 - f[p(y), q]\big)$.
• This is similar to Lucas-Kanade registration: we could use gradient-based optimization techniques to solve the problem.
• Mean shift, however, provides an efficient way to optimize this particular function.

Mean-Shift Object Tracking: Approximating the Similarity Function
• Model location: $y_0$; candidate location: y.
• Linear approximation of $f(y) = \sum_{u=1}^{m}\sqrt{p_u(y)\,q_u}$ around $y_0$:
  $f(y) \approx \frac{1}{2}\sum_{u=1}^{m}\sqrt{p_u(y_0)\,q_u} + \frac{1}{2}\sum_{u=1}^{m} p_u(y)\sqrt{\frac{q_u}{p_u(y_0)}}$
• Substituting $p_u(y) = C_h \sum_{i=1}^{n} k\!\left(\left\|\frac{y - x_i}{h}\right\|^2\right)\delta[b(x_i) - u]$, the first term is independent of y and the second term becomes
  $\frac{C_h}{2}\sum_{i=1}^{n} w_i\, k\!\left(\left\|\frac{y - x_i}{h}\right\|^2\right)$,  with weights $w_i = \sum_{u=1}^{m}\sqrt{\frac{q_u}{p_u(y_0)}}\,\delta[b(x_i) - u]$.
• This second term is a weighted kernel density estimate as a function of y.

Mean-Shift Object Tracking: Maximizing the Similarity Function
• The sought maximum is the mode of $\frac{C_h}{2}\sum_{i=1}^{n} w_i\, k\!\left(\left\|\frac{y - x_i}{h}\right\|^2\right)$.
• Important assumption: the target representation provides sufficient discrimination, so there is only one mode in the searched neighborhood.

Mean-Shift Object Tracking: Applying Mean Shift
• Original mean shift: find the mode of $c\sum_{i=1}^{n} k\!\left(\left\|\frac{y - x_i}{h}\right\|^2\right)$ using
  $y_1 = \frac{\sum_{i=1}^{n} x_i\, g\!\left(\left\|\frac{y_0 - x_i}{h}\right\|^2\right)}{\sum_{i=1}^{n} g\!\left(\left\|\frac{y_0 - x_i}{h}\right\|^2\right)}$
• Extended mean shift: find the mode of $c\sum_{i=1}^{n} w_i\, k\!\left(\left\|\frac{y - x_i}{h}\right\|^2\right)$ using
  $y_1 = \frac{\sum_{i=1}^{n} x_i\, w_i\, g\!\left(\left\|\frac{y_0 - x_i}{h}\right\|^2\right)}{\sum_{i=1}^{n} w_i\, g\!\left(\left\|\frac{y_0 - x_i}{h}\right\|^2\right)}$

Mean-Shift Object Tracking: About Kernels and Profiles
• A special class of radially symmetric kernels: $K(\mathbf{x}) = c\,k(\|\mathbf{x}\|^2)$, where k is called the profile of K, and $g(x) = -k'(x)$.

Mean-Shift Object Tracking: Choosing the Kernel
• Epanechnikov profile: $k(x) = 1 - x$ if $x \le 1$, and 0 otherwise.
• Its negative derivative is the uniform profile: $g(x) = 1$ if $x \le 1$, and 0 otherwise.
• With the Epanechnikov kernel, the mean-shift update therefore reduces to a simple weighted average of the pixel positions inside the window:
  $y_1 = \frac{\sum_{i=1}^{n} x_i\, w_i}{\sum_{i=1}^{n} w_i}$

Mean-Shift Object Tracking: Adaptive Scale
• Problem: the scale of the target changes over time, so the scale h of the kernel must be adapted.
• Solution: run the localization 3 times with different values of h and choose the h that achieves the maximum similarity.

Mean-Shift Object Tracking: Results
• Feature space: 16×16×16 quantized RGB.
• Target: manually selected in the 1st frame.
• Average number of mean-shift iterations: 4.
• The tracker handles partial occlusion, distraction, and motion blur.

Mean-Shift Object Tracking: Summary
• Key idea #1: Formulate the tracking problem as a nonlinear optimization that maximizes appearance/histogram consistency between the target q and the candidate p(y):
  $\arg\max_y f[p(y), q]$
• Key idea #2: Solve the optimization problem efficiently with mean-shift techniques.

Mean-Shift Object Tracking: Discussion
• How do we deal with scaling and rotation?
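To tie the pieces together, here is a minimal sketch of one mean-shift localization iteration under the Epanechnikov-kernel simplification derived above. It is illustrative rather than the authors' implementation: the function and variable names and the small numerical guards are assumptions, and zero-based bin indices are used.

```python
import numpy as np

def mean_shift_localization_step(y0, pixel_xy, bin_index, q, m, h):
    """One localization iteration: build the candidate histogram p(y0),
    form the pixel weights w_i = sqrt(q_u / p_u(y0)) with u = b(x_i),
    and move to the weighted mean of the pixel positions. With the
    Epanechnikov kernel, g is the flat profile, so the update is
    y1 = sum_i x_i w_i / sum_i w_i over the pixels inside the window."""
    r2 = np.sum(((pixel_xy - y0) / h) ** 2, axis=1)     # ||(y0 - x_i)/h||^2
    inside = r2 <= 1.0                                  # pixels inside the kernel window
    k = np.where(inside, 1.0 - r2, 0.0)                 # Epanechnikov profile values

    p = np.bincount(bin_index, weights=k, minlength=m)  # candidate histogram p_u(y0)
    p /= max(p.sum(), 1e-12)

    w = np.sqrt(q[bin_index] / np.maximum(p[bin_index], 1e-12))  # per-pixel weights w_i
    w = np.where(inside, w, 0.0)                        # g(.) = 1 inside, 0 outside

    return (pixel_xy * w[:, None]).sum(axis=0) / max(w.sum(), 1e-12)  # new center y1
```

In practice this step is repeated until the displacement between successive centers falls below a threshold (typically a few iterations), and the scale adaptation described above is layered on top by rerunning it for several values of h.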