Archit Rathore Arpit Agarwal Masare Akshay Sunil

advertisement

Gaussian Process Dynamic Models for

Human Motion

Wang, Fleet, Hertzmann

Presented by Archit Rathore

Arpit Agarwal

Masare Akshay Sunil

B

A

Gaussian Process Dynamics

x

1 x

2 x

3 x

4

X x t

= f(x t-1

,A) + n x,t y t

= g(x t

,B) + n y,t y

1 y

2 y

3 y

4

Y p(X,Y,α,β,W) = p(Y|X,β,W) p(X|α) p(α) p(β) p(W)

Conditional GPDM:

Given the learned model Τ = {Y,X,α,β,W}, the distribution over a new sequence Y {*} and its associated latent trajectory X {*} is given by: p (Y {*} ,X {*} |T) = p (Y {*} |X {*} ,T) p (X {*} |T) p (Y {*} ,X {*} |T) = |W| M exp(-0.5 tr(K -1

Y{*}

√(2π) MD |K

Y{*}

| D

Z

Y

W 2 Z

Y

T )) p (X {*} |T) = p (x

1

(*) ) exp(-0.5 tr(K -1

X{*}

Z

X

Z

Y

Z

X

T ))

√(2π) (M-1)d |K

X{*}

| d

Skeletal Model

The skeleton used is a simplified version of the default skeleton in the CMU mocap database. The numbers in parentheses indicate the number of DOFs for the joint directly above the labeled body node in the kinematic tree.

Each pose is defined by 44 Euler angles for joints, three global (torso) pose angles, and three global

(torso) translational velocities.

The data is mean subtracted.

GPDM Learning

MAP Estimation: Minimizing the -ve log-posterior of the unknowns -ln p (X,α,β,W|Y) given by:

L = L

Y

+ L

X

+ Σ ln β j

+ 1/(2κ 2 ) tr(W 2 ) + Σ ln α j

L

Y

= D/2 ln|K

Y

|+ 0.5

tr(K

Y

-1 YW 2 Y T ) - N ln|W|

L

X

= d/2 ln|K

X

|+ 0.5tr(K -1

X

X

2:N

X T

2:N

)+ 0.5x

1

T x

1

Balanced GPDM: The L

X

term penalizes unsmooth trajectory, therefore increasing weight on L

X during optimization encourages smoothness. The weight used is D/d - the weight between dimensions of data and latent spaces.

Manually specified hyperparameters: Smooth trajectories can also be ensured by selecting α by hand by examining samples from p(X|α).

Two-stage Map estimation: First estimate hyparameters Θ={X,α,β,W} from unknown latent trajectories X, then estimate X while holding Θ fixed.

PCA Initializer Latent Coordinates from GPLVM

Dynamic Predictive Distribution by HMC

Mean Prediction Sample

Latent Coordinates from GPDM

High Confidence Latent Space Random Trajectories Longer Random Trajectories

GOLF SWING MODEL

Dynamic Predictive Distribution by HMC

Mean Prediction Sample

Latent Coordinates from GPLVM Latent Coordinates from GPDM High Confidence Latent Space

FOUR WALKER MODEL

Random Trajectories

Dynamic Predictive Distribution by HMC

Mean Prediction Sample

Latent Coordinates from GPLVM Latent Coordinates from GPDM High Confidence Latent Space Random Trajectories

Samples stay close to training data in Golf Swing Model and form Clumpy Point Clusters in Four Walker Model

Walk Synthesized by taking the mean of the Predictive Distribution

MAP Two-Stage MAP

Six Walks Synthesized by Sampling from the Predictive Distribution

B-GPDM

Applications:

Monocular Human Tracking:

The technique presented in the paper is used for human tracking and synthesizing high quality poses from mocap data. The smoothed trajectories obtained from the GPDM approach can transfer the cost of high quality camera equipment to the relatively cheaper computational cost for estimating the parameters.

Computer Animation:

The prevalence of mocap data to produce realistic animations leads to the need to estimate the non linear manifold on which the high dimensional human motion data lies on.

The approach paves way for motion synthesis subject to sparse user constraints. This can also be used to estimate missing frames from data or reduce impact of noise from sensors when translating the video data to animation.

Download