defines a mapping from the low-dimensional representation to the high-dimensional one. This mapping can be written as
\[
\mathbf{x} = \sum_i w_i f_i(\mathbf{c}) + \boldsymbol{\varepsilon}, \qquad (2.5)
\]
where the $w_i$ are the weights of the possibly nonlinear functions $f_i$ of the low-dimensional representation of the manifold $\mathbf{c}$, and $\boldsymbol{\varepsilon}$ is an additive noise term. By placing a simple Gaussian prior on the weights $w_i$, they can be marginalized out. This yields a multivariate Gaussian conditional density for the data, which can be written as
\[
p(X \mid C, \theta) = \frac{1}{\sqrt{(2\pi)^{ND}\,|K|^{D}}} \exp\left( -\frac{1}{2} \operatorname{tr}\left( K^{-1} X X^{T} \right) \right), \qquad (2.6)
\]
where $X$ and $C$ are the matrices containing the $N$ $D$-dimensional training examples and their latent representations, respectively. $K$ is a positive-definite covariance matrix whose elements are obtained by evaluating a kernel function $k$, such that $K_{i,j} = k(\mathbf{c}_i, \mathbf{c}_j)$. This kernel function is entirely defined by its hyperparameters $\theta$, which are optimized at training together with the latent variables $C$, so as to maximize $p(X \mid C, \theta)\,p(C)\,p(\theta)$.
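As a concrete illustration, the marginal log-likelihood in Eq. (2.6) can be evaluated numerically for given latents and hyperparameters. The following is a minimal sketch, not the original authors' code; the RBF kernel, its hyperparameter values, and the small jitter term added for numerical stability are all assumptions:

```python
import numpy as np

def rbf_kernel(C, lengthscale=1.0, variance=1.0):
    """K[i, j] = k(c_i, c_j) for an RBF kernel (one possible choice of k)."""
    sq = np.sum(C**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * C @ C.T
    return variance * np.exp(-0.5 * d2 / lengthscale**2)

def gplvm_log_likelihood(X, C, lengthscale=1.0, variance=1.0, jitter=1e-3):
    """log p(X | C, theta) as in Eq. (2.6), with jitter on the diagonal."""
    N, D = X.shape
    K = rbf_kernel(C, lengthscale, variance) + jitter * np.eye(N)
    _, logdetK = np.linalg.slogdet(K)                 # log |K|
    trace_term = np.trace(np.linalg.solve(K, X @ X.T))  # tr(K^{-1} X X^T)
    return -0.5 * (N * D * np.log(2 * np.pi) + D * logdetK + trace_term)

# In a full GPLVM, C and the hyperparameters would be optimized to
# maximize this quantity together with the priors p(C) and p(theta).
X = np.random.default_rng(0).standard_normal((10, 3))  # 10 examples, D = 3
C = np.random.default_rng(1).standard_normal((10, 2))  # latent dimension 2
ll = gplvm_log_likelihood(X, C)
```

In practice the optimization is done on this log form rather than on the density itself, since the determinant and exponent would otherwise underflow for realistic $N$ and $D$.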
At inference, the predictive distribution $p(\mathbf{x}^* \mid \mathbf{c}^*, C, X)$ of a new deformation $\mathbf{x}^*$ given its latent representation $\mathbf{c}^*$ is a Gaussian with mean and variance
\[
\mu(\mathbf{c}^*) = X^{T} K^{-1} \mathbf{k}^{*}, \qquad (2.7)
\]
\[
\sigma^{2}(\mathbf{c}^*) = k(\mathbf{c}^*, \mathbf{c}^*) - \mathbf{k}^{*T} K^{-1} \mathbf{k}^{*}, \qquad (2.8)
\]
where $\mathbf{k}^{*} \in \mathbb{R}^{N \times 1}$ is the vector containing the covariance function evaluated between the training and the test data. This has the advantage of modeling the uncertainty in the output space, which accounts for the high or low density of training examples in different regions of the space. As a consequence, it makes it possible to build a prior for the shape and its latent representation.
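Eqs. (2.7) and (2.8) can be sketched in a few lines of NumPy. This is a hedged illustration rather than the authors' implementation: the RBF kernel, its unit lengthscale, and the jitter term are assumed choices:

```python
import numpy as np

def rbf(A, B, lengthscale=1.0):
    """Pairwise RBF covariances between the rows of A and B."""
    d2 = np.sum((A[:, None, :] - B[None, :, :])**2, axis=-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

def predict(X, C, c_star, jitter=1e-3):
    """Predictive mean (2.7) and variance (2.8) at a test latent c_star."""
    N = C.shape[0]
    K = rbf(C, C) + jitter * np.eye(N)   # N x N training covariance
    k_star = rbf(C, c_star[None, :])     # N x 1 train/test covariance vector
    alpha = np.linalg.solve(K, k_star)   # K^{-1} k*
    mean = (X.T @ alpha).ravel()         # Eq. (2.7): X^T K^{-1} k*
    var = rbf(c_star[None, :], c_star[None, :])[0, 0] \
        - (k_star.T @ alpha)[0, 0]       # Eq. (2.8): k(c*,c*) - k*^T K^{-1} k*
    return mean, var

rng = np.random.default_rng(0)
X = rng.standard_normal((8, 3))          # 8 training deformations, D = 3
C = rng.standard_normal((8, 2))          # latent dimension 2
mean, var = predict(X, C, np.zeros(2))
```

Note that the variance grows toward $k(\mathbf{c}^*, \mathbf{c}^*)$ as the test latent moves away from the training latents, which is exactly the density-dependent uncertainty discussed above.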
Several extensions of the original GPLVM have been proposed. For instance, to extend the GPLVM to motion data, the Gaussian Process Dynamical Model (GPDM) Wang et al. [2005] was introduced. The GPDM makes it possible to model nonlinear relationships between the latent variables corresponding to consecutive frames in a sequence. In a different context, to overcome the burden of evaluating the kernel function between each pair of training latent variables, and thus of having a computation time cubic in $N$, sparse representations were proposed Lawrence [2007]. In this sparse GPLVM, the kernel is defined in terms of a much smaller number of inducing variables. This makes the GPLVM practical for problems involving many degrees of freedom, and therefore requiring large training sets, as is the case for deformable surfaces.
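The complexity argument can be made concrete with a generic low-rank (Nyström-style) construction from inducing points; this is only an illustrative sketch of the idea, not Lawrence's sparse GPLVM, whose formulation differs in detail, and the RBF kernel and point counts are assumptions:

```python
import numpy as np

def rbf(A, B, lengthscale=1.0):
    """Pairwise RBF covariances between the rows of A and B."""
    d2 = np.sum((A[:, None, :] - B[None, :, :])**2, axis=-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

rng = np.random.default_rng(0)
C = rng.standard_normal((500, 2))            # N = 500 training latents
Z = C[rng.choice(500, 20, replace=False)]    # M = 20 inducing points, M << N

# Full GP: factorizing the N x N matrix K costs O(N^3).
# Low-rank alternative: only K_nm (N x M) and K_mm (M x M) are needed,
# and the dominant cost drops to O(N M^2).
K_nm = rbf(C, Z)                             # N x M cross-covariances
K_mm = rbf(Z, Z) + 1e-6 * np.eye(20)         # M x M inducing covariance
K_approx = K_nm @ np.linalg.solve(K_mm, K_nm.T)  # rank-M approximation of K
```

The quality of the approximation depends on how well the inducing points cover the latent space; in sparse GPLVMs their locations are typically optimized rather than subsampled as here.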
2.2.2 LEARNED MODELS FOR NONRIGID MODELING
In Computer Vision, linear learning techniques quickly became very popular. The original Active Shape Models Cootes and Taylor [1992] were extended to full 2D Active Appearance Models (AAM) Cootes et al. [1998], Matthews and Baker [2004] to track 2D face deformations. In this case, the model is separated into shape and texture components, both modeled as linear combinations of basis vectors. Adaptations of this approach were also proposed to group appearance and shape in