Graphics Reference
In-Depth Information
1.3.2 Person dependency
Facial appearance variations are highly person-dependent‚ because the dif-
ferent people have different facial surface properties and different styles of
motion. To deal with the variations across different people‚ one approach is to
collect training data from a variety of people. Nonetheless‚ the amount of data
needed to achieve good results could be huge.
For a Lambertian object‚ the ratio image [Liu et al.‚ 2001a] of its two aligned
images removes its surface albedo dependency thus allowing illumination terms
(geometry and lighting) to be captured and transferred. Therefore‚ the subtle
appearance changes due to detailed facial motion can be captured independent
of the face albedo. Then‚ the appearance changes can be mapped to people with
different albedos.
1.3.3 Online appearance model
The Expectation-Maximization (EM) technique [Dempster et al.‚ 1977] is
a framework for optimization with partial information. It is widely applied
for computing maximum likelihood estimates for parameters in incomplete
data models. By alternating between an expectation step (E-step)‚ which finds
expected completions of data given the current parameterization‚ and a maxi-
mization step (M-step)‚ which re-estimates parameters on the basis of completed
data‚ the EM algorithm gradually improves the likelihood for the observed data
until convergence at a local maximum.
Using the EM framework‚ Jepson et al. [Jepson et al.‚ 2001] propose an
online appearance model‚ which is updated at every frame by the EM algorithm
for adapting to newly observed facial appearance variations. However‚ only
current stable mode of facial appearance is modeled and the non-rigid facial
motions are not interpreted by the model.
An analogy of the the appearance model adaption problem in speech recog-
nition domain is speaker adaptation. A good survey can be found in [Woodland‚
1999]. One of the most popular schemes is to adjust model parameters such
that the likelihood or posterior probability of new adaptation data is maximized.
The EM algorithm can usually be used for this maximization process‚ treating
the parameters to be adjusted as “missing” parameters of the model. One type
of approach‚ called Maximum Likelihood Linear Regression (MLLR) [Gales
and Woodland‚ 1996]‚ is to linearly transform the parameters of a speaker-
independent model such that the likelihood of the adaptation data of a particular
person is maximized. MLLR has the advantage that the same linear transfor-
mation can be used to update all parameters to be adjusted‚ even if the number
is large. If relatively few parameters to be adjusted‚ MLLR will be robust and
unsupervised adaptation can be used. In practice‚ research has shown that it is
effective to adjust the mean and covariance of the acoustic GMM model using
MLLR.
Search WWH ::




Custom Search