Dense Correspondence and Its Applications - Computer Vision for Visual Effects

Graphics Reference

In-Depth Information

dense correspondence across the whole image is to make the support of the window

large enough (for example, to make

large if we're using a Gaussian) to ensure

both eigenvalues are far from zero. Using large windows may not do a very good job

of estimating flow, since as we observed in Figure 4.11 , large square windows are

unlikely to remain square in the presence of camera motion.

On the other hand, the Lucas-Kanade algorithm is local, in that the flow vector

can be computed at each pixel independently, while the Horn-Schunck algorithm

is global, since all the flow vectors depend on each other through the differential

equations ( 5.22 ). This makes the Lucas-Kanade problem computationally easier to

solve. Since the linear system in Equation ( 5.25 ) also resulted from the assumption

that the flow vectors

are small, a pyramidal implementation can be used in the

same way as in the previous section to handle large motions [ 55 ].

(

u , v

)

5.3.3

Refinements and Extensions

Many modern optical flow algorithms modify Equation ( 5.21 ) in some way — for

example, by changing the data term that measures deviation from the brightness

constancy assumption, the smoothness term that measures deviation from prior

expectations of realistic flow fields, or the formof the cost functions used to combine

the two terms. In this section, we briefly describe common modifications; many of

the best-performing algorithms combine several of these modifications.

5.3.3.1 Changes to the Data Term

The original Horn-Schunck data term encapsulates the assumption that pixel inten-

sities don't change over time, which isn't realistic for many real-world scenes. Uras

et al. [ 506 ] proposed instead the gradient constancy assumption that

∇

(

u , y

v , t

) =∇

(

x , y , t

)

(5.26)

where

is the spatial gradient. This allows some local variation to illumination

changes; consequently, Brox et al. [ 74 ] proposed a modified data term reflecting

both brightness and gradient constancy:

∇

x , y (

2 (5.27)

E data

(

u , v

) =

I 2

(

u , y

) −

I 1

(

x , y

))

+ γ ∇

I 2

(

u , y

) −∇

I 1

(

x , y

)

where

is around 100). Note that

Equation ( 5.27 ) directly expresses the deviation from the constancy assumptions,

insteadof using theTaylor approximation inEquation ( 5.17 ) that is only valid for small

u and v . Xu et al. [ 558 ] claimed that it was better to use either the brightness constancy

or gradient constancy assumption at each pixel (but not both), and introduced a

binary switch variable for this purpose. An alternate approach is to explicitly model

an affine change in brightness at each pixel, as proposed by Negahdaripour [ 346 ] (in

which case these parameters also need to be estimated and regularized).

weights the contribution of the terms (typically

Computer Vision for Visual Effects

Search WWH ::

Custom Search

Home