Dense Correspondence and Its Applications - Computer Vision for Visual Effects

Graphics Reference

In-Depth Information

from the modern advances in optical flow and stereo correspondence discussed in

the rest of the chapter.

Recent research in both optical flow and stereo correspondence has substantially

benefited from high-quality ground-truthed data sets and benchmarks hosted at

Middlebury College. Reports by Baker et al. [ 27 ] (on optical flow) and Scharstein

and Szeliski [ 425 ] (on stereo) detail the data generation and testing methodol-

ogy that is now used as a worldwide benchmark for comparing new flow and

stereo algorithms. The high-quality datasets and constantly-updated benchmarks

for hundreds of algorithms are available at http://vision.middlebury.edu/flow/ and

http://vision.middlebury.edu/stereo/ .

Barron et al. [ 31 ] proposed an earlier benchmark for optical flow, now superceded

by the Middlebury database but responsible for driving much earlier research in

the field. Sun et al. [ 475 ] investigated the effects of several simple refinements to the

classical Horn-Schunck algorithm (such as the choice of robust penalty function, the

type of image interpolation, and theuse of amedianfilter in thewarping step) to arrive

at a set of recommendations and best practices that result in simple, competitive

optical flow algorithms. Brown et al. [ 70 ] gave a good survey of advances in stereo

algorithms, although many of the competitive algorithms discussed in Section 5.5

were proposed after this publication. While we didn't mention it in Section 5.3 , the

graph-cut stereomethods in Section 5.5.2 can be easily extended to estimate discrete

optical flow. In this case, the label at each pixel is multi-valued (i.e., a flow vector

(

instead of a disparity d ).

Szeliski et al. [ 485 ] surveyed methods to efficiently minimize cost functions of

the form of Equation ( 5.50 ), including the

u , v

)

-expansion and loopy belief propagation

methods discussed in Section 5.5 , as well as a promising variant of belief propagation

called tree-reweighted message passing [ 525 ]. They also provided efficient reference

implementations for the various algorithms at http://vision.middlebury.edu/MRF/ .

Felzenszwalb and Zabih [ 139 ] recently gave a good survey of dynamic programming

and graph-based techniques, with common applications to computer vision.

So far, our discussion of dense correspondence has beenmotivated almost entirely

as a 2D-to-2Dmatching problem. However, the remainder of the topic focuses on 3D

considerations, and we will see that disparity is often interpreted as an inverse depth

map . That is, objects that are closer to the pair of cameras have high disparities and

far-away objects have lowdisparities. 22 Recovering the actual depths froma disparity

map requires further physical knowledge about the camera setup, whichwediscuss in

thenext chapter. Ahigh-quality optical flowfieldor disparitymap cangreatly improve

the results of other visual effects algorithms, such as matting, and is at the heart of

algorithms for converting monocular films to stereo in post-production. Conversely,

an independent estimate of depth (e.g., sparse measurements from a low-resolution

range sensor) can definitely improve the results of a dense correspondence algorithm

(see Yang et al. [ 564 ]).

α

22 As a simple experiment, hold a finger close to your face and alternate winking your eyes. The

relative position of your finger changes substantially compared the the position of a fixed point in

the background. If your finger is centered in front of your nose, you will also observe the double

nail illusion — that is, a violation of the monotonicity constraint.

Computer Vision for Visual Effects

Search WWH ::

Custom Search

Home