Matchmoving - Computer Vision for Visual Effects

Graphics Reference

In-Depth Information

X

x'

x

Figure 6.9. Once the camera matrices are estimated, the 3D point corresponding to a given

feature match can be estimated by triangulation. The point is chosen as the midpoint of the

shortest line segment connecting the two rays.

An alternate approach to the above is to simultaneously recover the internal

parameters and the relative rotation/translation for a stereo rig by showing it multi-

ple images of a planar calibration pattern (an extension of Section 6.3.2 described by

Malm and Heyden [ 312 ]). We can also estimate the camera parameters by observing

the changing positions of feature correspondences as the rig is moved freely though

a stationary environment ([ 202 , 109 ]); two stereo views are minimally required for

zero-skew cameras. The latter approach is fundamentally related to the algorithms

for image sequence calibration that we discuss in more detail in the next section.

6.5

IMAGE SEQUENCE CALIBRATION

We now have the ingredients to discuss the main topic of the chapter, the problem

of estimating the varying internal and external parameters of a camera as it moves

freely through an environment. This problem of image sequence calibration is the

core of the matchmoving process required for virtually any visual effects shot that

composites 3D computer-generated elements into footage from a moving camera.

In computer vision, the process is also called structure from motion , since we're

estimating the coordinates of 3D points corresponding to image features (i.e., struc-

ture) as the camera position is varied (i.e., motion). We assume the camera is always

translating between frames, so that any frame pair is related by a fundamental matrix.

Just as in the stereo case, image sequence calibration contains an inherent pro-

jective ambiguity analogous to Equation ( 6.30 ). Therefore, the first step is usually to

estimate a projective reconstruction of the cameras and scene points that matches

the image feature locations as well as possible. This projective reconstruction is

then “upgraded” to what is called a Euclidean or metric reconstruction that dif-

fers from the true configuration by an unknown similarity transformation. Again,

Euclidean reconstruction is not possible without some additional assumptions about

the camera calibrationmatrices or the structure of the environment—but inpractical

scenarios these assumptions are usually easy tomake. Once we have a good estimate

Search WWH ::

Custom Search

Home