Matchmoving - Computer Vision for Visual Effects

Graphics Reference

In-Depth Information

cameras

observed

feature locations

scene points

Figure 6.10. Bundle adjustment for projective reconstruction. We want to adjust the “bundles”

of rays emanating fromeach camera to the scene points in order to bring the estimated projections

onto the image plane as close as possible to the observed feature locations. The reprojection error

corresponds to the sum of squared distances between every observation and the intersection of

the corresponding line and image plane.

equations ( 6.36 ) into a large matrix equation:

11 x 11

12 x 12

···

1 n x 1 n

P 1

P 2

P m

X 1 X 2

21 x 21

22 x 22

···

2 n x 2 n

···

(6.38)

X n

. . .

m 1 x m 1

m 2 x m 2

···

mn x mn

Here, the

λ ij are the unknown scalar multiples such that the

∼

in Equation ( 6.36 )is

an equality:

P i X j . These are called the projective depths , and the matrix of

feature locations on the left-hand side of Equation ( 6.38 ) is called the measurement

matrix .

Since Equation ( 6.38 ) expresses the 3 m

ij x ij

n measurement matrix as the product of

a3 m

4 matrix containing the cameras

{

P i

}

anda4

n matrix containing the scene

points

, we can see this large matrix has rank at most four. This suggests a natural

factorization algorithm based on the SVD. That is, given a guess for the projective

depths, we form the measurement matrix on the left-hand side of Equation ( 6.38 )

(call it M ) and determine the SVD M

{

X j

}

UDV . U is 3 m

n , V is n

n , and D is a

n diagonal matrix of singular values, which from this reasoning should ideally

only have four nonzero elements. Therefore, we define D 4 as the left-hand n

matrix of D . We estimate the 3 m

4 matrix of cameras on the right-hand side of

Equation ( 6.38 )as UD 4 and the 4

n matrix of scene points on the right-hand side of

Equation ( 6.38 ) as the first four rows of V .

As for previous algorithms, normalization of the data prior to applying the algo-

rithm is critical to get good results if the data is noisy. A simple approach is to first

normalize the feature locations in each image in the usual way (i.e., apply a similarity

transformation to each image plane so that the features have zero mean and average

distance from the origin of √ 2). Next, we rescale each row of M to have unit norm,

Computer Vision for Visual Effects

Search WWH ::

Custom Search

Home