Triangulation-Based Approaches to Three-Dimensional Scene Reconstruction - 3D Computer Vision: Efficient Methods and Applications

Graphics Reference

In-Depth Information

Fig. 1.16 ( a ) Pair of stereo images, acquired by two cameras with non-parallel optical axes.

( b ) Image pair rectified to standard geometry

W x

W c i +

λB − 1

S i

and

λ (s) B (s i − 1 S (s)

(1.98)

W x

W c i +

x ,

which directly implies the rectification equation

S (s)

λ (s) B (s)

B − 1

S i

(1.99)

(Fusiello et al., 2000 ).

At this point it is necessary to compute a grey value for each pixel of the rec-

tified image, where the pixel coordinates are integer-valued. The pixel coordinates

in the original image obtained according to ( 1.99 ), however, are real-valued. Ac-

cordingly, an interpolation technique such as bilinear interpolation has to be applied

(Fusiello et al., 2000 ). An example of a stereo image pair originally acquired with a

convergent camera setup and rectified to standard geometry is shown in Fig. 1.16 .

The first two coordinates of the projective vector S (s)

x correspond to the rectified

pixel values u (s)

v (s 1 of image 2

now form a corresponding pair of epipolar lines. As a consequence, the problem of

stereo image analysis becomes a problem of establishing corresponding points along

image rows. We will now regard the computation of three-dimensional scene struc-

ture from the coordinates of image points measured in stereo image pairs rectified

to standard geometry. Without loss of generality, at this point the coordinate system

of camera 1 is used as the world coordinate system, such that the corresponding

rotation matrix R 1 corresponds to the identity matrix and the translation vector t 1 is

zero, and we define t

and v (s)

.Therow v (s)

of image 1 and the row v (s)

between the optical centres of the two

cameras corresponds to the baseline distance of the stereo camera system. As the

images are rectified to standard geometry, both optical axes are orthogonal to the

baseline (Horn, 1986 ). Due to the rectification to standard geometry, both cameras

have the same effective principal distance b 0 .

The image coordinates in standard geometry are denoted by u (s)

≡

t 2 . The distance

and v (s)

in im-

age 1 and by u (s)

2 and v (s 2 in image 2, respectively. As it is assumed that the world

coordinate system corresponds to the coordinate system of camera 1, a scene point

can be described by

(x,y,z) T . We furthermore assume square pixels

with k u = k v , corresponding to a pixel edge length d p =

W x

C 1 x

1 /k u , and without skew.

On the basis of Horn ( 1986 ), ( 1.1 ) then implies the equations

3D Computer Vision: Efficient Methods and Applications

Search WWH ::

Custom Search

Home