Graphics Reference
In-Depth Information
5.5
STEREO CORRESPONDENCE
Suppose that the two images I 1 and I 2 for which we want to compute dense corre-
spondence are taken at the same time instant by two cameras in different positions.
Thus, we can estimate the epipolar geometry and rectify the image pair, as described
in the previous section. In this case, the dense correspondence problem reduces to
the stereo correspondence problem, one of the most well-studied problems in com-
puter vision. We can think of stereo correspondence as a special case of optical flow,
with a few key differences:
The dense correspondence field is specified by one number at each pixel,
the disparity . The disparity is simply defined as x
x for a correspondence
x , y ) }
. 16 Note that y =
{ (
x , y
)
,
(
y since the images are already rectified. An
image d
(
x , y
)
formed from estimates of the disparity d at each point
(
x , y
)
I 1
is called a disparity map for I 1 .
Unlike optical flow, where
takes on continuous values, the disparity
values in stereo are conventionally quantized , most frequently into units of
pixels. The physical configuration of the cameras also introduces a maximum
range of possible disparity values. The set of possible disparity values being
discrete and finite allows us to use powerful, efficient optimization methods
based on graph cuts and belief propagation.
(
u , v
)
Occlusions in the scene are often explicitly modeled in stereo algorithms as
opposed to implicitly modeled as in optical flow.
In addition to an analogue of the smoothness assumption from optical
flow, stereo algorithms sometimes (but not always) introduce a monotonic-
ity assumption stating that corresponding points appear in the same order
along matching scanlines. This simplifies the problem even further at the
risk of poorly modeling certain real-world scenes containing thin objects (see
Section 17 ).
Stereo algorithms sometimes (but not always) assume that additional calibra-
tion information is known about the cameras that acquired the images. This
camera calibration results in immediate, high-quality estimates of the epipo-
lar geometry and maximum disparity range. However, feature matches can
typically be used to estimate the epipolar geometry to set up a stereo problem,
as described in the previous section.
We must always remember that stereo algorithms assume that the images are
acquired simultaneously , which means there can be no non-rigid motion of
the scene or independent motion of objects between the two images.
We now discuss the main approaches to the stereo correspondence problem,
which we will see have a very different character than methods for optical flow. In
their overview of optical flow algorithms, Baker et al. [ 27 ] noted that the current top
16 We conventionally assume I 1 is to the left of I 2 with respect to the scene. Therefore, a point in I 1
should appear to be further to the right than its matching position in I 2 (see Figure 5.11 b), and
we define the disparity to be the positive number x
x . Note that we're assuming a Cartesian
coordinate system for pixels (i.e., the x axis is horizontal and the y axis is vertical).
 
Search WWH ::




Custom Search