Game Development Reference
In-Depth Information
discrete event systems. The 3D shape of the body parts is reconstructed by
selectively integrating the apparent contours from three mutually orthogonal
views.
Several methods are proposed for 3D motion recovery from monocular images.
DiFranco et al. (2001) describe a method for computing the 3D motion of
articulated models from 2D correspondences. The authors use kinematic
constraints based on a 3D kinematic model, joint angle limits, dynamic smoothing
and 3D key frames which can be specified by the user. In Sminchisescu & Triggs
(2001), the authors propose a method for recovering 3D human body motion from
monocular video sequences using robust image matching, joint and non-self-
intersection constraints. To reduce correspondence ambiguities, the authors use
a matching cost metric that combines robust optical flow, edge energy, and
motion boundaries. In Howe et al. (2000), the authors present a 3D motion
capture via a single camera. The method depends on prior knowledge about
human motion to resolve the ambiguities of the 2D projection.
A geometric model is an approximation of the shape and of the deformations of
the object. This model can be two-dimensional (modeling the contours of the
projections of the object in the images), or three-dimensional (modeling the
surfaces of the object). 2D shape models are generally made of curves, snakes,
segments, sticks, etc., whereas 3D shape models are either systems of rigid
bodies (spheres, superquadrics, etc.) or deformable surfaces (mesh). The
articulations may be modeled by joints or by the motion of control points of B-
splines. The choice between a 2D or a 3D model depends on the application, e.g.,
needed precision, number of cameras, and type of motion to be recognized.
2D
Several researchers work with 2D features to recognize human movement.
Gavrila (1999), Goddard (1994) and Guo et al. (1994) use model-based recog-
nition techniques, namely stick-figures, for this purpose. Other researchers who
used 2D models are Papageorgiu & Poggio (1999), Comaniciu et al. (2000) and
Isard & MacCormick (2001). Wachter & Nagel (1999) proposed a method to
track the human body in monocular sequences. Their method depends on contour
information and moving regions between frames.
Most of the work in this area is based on the segmentation of different body parts.
Wren et al. (1999) proposed a system, Pfinder, to track people in 2D by using
blobs that represent different body parts. The system uses a Maximum A
Posteriori Probability (MAP) approach to detect and track people. The authors
extend their work to obtain 3D estimates of the hands and head by applying two
Pfinder algorithms (Wren et al., 2000). Pfinder uses blob features to detect a
Search WWH ::




Custom Search