MPEG video compression - The MPEG

Figure 5.43: In MPEG-4 four types of objects are coded.

Using two-dimensional warping, a still texture object can be made to move. In three-dimensional graphic rendering,

mesh coding allows an arbitrary solid shape to be created which is then covered with texture. Perspective

computation then allows this three-dimensional object to be viewed in correct perspective from any viewpoint.

MPEG-4 provides tools to allow two- or three-dimensional meshes to be created in the decoder and then oriented

by vectors. Changing the vectors then allows realistic moving images to be created with an extremely low bit rate.

Face and body animation is a specialized subset of three-dimensional mesh coding in which the mesh represents a

human face and/or body. As the subject moves, carefully defined vectors carry changes of expression which allow

rendering of an apparently moving face and/or body which has been almost entirely synthesized from a single still

picture.
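
The idea of orienting a mesh by vectors can be illustrated with a minimal two-dimensional sketch. It assumes a single triangular mesh cell, moves its vertices by per-vertex vectors, and carries a texture point along using barycentric weights. The function names and the sample coordinates are hypothetical and are not taken from the MPEG-4 syntax.

```python
def barycentric(p, a, b, c):
    """Return the barycentric weights of point p in triangle (a, b, c)."""
    det = (b[1] - c[1]) * (a[0] - c[0]) + (c[0] - b[0]) * (a[1] - c[1])
    w0 = ((b[1] - c[1]) * (p[0] - c[0]) + (c[0] - b[0]) * (p[1] - c[1])) / det
    w1 = ((c[1] - a[1]) * (p[0] - c[0]) + (a[0] - c[0]) * (p[1] - c[1])) / det
    return w0, w1, 1.0 - w0 - w1


def move_vertices(vertices, vectors):
    """Displace each mesh vertex by its own vector (hypothetical helper)."""
    return [(x + dx, y + dy) for (x, y), (dx, dy) in zip(vertices, vectors)]


def warp_point(p, src_tri, dst_tri):
    """Map a texture point from the original triangle to the moved one."""
    w0, w1, w2 = barycentric(p, *src_tri)
    x = w0 * dst_tri[0][0] + w1 * dst_tri[1][0] + w2 * dst_tri[2][0]
    y = w0 * dst_tri[0][1] + w1 * dst_tri[1][1] + w2 * dst_tri[2][1]
    return (x, y)


# The still texture is attached to this triangle once.
source = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]
# Per picture, only the vertex vectors need to change.
vectors = [(0.0, 0.0), (2.0, 0.0), (0.0, 0.0)]
moved = move_vertices(source, vectors)

edge_point = warp_point((2.0, 0.0), source, moved)
inner_point = warp_point((1.0, 1.0), source, moved)
```

In this sketch the texture is described once and each new picture needs only three small vectors, which is the reason the approach can work at a very low bit rate.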

In addition to object coding, MPEG-4 refines the existing MPEG tools by increasing the efficiency of a number of

processes using lossless prediction. This improves the performance of the motion compensation and coefficient

coding, allowing either a lower bit rate or improved quality. MPEG-4 also extends the idea of scaleability introduced

in MPEG-2. Multiple scaleability is supported, where a low-bit-rate base-level picture may optionally be enhanced

by adding information from one or more additional bitstreams. This approach is useful in network applications

where the content creator cannot know the bandwidth which a particular user will have available. Scaleability

allows the best quality in the available bandwidth.
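
As a rough illustration of how a receiver might exploit scaleability, the sketch below picks the base layer plus as many enhancement bitstreams as fit in the bandwidth available. It assumes that layers must be taken in order, since each enhancement builds on the ones below it. The function name and the bit rates are hypothetical.

```python
def select_layers(layer_rates_kbps, available_kbps):
    """Choose the base layer and consecutive enhancement layers that fit.

    layer_rates_kbps[0] is the base layer; later entries are enhancement
    layers. Returns the chosen layer indices and their total bit rate.
    """
    chosen = []
    total = 0
    for index, rate in enumerate(layer_rates_kbps):
        if total + rate > available_kbps:
            break  # a higher layer is of no use without the ones below it
        chosen.append(index)
        total += rate
    return chosen, total


# Hypothetical rates: a 64 kbit/s base layer and two enhancement layers.
rates = [64, 128, 256]
narrow_user = select_layers(rates, 200)
wide_user = select_layers(rates, 1000)
```

The content creator encodes once; each user then decodes whatever subset of layers their connection can carry.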

Although most of the spatial compression of MPEG-4 is based on the DCT as in earlier MPEG standards, MPEG-4

also introduces wavelet coding of still objects. Wavelets are advantageous in scaleable systems because they

naturally decompose the original image into various resolutions.
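
The way a wavelet transform yields several resolutions can be sketched with a one-dimensional Haar transform. The Haar filter is chosen here only because it is the simplest wavelet; it is an assumption for illustration and is not claimed to be the filter MPEG-4 specifies. Each level produces a half-resolution approximation plus the detail needed to restore the level above.

```python
def haar_step(signal):
    """Split a signal into a half-length approximation and detail."""
    approx = [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    return approx, detail


def haar_decompose(signal, levels):
    """Apply haar_step repeatedly; details are stored finest level first."""
    approx = list(signal)
    details = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details


def haar_reconstruct(approx, details):
    """Rebuild the signal, adding one level of detail at a time."""
    signal = list(approx)
    for detail in reversed(details):
        restored = []
        for a, d in zip(signal, detail):
            restored.extend([a + d, a - d])
        signal = restored
    return signal


samples = [8, 6, 2, 4, 5, 5, 1, 3]
coarse, details = haar_decompose(samples, 2)
rebuilt = haar_reconstruct(coarse, details)
```

A decoder that stops after the coarse approximation has a quarter-resolution signal; each set of detail coefficients it adds doubles the resolution, which is why the decomposition suits a scaleable system.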

5.20 Video objects

Figure 5.44 shows an example of a video object intersecting video object planes, or VOPs. At each plane, the

shape and the texture of the object must be portrayed. Figure 5.45 shows that this can be done using appropriate

combinations of intra- and inter-coding as described for the earlier standards. This gives rise to I-VOPs, P-VOPs

and B-VOPs. A group of VOPs is known as a GOV.
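
The relationship between the three VOP types in a GOV can be sketched as follows. The sketch assumes the usual conventions of the earlier standards: an I-VOP uses no reference, a P-VOP is predicted from the previous I- or P-VOP, and a B-VOP is predicted from the nearest I- or P-VOP on either side. The function name and the example pattern are hypothetical, and the VOPs are listed in display order.

```python
def vop_references(types):
    """Map each VOP index to the indices of the VOPs it is predicted from.

    types is a display-order string such as "IBBP".
    """
    anchors = [i for i, t in enumerate(types) if t in "IP"]
    refs = {}
    for i, t in enumerate(types):
        past = [a for a in anchors if a < i]
        future = [a for a in anchors if a > i]
        if t == "I":
            refs[i] = []                         # intra-coded, no reference
        elif t == "P":
            refs[i] = past[-1:]                  # previous anchor only
        else:
            refs[i] = past[-1:] + future[:1]     # anchors on both sides
    return refs


gov = "IBBP"
references = vop_references(gov)
```

Because the B-VOPs here depend on the later P-VOP, that P-VOP would have to be sent before them, as in the earlier standards.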
