Advances of MPEG Scalable Video Coding Standard - Intelligent Multimedia Data Hiding

Cryptography Reference

In-Depth Information

motion compensated filtering to achieve the temporal scalability. Other possi-

ble structures will be discussed in Sec. 3.3.4. To achieve the spatial scalability,

the 2-D wavelet decomposition is done for each frame after the MCTF. To

further reduce the bit rate and generate a scalable bit-stream, the wavelet

coe cients are coded using a scalable entropy coder such as zero-tree coder

or other types of arithmetic coder. By means of proper extraction techniques

a single bit-stream with scalable spatial, temporal and SNR parameters can

be produced. Similar wavelet video coding schemes but with specific features

have been suggested by several other researchers [20, 21].

Historically, the wavelet-based schemes fell short in video quality in the

MPEG standard competition in 2004. An ad hoc group in MPEG with a

coined name VidWav was formed and has continued to study the wavelet

video coding technique for the past two years. Based on the software originally

developed by Microsoft Research Asia, this group produced a common sim-

ulation platform for testing and improving wavelet video algorithms [22, 23].

Additional reports on the recent VidWav group activities are summarized in

[24] and [25].

In the following subsections, we give a brief overview of the basic con-

cepts of the wavelet video scheme. The MPEG VidWav evaluation software

will often be used for illustration. Although there are a number of on-going

research works outside MPEG, our description focuses only on the MPEG

related wavelet video projects. An interesting phenomenon is that over the

past 3 years, several tools such as MCTF originally proposed for the wavelet

schemes are now included in the AVC-based SVC with an open-loop concept.

These tools have been discussed in the previous sections and we will minimize

the overlap here. Some AVC tools such as the AVC base-layer are tested under

the wavelet structure and the Variable Block-Size Motion Compensation.

3.3.1 Temporal Scalability

To provide the temporal scalability, the wavelet-based scheme also adopts the

MCTF as described in Section 3.2.1. The MCTF concept was first proposed

for wavelet coding [5]. Typically, Haar or (5, 3) wavelets are used.

To improve the accuracy of the motion field so as to improve coding per-

formance, several techniques for better motion estimation have been proposed

in [26, 27]. Particularly, in [27], a novel structure, known as Barbell-Lifting,

is presented. The basic idea is to use a barbell function for generating the

prediction/update values in the lifting structure. Specifically, for each pixel in

the high-pass frame, the prediction value is obtained by using a set of pixels

as the input to the barbell function. It has been shown that the prediction

using the barbell function offers a superior performance when compared to

the conventional scheme using a single-pixel. It often reduces the mismatch of

motion in the prediction and update steps. However, in the conventional t+2D

structure, the motion vectors are estimated at the highest spatial resolution.

In reconstructing the spatial low-resolution images the temporal prediction

Intelligent Multimedia Data Hiding

Search WWH ::

Custom Search

Home