Transform Coding - Introduction to Data Compression

Databases Reference

In-Depth Information

The various quantization and binary coding techniques have been described at some length

in previous chapters, so we will spend the next section describing various transforms. We will

then discuss quantization and coding strategies in the context of these transforms.

13.3 The Transform

All of the transforms we deal with will be linear transforms; that is, we can get the sequence

{ θ n }

from the sequence

{

x n }

−

θ n =

x i a n , i

(9)

This is referred to as the forward transform . For the transforms that we will be considering,

a major difference between the transformed sequence

{ θ n }

{

x n }

and the original sequence

that the characteristics of the elements of the

sequence are determined by their position

within the sequence. For instance, in Example 13.2.1 the first element of each pair of the

transformed sequence is more likely to have a large magnitude compared to the second element.

In general, we cannot make such statements about the source output sequence

. Ameasure

of the differing characteristics of the different elements of the transformed sequence

{

x n }

{ θ n }

n of each element. These variances will strongly influence how we encode

the transformed sequence. The size of the block N is dictated by practical considerations.

In general, the complexity of the transform grows more than linearly with N . Therefore,

beyond a certain value of N , the computational costs overwhelm any marginal improvements

that might be obtained by increasing N . Furthermore, in most real sources, the statistical

characteristics of the source output can change abruptly. For example, when we go from a

silence period to a voiced period in speech, the statistics change drastically. Similarly, in

images, the statistical characteristics of a smooth region of the image can be very different

from the statistical characteristics of a busy region of the image. If N is large, the probability

that the statistical characteristics change significantly within a block increases. This generally

results in a larger number of transform coefficients with large values, which in turn leads to a

reduction in the compression ratio.

The original sequence

the variance

{

x n }

can be recovered from the transformed sequence

{ θ n }

via the

inverse transform :

−

x n =

0 θ i b n , i

(10)

The transforms can be written in matrix form as

θ =

(11)

(12)

where A and B are N

N matrices, and the

(

)

th element of the matrices is given by

[

] i , j

a i , j

(13)

[

] i , j

b i , j

(14)

Introduction to Data Compression

Search WWH ::

Custom Search

Home