Transform Coding - Introduction to Data Compression

Databases Reference

In-Depth Information

To cancel out the aliasing in the second half of the block, we need

CAq

CBr

DAp

DBq

From this we can get the requirements for the transform:

(64)

(65)

(66)

Note that the same requirements will help cancel the aliasing in the first half of block i by

using the second half of the inverse transform of block i

−

1. One selection that satisfies the

last condition is

2 (

−

)

(67)

2 (

)

(68)

where J is the counteridentity matrix.

The forward modified discrete transform is given by the following equation:

x n cos 2

−

N (

2 )(

2 +

4 )

X k =

(69)

where x n are the audio samples and X k are the frequency coefficients. The inverse MDCT is

given by

2 −

X k cos 2

N (

2 )(

2 +

4 )

y n =

(70)

or in terms of our matrix notation,

cos 2

N (

2 )(

2 +

4 )

[ P ] i , j

(71)

N cos 2

N (

2 )(

2 +

4 )

[ Q ] i , j

(72)

It is easy to verify that, given a value of N , these matrices satisfy the conditions for alias

cancellation.

Thus, while the inverse transform for any one block will contain aliasing, by using the

inverse transform of neighboring blocks, the aliasing can be canceled. What about blocks that

do not have neighbors—that is, the first and last blocks? One way to resolve this problem is

to pad the sampled audio sequence with N

2 zeros at the beginning and end of the sequence.

In practice, this is not necessary, because the data to be transformed is windowed prior to the

transform. For the first and last blocks, we use a special window that has the same effect as

introducing zeros. For information on the design of windows for the MDCT, see [ 200 ]. For

more on how the MDCT is used in audio compression techniques, see Chapter 16.

Introduction to Data Compression

Search WWH ::

Custom Search

Home