Subband Coding - Introduction to Data Compression

Databases Reference

In-Depth Information

rates. If this is not true, we need to weight the rates of the individual subbands. The functional

to be minimized becomes

D k + λ

J

=

β k R k

(116)

where

β k is the weight reflecting the relative length of the sequence generated by the k th filter.

The distortion contribution from each subband might not be equally relevant, perhaps because

of the filter construction or because of the perceptual weight attached to those frequencies [ 210 ].

In these cases we can modify our functional still further to include the unequal weighting of

the distortion:

=

w k D k + λ

β k R k

(117)

J

14.10 Application to Speech Coding __ G.722

ITU-T recommendation G.722 provides a technique for wideband coding of speech signals

that is based on subband coding. The basic objective of this recommendation is to provide

high-quality speech at 64 kbits per second (kbps). The recommendation also contains two

other modes that encode the input at 56 and 48kbps. These two modes are used when an

auxiliary channel is needed. These two modes provide for auxiliary channels of 8 and 16kbps,

respectively.

The speech output or audio signal is filtered to 7kHz to prevent aliasing, then sampled at

16,000 samples per second. Notice that the cutoff frequency for the anti-aliasing filter is 7kHz,

not 8kHz, even though we are sampling at 16,000 samples per second. One reason for this is

that the cutoff for the anti-aliasing filter is not going to be sharp like that of the ideal low-pass

filter. Therefore, the highest frequency component in the filter output will be greater than

7kHz. Each sample is encoded using a 14-bit uniform quantizer. This 14-bit input is passed

through a bank of two 24-coefficient FIR filters. The coefficients of the low-pass quadrature

mirror filter are shown in Table 14.7 .

The coefficients for the high-pass quadraturemirror filter can be obtained by the relationship

n h LP , n

h HP , n = ( −

1

)

(118)

The low-pass filter passes all frequency components in the range of 0 to 4kHz, while the

high-pass filter passes all remaining frequencies. The output of the filters is downsampled by

a factor of two. The downsampled sequences are encoded using adaptive differential PCM

(ADPCM) systems.

The ADPCM system that encodes the downsampled output of the low-frequency filter uses

6 bits per sample, with the option of dropping 1 or 2 least significant bits in order to provide

room for the auxiliary channel. The output of the high-pass filter is encoded using 2 bits per

sample. Because the 2 least significant bits of the quantizer output of the low-pass ADPCM

system could be dropped and thus not be available to the receiver, the adaptation and prediction

at both the transmitter and receiver are performed using only the 4 most significant bits of the

quantizer output.

Introduction to Data Compression

Search WWH ::

Custom Search

Home