We have already shown that $\lfloor \bar{T}_X(x) \rfloor_{l(x)} > F_X(x-1)$. Therefore, all we need to do is show that

$$F_X(x) - \lfloor \bar{T}_X(x) \rfloor_{l(x)} > \frac{1}{2^{l(x)}}$$

This is true because

$$F_X(x) - \lfloor \bar{T}_X(x) \rfloor_{l(x)} > F_X(x) - \bar{T}_X(x) = \frac{P(x)}{2} \geq \frac{1}{2^{l(x)}}$$
Thus, this code is prefix free; by taking the binary representation of $\bar{T}_X(x)$ and truncating it to $l(x) = \left\lceil \log \frac{1}{P(x)} \right\rceil + 1$ bits, we obtain a uniquely decodable code.
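As a sanity check, the tag-and-truncate construction can be sketched numerically. The three-symbol source below is an illustrative assumption, not an example from the text; the code builds each codeword by truncating the binary expansion of the midpoint tag $\bar{T}_X(x) = F_X(x-1) + P(x)/2$ to $l(x)$ bits and then verifies that no codeword is a prefix of another.

```python
import math
from fractions import Fraction

# Hypothetical three-symbol source; the probabilities are assumptions.
probs = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]

codes = []
F = Fraction(0)  # cumulative distribution F_X(x - 1)
for p in probs:
    tag = F + p / 2                      # midpoint tag T-bar_X(x)
    l = math.ceil(math.log2(1 / p)) + 1  # l(x) = ceil(log 1/P(x)) + 1
    # Truncate the binary expansion of the tag to l bits.
    code = ""
    t = tag
    for _ in range(l):
        t *= 2
        bit = int(t)  # next binary digit of the tag
        code += str(bit)
        t -= bit
    codes.append(code)
    F += p  # advance to F_X(x)

# No codeword is a prefix of another, so the code is prefix free.
for i, a in enumerate(codes):
    for j, b in enumerate(codes):
        assert i == j or not b.startswith(a)

print(codes)  # → ['01', '101', '111']
```

Note that the exact `Fraction` arithmetic keeps the truncation loop free of floating-point rounding issues.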
Although the code is uniquely decodable, how efficient is it? We have shown that the number of bits $l(x)$ required to represent $F_X(x)$ with enough accuracy such that the code for different values of $x$ is distinct is

$$l(x) = \left\lceil \log \frac{1}{P(x)} \right\rceil + 1$$
Remember that $l(x)$ is the number of bits required to encode the entire sequence $x$. So, the average length of an arithmetic code for a sequence of length $m$ is given by

$$l_A^{(m)} = \sum P(x)\, l(x) \qquad (14)$$

$$= \sum P(x) \left[ \left\lceil \log \frac{1}{P(x)} \right\rceil + 1 \right] \qquad (15)$$

$$< \sum P(x) \left[ \log \frac{1}{P(x)} + 1 + 1 \right] \qquad (16)$$

$$= -\sum P(x) \log P(x) + 2 \sum P(x) \qquad (17)$$

$$= H(X^{(m)}) + 2 \qquad (18)$$
Given that the average length is always greater than the entropy, the bounds on $l_A^{(m)}$ are

$$H(X^{(m)}) \leq l_A^{(m)} < H(X^{(m)}) + 2.$$
The length per symbol, $l_A$, or rate of the arithmetic code is $\frac{l_A^{(m)}}{m}$. Therefore, the bounds on $l_A$ are

$$\frac{H(X^{(m)})}{m} \leq l_A < \frac{H(X^{(m)})}{m} + \frac{2}{m} \qquad (19)$$
We have shown in Chapter 3 that for iid sources

$$H(X^{(m)}) = mH(X) \qquad (20)$$
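Equation (20) can be checked numerically for a small iid source. The two-symbol alphabet and its probabilities below are illustrative assumptions; the sketch computes the joint entropy of the $m$-fold extension by brute force and compares it with $mH(X)$.

```python
import math
from itertools import product

# Hypothetical two-symbol iid source; the probabilities are assumptions.
p = {"a": 0.7, "b": 0.3}
H = -sum(q * math.log2(q) for q in p.values())  # per-symbol entropy H(X)

m = 3
Hm = 0.0  # joint entropy H(X^(m)) of the m-fold extension
for seq in product(p, repeat=m):
    P = math.prod(p[s] for s in seq)  # P(x) for the length-m sequence x
    Hm -= P * math.log2(P)

assert abs(Hm - m * H) < 1e-9  # H(X^(m)) = mH(X) for iid sources
```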
Therefore,

$$H(X) \leq l_A < H(X) + \frac{2}{m} \qquad (21)$$
By increasing the length of the sequence, we can guarantee a rate as close to the entropy as we
desire.
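This convergence of the rate toward the entropy can be illustrated with a short sketch. The source below is an assumption for demonstration purposes; for each sequence length $m$ it computes $l_A^{(m)} = \sum P(x)\, l(x)$ exactly by enumerating all length-$m$ sequences, then checks the bounds in (19).

```python
import math
from itertools import product

# Illustrative iid source; the probabilities are assumptions.
p = {"a": 0.7, "b": 0.3}
H = -sum(q * math.log2(q) for q in p.values())  # entropy H(X)

for m in (1, 2, 4, 8):
    lA_m = 0.0
    for seq in product(p, repeat=m):
        P = math.prod(p[s] for s in seq)               # P(x)
        lA_m += P * (math.ceil(math.log2(1 / P)) + 1)  # P(x) * l(x)
    rate = lA_m / m               # rate l_A = l_A^(m) / m
    assert H <= rate < H + 2 / m  # bounds (19)
    print(f"m={m}: rate={rate:.4f}, entropy={H:.4f}")
```

As $m$ grows, the slack term $2/m$ shrinks and the printed rate approaches the entropy, exactly as the bound in (21) predicts.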