This also makes sense. For the compression scheme described here, if we know the source
output, we know 4 bits, the first 3 of which are the reconstruction. Therefore, in this example,
knowledge of the source output at a specific time completely specifies the corresponding
reconstruction.
8.4.2 Average Mutual Information
We make use of one more quantity that relates the uncertainty or entropy of two random
variables. This quantity is called mutual information and is defined as
$$
i(x_k; y_j) = \log \frac{P(x_k \mid y_j)}{P(x_k)} \tag{19}
$$
We will use the average value of this quantity, appropriately called average mutual information, which is given by
$$
I(X;Y) = \sum_{i=0}^{N-1} \sum_{j=0}^{M-1} P(x_i, y_j) \log \frac{P(x_i \mid y_j)}{P(x_i)} \tag{20}
$$
$$
= \sum_{i=0}^{N-1} \sum_{j=0}^{M-1} P(x_i \mid y_j)\, P(y_j) \log \frac{P(x_i \mid y_j)}{P(x_i)} \tag{21}
$$
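As a concrete illustration of Equations (19)-(21), the following sketch evaluates the average mutual information for a small, made-up joint distribution. The base-2 logarithm is used so the result comes out in bits; the joint PMF `P_xy` and the function name are illustrative assumptions, not from the text.

```python
import math

# Hypothetical joint PMF P(x_i, y_j): rows index the source alphabet (N = 2),
# columns index the reconstruction alphabet (M = 2). Values are made up.
P_xy = [[0.4, 0.1],
        [0.1, 0.4]]

def average_mutual_information(P_xy):
    """Evaluate Equation (20): I(X;Y) = sum_ij P(x_i,y_j) log[P(x_i|y_j)/P(x_i)]."""
    N, M = len(P_xy), len(P_xy[0])
    P_x = [sum(P_xy[i][j] for j in range(M)) for i in range(N)]    # marginal P(x_i)
    P_y = [sum(P_xy[i][j] for i in range(N)) for j in range(M)]    # marginal P(y_j)
    I = 0.0
    for i in range(N):
        for j in range(M):
            if P_xy[i][j] > 0:
                P_x_given_y = P_xy[i][j] / P_y[j]                  # P(x_i | y_j)
                I += P_xy[i][j] * math.log2(P_x_given_y / P_x[i])  # in bits
    return I

print(average_mutual_information(P_xy))   # about 0.278 bits for this example
```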
We can write the average mutual information in terms of the entropy and the conditional entropy by expanding the argument of the logarithm in Equation (21):
$$
I(X;Y) = \sum_{i=0}^{N-1} \sum_{j=0}^{M-1} P(x_i, y_j) \log \frac{P(x_i \mid y_j)}{P(x_i)} \tag{22}
$$
$$
= \sum_{i=0}^{N-1} \sum_{j=0}^{M-1} P(x_i, y_j) \log P(x_i \mid y_j) \;-\; \sum_{i=0}^{N-1} \sum_{j=0}^{M-1} P(x_i, y_j) \log P(x_i) \tag{23}
$$
$$
= H(X) - H(X \mid Y) \tag{24}
$$
where the second term in Equation (23) is $H(X)$, and the first term is $-H(X \mid Y)$. Thus, the average mutual information is the entropy of the source minus the uncertainty that remains about the source output after the reconstructed value has been received. The average mutual information can also be written as
$$
I(X;Y) = H(Y) - H(Y \mid X) = I(Y;X). \tag{25}
$$
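Before the Bayes'-theorem argument that follows, the identities in Equations (24) and (25) can be checked numerically by extending the sketch above (reusing `P_xy` and `average_mutual_information`; the helper names here are again illustrative, not from the text).

```python
def entropy(P):
    """H = -sum_k P_k log2 P_k for a marginal PMF."""
    return -sum(p * math.log2(p) for p in P if p > 0)

def conditional_entropy(P_rc):
    """H(row | column) = -sum_ij P(r_i, c_j) log2 P(r_i | c_j) for a joint PMF."""
    R, C = len(P_rc), len(P_rc[0])
    P_c = [sum(P_rc[i][j] for i in range(R)) for j in range(C)]
    return -sum(P_rc[i][j] * math.log2(P_rc[i][j] / P_c[j])
                for i in range(R) for j in range(C) if P_rc[i][j] > 0)

P_x = [sum(row) for row in P_xy]                   # marginal of X
P_yx = [list(col) for col in zip(*P_xy)]           # transposed joint: rows now index Y
P_y = [sum(row) for row in P_yx]                   # marginal of Y

I = average_mutual_information(P_xy)
assert abs(I - (entropy(P_x) - conditional_entropy(P_xy))) < 1e-9   # Equation (24)
assert abs(I - (entropy(P_y) - conditional_entropy(P_yx))) < 1e-9   # Equation (25)
```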
We can show this easily by using Bayes' theorem. According to Bayes' theorem,
$$
P(x_i \mid y_j) = \frac{P(y_j \mid x_i)\, P(x_i)}{P(y_j)}.
$$