Analysis/Synthesis and Analysis by Synthesis Schemes - Introduction to Data Compression

Databases Reference

In-Depth Information

on speech coding have changed. We have looked at several algorithms that have been designed

to operate in this changed environment. For images, fractal coding provides a very different

way to look at the problem. Instead of using the physical structure of the system to generate

the source output, it uses a more abstract view to obtain an analysis/synthesis technique.

Further Reading

1. For information about various aspects of speech processing, Voice and Speech Processing,

by T. Parsons [ 119 ], is a very readable source.

2. The classic tutorial on linear prediction is “Linear Prediction,” by J. Makhoul [ 252 ],

which appeared in the April 1975 issue of the Proceedings of the IEEE .

3. For a thorough review of recent activity in speech compression, see “Speech Coding

Methods, Standards, and Applications,” by J.D. Gibson [ 245 ], which appeared in IEEE

Circuits and Systems Magazine .

4. An excellent source for information about speech coders is Digital Speech: Coding for

Low Bit Rate Communication Systems , by A. Kondoz [ 285 ].

5. An excellent description of the G.728 algorithm can be found in “A Low Delay CELP

Coder for the CCITT 16kb/s Speech Coding Standard,” by J.-H. Chen, R.V. Cox, Y.-C.

Lin, N. Jayant, and M.J. Melchner [ 240 ], in the June 1992 issue of the IEEE Journal on

Selected Areas in Communications .

6. A good introduction to fractal image compression is Fractal Image Compression: Theory

and Application , Y. Fisher (ed.) [ 250 ].

7. The October 1993 issue of the Proceedings of the IEEE contains a special section on

fractals with a tutorial on fractal image compression by A. Jacquin.

18.8 Projects and Problems

1. Write a program for the detection of voiced and unvoiced segments using the AMDF

function. Test your algorithm on the test.snd sound file.

2. The testf.raw file is a female voice saying the word test . Isolate 10 voiced and

unvoiced segments from the testm.raw file and the testf.snd file. (Try to pick the

same segments in the two files.) Compute the number of zero crossings in each segment

and compare your results for the two files.

3.

(a) Select a voiced segment from the testf.raw file. Find the fourth-, sixth-, and

tenth-order LPC filters for this segment using the Levinson-Durbin algorithm.

(b) Pick the corresponding segment from the testf.snd file. Find the fourth-, sixth-,

and tenth-order LPC filters for this segment using the Levinson-Durbin algorithm.

(c) Compare the results of (a) and (b).

4. Select a voiced segment from the test.raw file. Find the fourth-, sixth-, and tenth-

order LPC filters for this segment using the Levinson-Durbin algorithm. For each of the

filters, find the multipulse sequence that results in the closest approximation to the voiced

signal.

Search WWH ::

Custom Search

Home