Feature extraction by shape matching - Feature Extraction and Image Processing

Image Processing Reference

In-Depth Information

5

Feature extraction by

shape matching

5.1

Overview

High-level feature extraction concerns finding shapes in computer images. To be able to

recognise faces automatically, for example, one approach is to extract the component

features. This requires extraction of, say, the eyes, the ears and the nose, which are the

major face features. To find them, we can use their shape: the white part of the eyes is

ellipsoidal; the mouth can appear as two lines, as do the eyebrows. Shape extraction

implies finding their position, their orientation and their size. This feature extraction process

can be viewed as similar to the way we perceive the world: many books for babies describe

basic geometric shapes such as triangles, circles and squares. More complex pictures can

be decomposed into a structure of simple shapes. In many applications, analysis can be

guided by the way the shapes are arranged. For the example of face image analysis, we

expect to find the eyes above, and either side of, the nose and we expect to find the mouth

below the nose.

In feature extraction, we generally seek invariance properties so that the extraction

process does not vary according to chosen (or specified) conditions. That is, techniques

should find shapes reliably and robustly whatever the value of any parameter that can

control the appearance of a shape. As a basic invariant , we seek immunity to changes in

the illumination level: we seek to find a shape whether it is light or dark. In principle, as

long as there is contrast between a shape and its background, the shape can be said to exist,

and can then be detected. (Clearly, any computer vision technique will fail in extreme

lighting conditions, you cannot see anything when it is completely dark.) Following

illumination, the next most important parameter is position : we seek to find a shape

wherever it appears. This is usually called position- , location- or translation-invariance .

Then, we often seek to find a shape irrespective of its rotation (assuming that the object or

the camera has an unknown orientation): this is usually called rotation- or orientation-

invariance . Then, we might seek to determine the object at whatever size it appears, which

might be due to physical change, or to how close the object has been placed to the camera.

This requires size- or scale-invariance . These are the main invariance properties we shall

seek from our shape extraction techniques. However, nature (as usual) tends to roll balls

under our feet: there is always noise in images. Also since we are concerned with shapes,

note that there might be more than one in the image. If one is on top of the other, it will

occlude , or hide, the other, so not all the shape of one object will be visible.

But before we can develop image analysis techniques, we need techniques to extract the

shapes. Extraction is more complex than detection , since extraction implies that we have

Search WWH ::

Custom Search

Home