Introduction - Hierarchical Neural Networks for Image Interpretation

Information Technology Reference

In-Depth Information

1.2 Organization of the Thesis

The thesis is organized as follows:

Part I: Theory

Chapter 2. The next chapter gives some background information on the human

visual system. It covers the visual pathways, the organization of feature maps, com-

putation in layers, neurons as processing units, and synapses as adaptable elements.

At the end of the chapter, some open questions are discussed, including the binding

problem and the role of recurrent connections.

Chapter 3. Related work is discussed in Chapter 3, focussing on two aspects of

the proposed architecture: hierarchy and recurrence. Generic signal decompositions,

neural networks, and generative statistical models are reviewed as examples of hier-

archical systems for image analysis. The use of recurrence is discussed in general.

Special attention is paid to models with specific types of recurrent interactions: lat-

eral, vertical, and the combination of both.

Chapter 4. The proposed architecture for image interpretation is introduced in

Chapter 4. After giving an overview, the architecture is formally described. To illus-

trate its use, several small example networks are presented. They apply the architec-

ture to local contrast normalization, binarization of handwriting, and shift-invariant

feature extraction.

Chapter 5. Unsupervised learning techniques are discussed in Chapter 5. An un-

supervised learning algorithm is proposed for the suggested architecture that yields

a hierarchy of sparse features. It is applied to a dataset of handwritten digits. The

produced features are used as input to a supervised classifier. The performance of

this classifier is compared to other classifiers, and it is combined with two existing

classifiers.

Chapter 6. Supervised learning is covered in Chapter 6. After a general discus-

sion of supervised learning problems, gradient descent techniques for feed-forward

neural networks and recurrent neural networks are reviewed separately. Improve-

ments to the backpropagation technique and regularization methods are discussed,

as well as the difficulty of learning long-term dependencies in recurrent networks. It

is suggested to combine the RPROP algorithm with backpropagation through time

to achieve stable and fast learning in the proposed recurrent hierarchical architec-

ture.

Part II: Applications

Chapter 7. The proposed architecture is applied to recognize the value of postage

meter marks. After describing the problem, the dataset, and some preprocessing

steps, two classifiers are detailed. The first one is a hierarchical block classifier that

recognizes meter values without prior digit segmentation. The second one is a neural

Search WWH ::

Custom Search

Home