The activation function may be a simple threshold function, a sigmoid, a hyperbolic tangent, or a radial basis function [15].
(1)
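For illustration, the following Python sketch implements the activation functions named above. The parameter names (theta, center, width) are illustrative choices, not taken from the cited references.

```python
# A minimal sketch of common NN activation functions, using NumPy.
import numpy as np

def threshold(x, theta=0.0):
    """Simple threshold (step) function: 1 if the input exceeds theta, else 0."""
    return np.where(x > theta, 1.0, 0.0)

def sigmoid(x):
    """Logistic sigmoid, squashing inputs into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    """Hyperbolic tangent, squashing inputs into (-1, 1)."""
    return np.tanh(x)

def radial_basis(x, center=0.0, width=1.0):
    """Gaussian radial basis function centered at `center`."""
    return np.exp(-((x - center) ** 2) / (2.0 * width ** 2))

x = np.linspace(-3.0, 3.0, 7)
print(threshold(x), sigmoid(x), tanh(x), radial_basis(x), sep="\n")
```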
Backpropagation is a common training technique for an NN. This training process requires the NN to perform a particular function by adjusting the values of the connections (weights) between elements [6, 16]. Three important issues related to the NN need to be addressed: the selection of data samples for network training, the selection of an appropriate and efficient training algorithm, and the determination of network size [17, 18].
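As a minimal sketch of this idea (not the implementation used in the chapter), the following Python code trains a tiny two-layer network on XOR by backpropagation, adjusting the connection weights down the error gradient. The layer sizes, learning rate, and epoch count are arbitrary illustrative choices.

```python
# A minimal backpropagation sketch: a 2-4-1 sigmoid network learning XOR.
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1 = rng.normal(size=(2, 4))   # input -> hidden weights
b1 = np.zeros((1, 4))
W2 = rng.normal(size=(4, 1))   # hidden -> output weights
b2 = np.zeros((1, 1))
lr = 0.5                       # learning rate (illustrative)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

for _ in range(10000):
    # Forward pass.
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # Backward pass: propagate the squared-error signal from the output
    # toward the input, computing the gradient for each weight matrix.
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)
    # Adjust the connection weights down the gradient.
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0, keepdims=True)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0, keepdims=True)

print(out.round(2))  # typically approaches [[0], [1], [1], [0]]
```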
Moreover, an NN has many advantages, such as good learning ability, low memory demand, suitable generalization, fast real-time operation, simple and convenient use, and adeptness at analyzing complex patterns. On the other hand, an NN has some disadvantages, including its requirement for high-quality data, the need for careful a priori selection of variables, the risk of over-fitting, and the required definition of an architecture [14].
5 Overview of the classification and regression tree
C&R trees are the most common and popular nonparametric DT learning technique. In this chapter, I use only a regression tree for numeric data values. C&R builds a binary tree by splitting the records at each node according to a function of a single input variable. The measure used to evaluate a potential splitter is diversity. This method uses recursive partitioning to split the training records into segments with similar output variable values [4]. Moreover, the impurity at each node can be defined by one of two measures: entropy, as in Equation (2), or the Gini index, which has been chosen for this chapter. The equation for entropy follows.
$$\text{Entropy}(t) = -\sum_{j} p(j \mid t)\,\log_2 p(j \mid t) \tag{2}$$

where $p(j \mid t)$ is the proportion of records of class $j$ at node $t$.
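As a concrete illustration, the following Python sketch computes this entropy from the class labels at a node; the label arrays are made-up examples, not data from the chapter.

```python
# A minimal sketch of the entropy impurity in Equation (2).
import numpy as np

def entropy(labels):
    """Entropy (in bits) of the class distribution p(j|t) at a node."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(entropy([0, 0, 1, 1]))  # maximally impure two-class node: 1.0
print(entropy([0, 0, 0, 0]))  # pure node: entropy is zero
```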
The Gini index, on the other hand, generalizes the variance impurity, which is the variance of a distribution over the two classes. As in Equation (3), the Gini index can also be interpreted as the expected error rate when the class label is chosen randomly from the class distribution at the node. In that case, this impurity measure is slightly stronger at equal probabilities (for two classes) than the entropy measure. The Gini index, defined by the following equation, holds some advantages for optimizing the impurity metric at the nodes [19].
$$\text{Gini}(t) = 1 - \sum_{j} p(j \mid t)^{2} \tag{3}$$
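To make both the Gini index and the node-splitting step concrete, the sketch below computes the impurity of Equation (3) and uses it to pick the best binary split on a single input variable, in the spirit of the recursive partitioning described above. The toy arrays and the exhaustive threshold scan are illustrative choices, not the chapter's implementation.

```python
# A minimal sketch of the Gini index in Equation (3) and of the kind of
# single-variable binary split search performed at each node of a C&R tree.
import numpy as np

def gini(labels):
    """Gini impurity 1 - sum_j p(j|t)^2 of the class distribution at a node."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def best_split(x, labels):
    """Threshold on one input variable minimizing the weighted child impurity."""
    best_t, best_score = None, np.inf
    for t in np.unique(x)[:-1]:
        left, right = labels[x <= t], labels[x > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best_score:
            best_t, best_score = t, score
    return best_t, best_score

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([0, 0, 0, 1, 1, 1])
print(best_split(x, y))  # splits at 3.0, yielding two pure children
```

Applied recursively to each resulting segment, this split search yields the binary tree structure described above.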