The activation function may be a simple threshold function, a sigmoid, a hyperbolic tangent, or a radial basis function [15].
(1)
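For illustration, the following Python sketch implements the activation functions named above. The parameter names (theta, center, width) are illustrative choices, not taken from the cited references.

```python
# A minimal sketch of common NN activation functions, using NumPy.
import numpy as np

def threshold(x, theta=0.0):
    """Simple threshold (step) function: 1 if the input exceeds theta, else 0."""
    return np.where(x > theta, 1.0, 0.0)

def sigmoid(x):
    """Logistic sigmoid, squashing inputs into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    """Hyperbolic tangent, squashing inputs into (-1, 1)."""
    return np.tanh(x)

def radial_basis(x, center=0.0, width=1.0):
    """Gaussian radial basis function centered at `center`."""
    return np.exp(-((x - center) ** 2) / (2.0 * width ** 2))

x = np.linspace(-3.0, 3.0, 7)
print(threshold(x), sigmoid(x), tanh(x), radial_basis(x), sep="\n")
```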
Backpropagation is a common training technique for an NN. This training process requires the NN to perform a particular function by adjusting the values of the connections (weights) between elements [6, 16]. Three important issues related to the NN need to be addressed: the selection of data samples for network training, the selection of an appropriate and efficient training algorithm, and the determination of network size [17, 18].
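As a minimal sketch of this idea (not the implementation used in the chapter), the following Python code trains a tiny two-layer network on XOR by backpropagation, adjusting the connection weights down the error gradient. The layer sizes, learning rate, and epoch count are arbitrary illustrative choices.

```python
# A minimal backpropagation sketch: a 2-4-1 sigmoid network learning XOR.
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1 = rng.normal(size=(2, 4))   # input -> hidden weights
b1 = np.zeros((1, 4))
W2 = rng.normal(size=(4, 1))   # hidden -> output weights
b2 = np.zeros((1, 1))
lr = 0.5                       # learning rate (illustrative)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

for _ in range(10000):
    # Forward pass.
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # Backward pass: propagate the squared-error signal from the output
    # toward the input, computing the gradient for each weight matrix.
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)
    # Adjust the connection weights down the gradient.
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0, keepdims=True)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0, keepdims=True)

print(out.round(2))  # typically approaches [[0], [1], [1], [0]]
```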
Moreover, an NN has many advantages, such as good learning ability, low memory demand, suitable generalization, fast real-time operation, simple and convenient use, and adeptness at analyzing complex patterns. On the other hand, an NN has some disadvantages, including its requirement for high-quality data, the need for careful a priori selection of variables, the risk of over-fitting, and the required definition of an architecture [14].
5 Overview of the classification and regression tree
C&R trees are the most common and popular nonparametric DT learning technique. In this chapter, I use only a regression tree for numeric data values. C&R builds a binary tree by splitting the records at each node according to a function of a single input variable. The measure used to evaluate a potential splitter is diversity. This method uses recursive partitioning to split the training records into segments with similar output variable values [4]. Moreover, the impurity at each node can be defined by one of two measures: entropy, as in Equation (2), or the Gini index, which has been chosen for this chapter. The equation for entropy follows.
$$\text{Entropy}(t) = -\sum_{j} p(j \mid t)\,\log_2 p(j \mid t) \tag{2}$$

where $p(j \mid t)$ is the proportion of records of class $j$ at node $t$.
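As a concrete illustration, the following Python sketch computes this entropy from the class labels at a node; the label arrays are made-up examples, not data from the chapter.

```python
# A minimal sketch of the entropy impurity in Equation (2).
import numpy as np

def entropy(labels):
    """Entropy (in bits) of the class distribution p(j|t) at a node."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(entropy([0, 0, 1, 1]))  # maximally impure two-class node: 1.0
print(entropy([0, 0, 0, 0]))  # pure node: entropy is zero
```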
The Gini index, on the other hand, generalizes the variance impurity, which is the variance of a distribution over the two classes. As in Equation (3), the Gini index can also be interpreted as the expected error rate when the class label is chosen randomly from the class distribution at the node. In that case, this impurity measure is slightly stronger at equal probabilities (for two classes) than the entropy measure. The Gini index, defined by the following equation, holds some advantages for optimizing the impurity metric at the nodes [19].
$$\text{Gini}(t) = 1 - \sum_{j} p(j \mid t)^{2} \tag{3}$$
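To make both the Gini index and the node-splitting step concrete, the sketch below computes the impurity of Equation (3) and uses it to pick the best binary split on a single input variable, in the spirit of the recursive partitioning described above. The toy arrays and the exhaustive threshold scan are illustrative choices, not the chapter's implementation.

```python
# A minimal sketch of the Gini index in Equation (3) and of the kind of
# single-variable binary split search performed at each node of a C&R tree.
import numpy as np

def gini(labels):
    """Gini impurity 1 - sum_j p(j|t)^2 of the class distribution at a node."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def best_split(x, labels):
    """Threshold on one input variable minimizing the weighted child impurity."""
    best_t, best_score = None, np.inf
    for t in np.unique(x)[:-1]:
        left, right = labels[x <= t], labels[x > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best_score:
            best_t, best_score = t, score
    return best_t, best_score

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([0, 0, 0, 1, 1, 1])
print(best_split(x, y))  # splits at 3.0, yielding two pure children
```

Applied recursively to each resulting segment, this split search yields the binary tree structure described above.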