Information Technology Reference
In-Depth Information
w 0
−5.4626
w 0
−11,864
−5.4628
−5.463
−11,866
−5.4632
1
w 1
1
w 1
0.5
10.926
4.747
0.5
x 10 −3
x 10 −4
0
4.7465
0
10.9259
4.746
−0.5
−0.5
w 2
−1
4.7455
w 2
−1
−1.5
4.745
10.9258
(a)
(b)
−23.4754
w 0
−5.4628
−23.4755
w 0
−5.4629
−23.4756
2
−5.4629
−5.463
w 1
1
11.1
w 1
x 10 −4
−5.4631
9.3903
11
0
9.3903
−5.4631
9.3902
10.9
−1
w 2
1.5
x 10 −4
w 2
9.3902
1
0.5
0
9.3901
−0.5
−2
−1
10.8
9.3901
−1.5
(c)
(d)
Fig. 3.15 H S for bivariate Gaussian inputs, computed on a ( w 1 ,w 2 ,w 0 ) grid
around w (central dot). The dot size represents H S value (larger size for higher
H S ). (a) The distant-classes case, μ 1 =[50]
T ,with w the min H S solution. (b)
T ,with w a minimizer for w 1 and w 2 and a
maximizer for w 0 . (c) A zoom of the central layer showing the maximum for w 0 .
(d) The same as (a) for the logistic activation function.
The close-classes case,
μ 1 =[10]
Similar results are obtained if the logistic activation function ϕ ( x )=1 / (1+
e −x ) is considered. In this case
exp
m t 2
ln t−e
1 −t + e
1
2 σ t
2 πσ t ( t
f E|t ( e )=
1+ t,t [ ( e ) .
(3.45)
]
e )(1
t + e )
For the same “distant classes” setting we find the H S -MEE solution
[ w 1 0 w 0 ] T
23 . 4755] T
=[9 . 3902 0
(corresponding to the optimal solu-
tion
w 0 /w 1 =2 . 5) which is a local minimum as shown in Fig. 3.15(d). For
the “close classes” setting, and in the same way as for the tanh activation
function, a saddle point is found.
 
Search WWH ::




Custom Search