Information Technology Reference
In-Depth Information
3.5.6
Other Simple Inhibition Functions
Continue to increase the
hidden_k
parameter, and
Run
.
You should observe that this provides increasingly
distributed patterns in the hidden layer (with some vari-
ation in the total number of active units due to the ef-
fects of ties as just discussed). The advantage of con-
trolling the number of active units through the kWTA
inhibition function instead of the trial-and-error manip-
ulation of the
g_bar_l
parameter is that you have
precise and direct control over the outcome. Once
this parameter is set, it will also apply regardless of
changes in many network parameters that would affect
the
g_bar_l
parameter.
Now, let's explore the use of kWTA inhibition in the
distributed, feature-based network.
,
!
Historically speaking, the predecessor to the kWTA in-
hibition function was the single winner-takes-all (WTA)
function. A simple case of WTA was used in the
competitive learning
algorithm (Rumelhart & Zipser,
1986; Grossberg, 1976), in which the most excited
unit's activity is set to 1, and the rest to zero. A “softer”
version of this idea was developed by Nowlan (1990)
using some of the same kinds of Bayesian mathemat-
ics discussed in section 2.7. In this case, units were
activated to the extent that their likelihood of generat-
ing (predicting) the input pattern was larger than that of
other units. Thus the activity of each unit had the form:
(3.7)
Set
network
to
DISTRIBUTED_NETWORK
instead
of
LOCALIST_NETWORK
.Set
hidden_k
back to 1,
Apply
, and press
Run
.
Notice how, except when there were ties, we were
able to force this distributed network to have a single
hidden unit active — this would have been very difficult
to achieve by setting the
g_bar_l
parameter due to the
fine tolerances involved in separating units with very
similar levels of excitation.
where
l
j
is the likelihood measure (
P (datajh)
)for
each unit, and the denominator comes from replacing
P(data)
with a sum over the conditional probabilities
for all the mutually exclusive and exhaustive hypothe-
ses (as represented by the different units).
This mutual exclusivity assumption constitutes a sig-
nificant limitation of this type of model. As we saw
above, distributed representations obtain their consid-
erable power by enabling things to be represented by
multiple cooperating units. The exclusivity assumption
is inconsistent with the use of such distributed represen-
tations, relegating the single WTA models to using lo-
calist representations (even when the units have graded
“soft” activation values). Nevertheless, these simpler
models are important because they admit to a firm math-
ematical understanding at the level of the entire net-
work, which is not generally possible for the kWTA
function.
A related form of activation function to the simple
WTA function is the
Kohonen network
(Kohonen,
1984), which builds on ideas that were also proposed
by von der Malsburg (1973). Here, a single “winner”
is chosen as before, but now a
neighborhood
of units
around this winner also gets to be active, with the acti-
vation trailing off as a function of distance from the win-
ner. When used with learning, such networks exhibit
many interesting properties deriving from their built-in
tendency to treat neighboring items in similar ways. We
Do a
Cluster
on the
CLUSTER_HIDDEN
unit activ-
ities.
,
!
Question 3.14 (a)
How well does one feature do at
representing the similarity structure of the digits?
(b)
What value of the
hidden_k
parameter produces a
cluster without any collapsed distinctions?
(c)
How
many of the hidden patterns actually have that many
units active?
(d)
What mechanism explains why not ev-
ery pattern had
hidden_k
units active?
(e)
Keeping
the
hidden_k
value you just found, what happens to
the activity levels and clustering when you reduce the
leak current to 1? 0?
Go to the
PDP++Root
window. To continue on to
the next simulation, close this project first by selecting
.projects/Remove/Project_0
. Or, if you wish to
stop now, quit by selecting
Object/Quit
.
Search WWH ::
Custom Search