Networks of Neurons - Computational Explorations in Cognitive Neuroscience

Information Technology Reference

In-Depth Information

Learning: Distributed representations allow for the

bootstrapping of small changes that is critical for

learning (see section 1.6.5 and chapters 4-6). Fur-

thermore, learning tends not to result in perfectly

interpretable representations at the single unit level

(i.e., not template matching units).

these units will be active whenever one of these features

is present in the input.

Press Run .Do View , EVENTS to see the input pat-

terns.

Now verify for yourself in the GridLog that the firing

patterns of the hidden units make sense given the fea-

tures present in the different digits. The only case that

is somewhat strange is the third hidden unit firing for

the digit “0” — it fires because the left and right sides

match the weight pattern. There is an important lesson

here — just because you might have visually encoded

the third hidden unit as the “middle horizontal line” de-

tector, it can actually serve multiple roles . This is just

a simple case of the kind of complexity that surrounds

the attempt to describe the content of what is being de-

tected by neurons. Imagine if there were 5,000 weights,

with a much more complicated pattern of values, and

you can start to get a feel for how complicated a neu-

ron's responses can be.

Now, let's see what a cluster plot of the hidden unit

representations tells us about the properties of the trans-

formation performed by this distributed network.

, !

As should be clear, distributed representations are

critical for many of the key properties of neural net-

works described in the introductory chapter. Despite

this, some researchers maintain that localist representa-

tions are preferable, in part for their simplicity, which

was admittedly useful in the previous section. Never-

theless, most of the remaining models in this topic em-

ploy distributed representations. In section 3.5 we also

discuss the important difference between sparse dis-

tributed representations and generic distributed repre-

sentations.

3.3.3

Exploration of Distributed Representations

Now, we will explore the difference between localist

and distributed representations.

Select NOISY_DIGITS for

the env_type

Open the project loc_dist.proj.gz in

chapter_3 to begin.

This project looks very similar to the previous one —

it is in fact a superset of it, having both the previous lo-

calist network, and a new distributed one. To begin, we

will first replicate the results we obtained before using

the localist network.

( Apply ), and Run .

Then, do Cluster and choose

, !

CLUSTER_HIDDEN .

This should produce a cluster plot like that shown in

figure 3.13. Although there are a couple of obvious dif-

ferences between this plot and the one for the localist

network shown in figure 3.8b, it should be clear that

the distributed network is also generally emphasizing

the distinctions between different digits while deempha-

sizing (collapsing) distinctions among noisy versions of

the same digit.

One difference in the distributed network is that it

sometimes collapsed noisy versions of different digits

together (a 2 with the 5's, and a 0 with the 4's), even

though in most cases the different versions of the same

digit were properly collapsed together. It also did not

always collapse all of the different images of a digit to-

gether, sometimes only getting 2 out of 3. The reason

for these problems is that the noisy versions actually

shared more features with a different digit representa-

tion. In most cases where we use distributed represen-

tations, we use a learning mechanism to discover the

individual feature detectors, and learning will usually

Press View on the overall control panel (now called

loc_dist_ctrl ) and select GRID_LOG . Then do Run .

You should see the responses of the same localist net-

work just as before.

, !

Set network to DISTRIBUTED_NETWORK instead

of LOCALIST_NETWORK , and hit Apply .

You will now see the distributed network take the

place of the localist one. This network contains only 5

hidden units. Let's explore this network by examining

the weights into these hidden units.

, !

Select r.wt in the network window, and click on

each of the units.

You will notice that these units are configured to de-

tect parts or features of digit images, not entire digits

as in the localist network. Thus, you can imagine that

Computational Explorations in Cognitive Neuroscience

Search WWH ::

Custom Search

Home