Perception and Attention - Computational Explorations in Cognitive Neuroscience

Information Technology Reference

In-Depth Information

Item Summary

Generalization Test

itm

err

%tot

%itm

correl

uniq

In addition to all of the above receptive field measures

of the network's performance, we can perform a behav-

ioral test of its ability to generalize in a spatially invari-

ant manner, using the two objects (numbers 18 and 19 in

figure 8.12) that were not presented to the network dur-

ing training. We can now train on these two objects in

a restricted set of spatial locations and sizes, and assess

the network's ability to respond to these items in novel

locations and sizes. The sweep test above indicates that

the network should generalize, because it produced spa-

tially invariant responses in layer V4 for these novel

stimuli. Thus, presumably the bulk of what the net-

work needs to do is learn an association between these

V4 representations and the appropriate output units, and

good generalization should result to all other spatial lo-

cations.

In addition to presenting the novel objects during

training, we also need to present familiar objects; other-

wise the network will suffer from catastrophic interfer-

ence (see chapters 7 and 9 for more discussion of this is-

sue). The following procedure was used. On each trial,

there was a 1 in 4 chance that a novel object would be

presented, and 3 in 4 chance that a familiar one was pre-

sented. If a novel object was presented, its location was

chosen at random from a 6 x 6 grid at the center of the

visual field (i.e., 36 possible locations, out of a total of

256, or roughly 14% of the locations), and its size was

chosen at random to be either 5 or 9 pixels (i.e., 2 out

of the 4 possible sizes, or 50% of the sizes). If a famil-

iar object was presented, then its size and position was

chosen completely at random from all the possibilities.

This procedure was repeated for 60 epochs of 150 ob-

jects per epoch, with a learning rate of .001. Although

the network started getting the novel objects correct af-

ter only 3 epochs, the longer training ensured that the

new knowledge was well consolidated before testing.

Testing entailed another sweep analysis after training

on the new objects (table 8.2). The detailed results are

contained in the file objrec.swp.err . In particu-

lar, pay attention to the %itm column, which shows the

errors as a function of total presentations of that item

(object). The overall results were that object 18 had

roughly 15 percent errors (85% correct) during testing

1.71

3.71

96.02

5.93

110

4.96

10.74

88.53

9.53

0.99

2.14

97.54

3.97

142

6.40

13.86

90.04

5.76

2.97

6.44

92.70

7.57

1.75

3.80

96.19

7.18

1.89

4.10

96.03

5.71

176

7.94

17.18

87.56

7.74

2.57

5.56

94.41

5.99

150

6.76

14.64

86.93

8.54

4.46

9.66

90.65

6.95

106

4.78

10.35

91.99

4.96

159

7.17

15.52

87.13

8.26

2.88

6.25

94.30

6.23

2.48

5.37

93.80

5.93

229

10.33

22.36

83.39

6.26

3.70

8.00

92.28

9.25

2.93

6.34

92.94

8.27

162

7.31

15.82

85.10

10.74

353

15.92

34.47

73.74

9.16

size

Size Summary

921

41.56

17.98

86.35

7.19

353

15.92

6.89

89.03

7.19

290

13.08

5.66

94.75

7.20

652

29.42

12.73

92.12

7.21

itm,size

Item by Size Summary

18,0

1.21

10.54

84.98

10.72

18,1

3.15

27.34

82.99

10.73

18,2

1.39

12.10

84.67

10.74

18,3

1.53

13.28

87.76

10.75

19,0

122

5.50

47.65

66.16

9.14

19,1

102

4.60

39.84

70.40

9.16

19,2

1.48

12.89

79.15

9.16

19,3

4.33

37.5

79.24

9.18

Tab le 8 . 2 : Summary of sweep testing results after generaliza-

tion training on the novel items (18 and 19) in a limited num-

ber of positions and sizes. Columns are as in previous table,

with additional data for novel items in different sizes shown.

The %itm column for the novel items (15.82 and 34.47) gives

the generalization error, which is not bad considering the lim-

ited range of training locations and sizes (covering only 7% of

the total possible). The Item by Size data shows that size gen-

eralization to the untrained sizes (1 and 3) is also fairly good

(27.34 and 13.28 for object 18, 39.84 and 37.5 for 19). More

errors are also generally made in the smaller sizes. Finally,

there is some evidence of interference from the generalization

training in comparison with table 8.1.

Computational Explorations in Cognitive Neuroscience

Search WWH ::

Custom Search

Home