Optimization in Brain? – Modeling Human Behavior and Brain Activation Patterns with Queuing Network and Reinforcement Learning Algorithms - Computational Neuroscience

Information Technology Reference

In-Depth Information

sidering the effect of error, entities will choose the optimal route with the shortest

processing time if they want to maximize the reward of performance.

Consequently, to model learning, it is first necessary to quantify the learning

process in individual servers. Based on that, the condition under which an entity

switches between the two routes shown in Fig. 9.6 can be established and proved by

integrating queuing network with reinforcement learning. Finally, this quantitative

condition of route switching can be applied to the more complex model of 18 servers

with two routes (see Fig. 9.5) to generate the basic PRP and the reduction of PRP

during the learning process.

Fig. 9.6: The simplest queuing network with two routes.

9.3.1.1 Learning Process in Individual Servers

Based on the functions of the servers in Table 9.1, the two long-term memory servers

(LTDSM and LTPM) play the major roles in learning phonological judgments (task

1) and choice reaction (task 2) [2]. Because the learning effects of long-term mem-

ory are represented as speed of retrieval of production rules and motor programs

from the two long-term memory servers at the Hicog and the BG servers, it is im-

portant to quantify the processing time of the Hicog and the BG servers. In addition,

because the premotor cortex (PM) server is activated in learning visuomotor associ-

ations [32], changes in the processing speed of the PM server is also to be considered

in the learning process of the model.

Because the exponential function fits the learning processes in memory search,

motor learning, visual search, and mathematic operation tasks better than the power

law [18], it was again applied to model the learning process in the individual servers

here

=

+

( − α

) ,

1

/ μ

A i

B i Exp

i N i

(9.7)

i

i ) is its processing time; A i : the minimal

of processing time of server i after intensive practice; B i : the change of expected

value of processing time of server i from the beginning to the end of practice;

μ

i : processing speed of the server i ;(1

/ μ

α i :

learning rate of server i ; N i : number of customers processed by server i .

For the BG server, 1

/ μ BG : motor program retrieving time; ABBGB: the minimal

of processing time of BG server after practice (314 ms, [35]); B BG : the change of

expected value of processing time from the beginning to the end of practice (2

×

314

=

628 ms, assumed);

α BG : the learning rate of server BG (0.00142, [18]); N BG :

Computational Neuroscience

Search WWH ::

Custom Search

Home