A Distributed Architecture for Real-time Dialogue and On-task Learning of Efficient Co-operative Turn-taking - Coverbal Synchrony in Human-Machine Interaction

Graphics Reference

In-Depth Information

Figure 4. The heart of turn-taking control in the system consists of a set of eight semi-global

context-states and 11 deciders. In context-state I-Have-Turn (IHT), both I-Give-Turn-Decider

(IGTD) and Other-Wants-Turn-Decider (OWTD) are active. Unlike other modules, the Dialog

Planner (DP) can transition independently from the system's current context-state and

override the decisions from the reactive deciders. A Timeout-Decider handles transitions

if one of the negotiating contexts is being held unacceptably long (but its transitions are

not included in this diagram; also not shown are which modules are active during which

contexts).

to Other-Gives-Turn after a two-second silence). If the agent waits too

long, and the speaker does not continue, there will be an unwanted

silence; if he starts too soon and the speaker continues speaking,

overlapping speech will result. We solve this by having OGTD-2 use

information about past prosody, which occurs right before the latest

silence, to select an optimal silence tolerance window (STW), as will

now be described in detail.

4. The Learner

The learning mechanism is implemented as an independent component

(Learner module) in the modular architecture described above. It is

based on the Actor-Critic distribution of functionality (Sutton and

Barto, 1998), where one or more actors make decisions about which

actions to perform and a critic evaluates the effect of these actions

on the environment; the separation between decision and action is

important because in our system a decision can be made to act in the

future. In the highly general and distributed learning mechanism we

have implemented, any module in the system can take the role of an

actor by sending out decisions and receiving, in return, an updated

Search WWH ::

Custom Search

Home