Natural Language Interpretation for an Interactive Service Robot in Domestic Domains - Agents and Artificial Intelligence

Information Technology Reference

In-Depth Information

Ta b l e 4 . Response times (in seconds) depending on the two types of difficulty

#of tree

#actions/#entities

obj. depth

1/1

1/5

5/1

5/5

1

3

0.15

0.32

0.48

1.27

2

5

0.47

0.96

1.61

3.50

3

7

2.54

4.83

7.40

13.92

4

9

18.77

34.00

39.72

68.19

5

11

153.40 267.55 154.97 276.20

three pre-selection methods. The last row shows the performance of the system includ-

ing all three enhancements. As we can see, the full combination yields an improvement

except for utterance i where the difference is negligible. The relative improvement of

the enhancements increases with the complexity of the utterances. That is to say, the

more complex the utterance, the more the speed-ups pay off.

Altogether, the complexity of the search tree is affected by the different branching

factors at each level, and the depth which depends on the number of spoken objects. The

branching factor at the first level depends on the number of actions that have the spoken

verb as a synonym. The branching factor at the second level depends on the number of

entities that have the spoken object as a synonym. At the third level the branching factor

depends on the number of parameters of the respective skill. We further evaluated our

optimised system by varying the two complexity factors independently.

Along the rows of Table 4 we varied the number of spoken objects. Along the

columns we varied the number of actions that have the spoken verb as a synonym and

the number of entities that have the spoken object as a synonym. The number of pa-

rameters of the appropriate skill are not varied, since this number already depends on

the amount of spoken objects. In this test scenario the parameters of a skill became

distinguishable for the system by providing distinct prepositions for each parameter.

Different entities became distinguishable through their attributes and the skills were

distinguishable by the number of parameters. So we had five skills with 1, 2, 3, 4 and 5

parameters, respectively.

Table 4 shows that the number of spoken objects has a greater influence on the com-

putation time than has ambiguity. This is indicated by the last two rows which only

contain measurements greater than 10 seconds. That is unacceptable for fluent human-

robot interaction. We can also observe that action pre-selection performs very well in

this test scenario. All tests in the last row address a skill with five parameters. In this test

scenario there was no other skill involving five or more parameters. As a consequence,

the action pre-selection can rule out the other four skill candidates which implies noth-

ing less than reducing the branching factor of the top node from 5 to 1 and thus reducing

the computation time by a factor of approximately 5. This also results in comparable

computation times for the combinations 1/1 (153.40 sec) and 5/1 (154.97 sec) as well

as 1/5 (267.55 sec) and 5/5 (276.20 sec).

Finally, we analysed whether the lexicon size poses a computational problem. There-

fore, we simply added 50,000 nouns to the lexicon and used the full combination test

setup from Table 3. Now, Table 5 indicates that the additional computational effort to

process the utterances with a large lexicon plays no significant role.

Agents and Artificial Intelligence

Search WWH ::

Custom Search

Home