CONSTRUCTION AND ANALYSIS OF A MULTI-LAYERED IN-CAR SPOKEN DIALOGUE CORPUS - DSP for In-Vehicle and Mobile Systems

Digital Signal Processing Reference

In-Depth Information

Figure 1-3. Task distribution of the corpus.

3.

LAYERED INTENTION TAG

To develop a spoken dialogue system based on speech corpus [4]‚ certain

pre-specified information is required for each sentence corresponding to a

particular response of the system. Additionally‚ to perform the response to

satisfy the user‚ we need to presume the intention of the user's utterances.

From our preliminary trials‚ we have learned that user's intention has a wide

range even for a rather simple task‚ which could necessitate the creation of

dozens of intention tags. To organize and expedite the process‚ we have

stratified tags into several layers‚ which have resulted in an additional benefit

of a hierarchical approach in analyzing users' intentions.

Our Layered Intention Tags (LIT) are described in Table 1-5 and the

structure is shown in Figure 1-4. Each LIT is composed of four layers. The

discourse act layer signifies the role of the speech unit in a given dialogue‚

which are labeled as “task independent tags”. However‚ some units do not

have a tag at this layer.

Action layer denotes the action taken. Action tags are subdivided into

“task independent tags” and “task dependent tags”. “Confirm” and “Exhibit”

are task independent‚ whereas “Search”‚ “ReSearch”‚ “Guide”‚ “Select” and

“Reserve” are the task dependent ones.

Object layer stands for the objective of a given action including “Shop”‚

“Parking.”

Finally‚ the argument layer denotes other miscellaneous information about

the speech unit. Argument layer is often decided directly from some specific

keywords in a given sentence. As it is shown in Figure 1-4‚ the lower layered

intention tags are explicitly depended on the upper layered ones.

Search WWH ::

Custom Search

Home