Information Technology Reference
In-Depth Information
5.2.1. Syntactic analyses
Syntactic analysis consists of highlighting and representing the structure
of a sentence in computational data, with its constituents - the verb, the
subject, the direct object complement, etc. To this end, it requires a list of
language words with, in each entry, the category (verb and noun) and
morphological properties (gender, number and person). The latter helps it
manage the morphology at the same time as the syntactic analysis is running.
Thanks to the work of R. Montague [MUS 96], automatic understanding,
especially for written dialogue, took the path that consists of carrying out a
syntactic analysis of the sentence before a semantic analysis, which will itself
lead to the determination of the logical form or forms describing the possible
meanings. A pragmatic analysis that takes into account the contextual aspects
then enriches this semantic representation to achieve a propositional form, the
result of all the automatic understanding processes.
To process spontaneous oral dialogue, this process, which goes through a
global syntactic analysis of the sentence, is not always adapted: as we have
seen earlier, a dialogue utterance can be an incomplete sentence, and other
mechanisms have to be implemented for the system not to crash due to a
syntactic analysis which is impossible to run. We then have to implement
syntactic analyzers able to create partial analyses, i.e. able to manage the
underspecification of an actant, for example, or local analyses, which
generate a result with the available data even if it is incomplete.
Whether it is a global or a local or partial analysis, or even a macrosyntactic
analysis, they all supplement each other so as to use the input data as best they
can. The local analysis principle notably allows us to temper the importance
of syntax in the understanding process and to highlight the semantic analysis,
which, in a way, leads to the operations and requires local syntactic analyses
when necessary. This type of mechanism is more adapted to the characteristics
of oral language, such as distortion or fragmentation phenomena, or simply
the frequent presence of questions, a type of sentence that does not appear
very often in written texts, and is thus not well taken into account by analyzers
who are designed and trained on written texts. It can also turn out to be more
robust to potential errors in speech recognition. This is especially the case in
closed domain MMD: for a train reservation task, we almost know in advance
all the elements that a user query can cover, and these elements can thus help
the analysis according to a top-down approach and supplement the bottom-up
approach started with speech recognition.
Search WWH ::




Custom Search