Practical Approaches to the Many-Answer Problem - Advanced Database Query Systems

Database Reference

In-Depth Information

This problem, which is known as empty-answer problem, happens when the user submits a very

restrictive query. A simple way to remedy this problem is to retry a particular query repeatedly with

alternative values of certain conditions until obtaining satisfactory answers from a database. This solu-

tion, however, can be applied only if the user is aware of the close alternatives, otherwise it is infeasible

(especially for users who lack knowledge about the contents of the database they wish to access). Many

techniques are proposed to overcome this problem, namely query relaxation (Section 2.1.1) and similar-

ity based search (Section 2.1.2).

2.1.1 Query Relaxation

Query relaxation aims to modify the failed query to provide the user with some alternative answers or

at least to identify the cause of the failure, rather than just to report the empty result. A database sys-

tem with such capability is also known as a cooperative information system (Gaasterland, Godfrey &

Minker, 1994).

Consider a database EmpDB with information on employees, including their Name, Age, Gender,

Salary, Job and Department, and a query ' get all employees who make less than 15K€ and work in the

R&D department '. Note that this query may fail for two different reasons (Motro, 1986): either no em-

ployee in R&D department makes less than 15K€ or the company does not have an R&D department.

The former is a genuine null answer (i.e., the null answer is appropriate since query fails to match any

data), while the latter is a fake null answer (i.e., it is due to the erroneous presupposition that the com-

pany has an R&D department).

The first system (CO-OPi) i ) with such human behavior was developed by Kaplan (Kaplan, 1983) and

was designed for natural language interaction. The main idea of CO-OP is to follow up a query that

failed with several more general queries (i.e., the query with some of its conditions relaxed). If even

these general queries fail, then the conclusion is that some of the presuppositions of the user who com-

posed the original query are erroneous. If all these general queries succeed, a query that fails produces

a genuine null. Furthermore, assume q′ and q′′ are both relaxations of the failed query q , but q′′ is more

general than q′ . If both succeed, then the partial answer returned by q′ is better (i.e., the best the system

could do to satisfy the initial query q ). If both fail, then the erroneous presupposition indicated by q′′

is stronger. This leads to the conclusion that only Minimal Generalizations that Succeed (MGSs) and

maXimal Generalizations that Fail (XGFs) are significant. Indeed, XGFs provide explanation for the

failure and some assistance for relaxing the query into a non-failing query, whereas MGSs produce

alternative answers to the failing query.

Since then, several systems that adapt CO-OP's techniques to relational databases have been pro-

posed, including SEAVE (Motro, 1986), CoBase (Chu, Yang, Chiang, Minock, Chow & Larson, 1996)

and Godfrey's system (Godfrey, 1997). These systems differ only in the way they perform generaliza-

tions. SEAVE considers all possible generalizations of the query. CoBase uses prior knowledge of the

domain to guide the generalization process. Godfrey's system generalizes the query by only removing

some of its conditions.

SEAVE

Given a query that fails, SEAVEii ii (Motro, 1986) constructs a lattice of its generalized queries and uses

it to find all MGSs/XGFs. The query generalizations are obtained by relaxing to a degree some of the

Search WWH ::

Custom Search

Home