Practical Approaches to the Many-Answer Problem - Advanced Database Query Systems

Database Reference

In-Depth Information

Table 10.

1

z 00

z 01

z 1000

z 101

z 11

2

z 00 + z 01

z 1000 + z 101

z 11

3

z 00 + z 01

z 1000 + z 101 + z 11

4

z 00 + z 01 +z 1000 + z 101 + z 11

Algorithm 2 describes ESRA . It is a modified version of ESA (Algorithm 1) with the following new

assumptions:

• it returns a summary ( z* ) rather than a list of summaries;

• function AddChild appends a node to caller's children;

• function NumberofChildren returns the number of caller's children;

• function uniqueChild returns the unique child of the caller;

• function BuildIntent builds caller's intent (hyperrectangles) from intents of its children;

• Z res and Z′ are local variables of type summary.

The ESRA cost is only a small constant factor γ larger than that of ESA . In fact, the rearranging pro-

cess is done at the same time the queried summary hierarchy is being scanned. It means that no post-

processing task (but the query evaluation itself) have to be performed at query time. More precisely, the

time complexity of the ESRA Algorithm is in the same order of magnitude (i.e., O(L) ) than the ESA

Algorithm:

L

d

−

1

T

ESRA = + ⋅

γ

ε

∈

O L

where the coefficient γ is the time cost for the additional operations (addChild, uniqueChild and Build-

Intent). L is the number of leaves (cells) of the queried summary hierarchy and d its average width.

3.5 Discussion about Extension to any Fuzzy Predicate

A limitation of our proposal relies in the fact that ESRA restrains the user to queries using a controlled

vocabulary (i.e., the summary vocabulary). Indeed, consider a user looking for cheap houses. What is

meant by cheap can vary from one user to another one, or from one kind of users to another one (e.g.,

cheap does not have the same meaning for house buyers and for apartment tenants). Regarding the

query cost and accuracy, the ideal solution consists in building a summary hierarchy for every user such

that queries and summaries share the same vocabulary. Thus, ESRA can be used in its current state. But

maintaining user-specific summaries is hardly conceivable in a system with multiple users. Two alterna-

tives are then envisaged.

The first one is to consider group profiles in order to reduce the number of managed summaries

(e.g., one for buyers and the other for tenants). In this case, ESRA can also be used directly since us-

ers formulate queries within the vocabulary of their groups. In this approach, users would subscribe to

Advanced Database Query Systems

Search WWH ::

Custom Search

Home