Alternative semantics - Foundations of Data Exchange

Database Reference

In-Depth Information

the naıve semantics is that, while it does compute certain answers of unions of conjunctive

queries, it fails to do so for any proper extension of this class of queries (see Section 2.3 ).

Therefore, it is not strange that beyond the class of positive queries, the certain answers

semantics exhibits a counter-intuitive behavior.

Thus, to define a CWA semantics that avoids as many anomalies as possible, we need

to keep in mind that target instances in data exchange are tables with nulls. In particular,

we need to make use of the techniques that have been developed for handling this kind of

tables, and that were introduced in Section 2.3 . Those techniques, however, only apply to

the OWA, and hence we need to adjust them now for the case of the CWA.

Recall that the semantics of an instance T with nulls is given by the set Rep ( T ) of

complete instances that are represented by T . An instance T without nulls belongs to

Rep ( T ) if there is a homomorphism h : T

T . Notice that the previous definition leaves

complete instances represented by T open for adding new facts. The reason is that instances

T in Rep ( T ) may contain tuples that do not belong to the homomorphic image of T in T .

But this certainly does not go well with our goal of having a semantics based on the CWA.

Hence we need to refine the previous notion, and define the set of complete instances

represented by T under the CWA. This is what we do next.

As expected, an instance T with nulls represents, under the CWA, the set Rep CWA ( T )

of complete instances that are homomorphic images of T . Notice that in this way, under

the CWA, we no longer let instances be open for adding new tuples. Moreover, the sizes

of instances in Rep CWA ( T ) do not exceed the size of T itself (contrary to Rep ( T ),which

contains instances of arbitrarily large sizes).

In order to evaluate a query Q over an incomplete instance T , the standard approach is

to compute the set:

→

Q ( T )=

{

Q ( D )

∈

Rep CWA ( T )

}

This set is usually called the certain answers of Q with respect to T under the CWA in the

incomplete information literature, but we prefer to represent it by

Q ( T ) here, in order

to avoid confusion with the certain answers as defined for data exchange. Note that in

general this set may be different from {

∈

}

;however,if Q is a union of

conjunctive queries, it does not matter whether Rep or Rep CWA is used.

We now define the notion of query answering under the CWA.

Q ( D )

Rep ( T )

Definition 8.14 (CWA certain answers) Let

Σ st ) be a mapping, Q a query

over R t ,and S a source instance. Then the CWA certain answers of Q with respect to S

under

=(R s , R t ,

, denoted by certain CWA

( Q , S ), is the set of tuples that would be in Q ( D ) for every

CWA solution T and every D

∈

Rep CWA ( T ).Thatis:

( Q , S )=

certain CWA

S OL CWA

{

Q ( T )

∈

( S )

}

We illustrate the definition with an example.

Example 8.15 ( Example 8.11 continued) We use again the source instance S =

{

P ( a , b )

}

Foundations of Data Exchange

Search WWH ::

Custom Search

Home