Database Reference
In-Depth Information
which is avoided if only value updates are allowed. On the other hand, a repairing
strategy using tuple insertions suffers from the problem that often there is no reason-
able way to guess the values which should be assigned to the non-measure attributes
in the tuples to be inserted.
This topic provides a study of the problem of extracting reliable information from
databases violating a set of aggregate constraints in a setting where an attribute-level
repairing strategy is adopted. Before presenting the organization of the topic, a brief
overview on the main issues and results in the area of inconsistent databases is
provided.
1.3 Overview of Inconsistency Handling in Databases
Although first theoretical approaches to the problem of dealing with incomplete and
inconsistent information date back to 80s, these works mainly focus on issues re-
lated to the semantics of incompleteness [37]. The semantics of queries posed on
inconsistent integrated databases was first investigated in [3], where an extension of
relational algebra (namely flexible algebra ) was proposed to evaluate queries on data
inconsistent w.r.t. key constraints. An extension of the the flexible relational model,
called integrated relational model , was proposed in [19] where it was argued that
the semantics of integrating inconsistent data is captured by the maximal consis-
tent subset (with null values) of the integrated data. In [42] a different semantics
(namely, merging by majority rule ) based on the cardinality of the source databases
containing the same tuples was introduced in order to manage data inconsistent w.r.t.
first-order integrity constraints.
The first proof-theoretic notion of consistent query answer was introduced in [14],
expressing the idea that tuples involved in an integrity violation should not be con-
sidered in the evaluation of consistent query answers. In [4] a different notion of
consistent answer was introduced, based on the notion of repair : a repair of an in-
consistent database D is a database D , on the same scheme as D , satisfying the
given integrity constraints and which is minimally different from D . Thus, the con-
sistent answer of a query q posed on D is the answer which is in every result of q
posed on each repair D . In particular, in [4] the authors show that, for quantifier-free
conjunctive queries and binary universal constraints, consistent answers can be eval-
uated without computing repairs, but by looking only at the specified constraints and
rewriting the original query q into a query q such that the answer of q on D is equal
to the consistent answer of q on D . The query-rewriting technique introduced in [4]
was further developed in [16] where an implementation of the rewriting operator is
also presented. Later the query-rewriting technique was extended in [31, 29] to work
for a subclass of conjunctive queries with existential quantification in the presence
of key constraints. Recently, the this technique was further studied in [51, 52].
Starting from the notions of repair and consistent query answer introduced in [4],
the problems of repairing and querying inconsistent databases were investigated in
several works where more expressive classes of queries and constraints were con-
Search WWH ::




Custom Search