Schema Mapping Evolution Through Composition and Inversion - Schema Matching and Mapping

Databases Reference

In-Depth Information

data (without loss) via the chase. In general, this recovery is up to homomorphic

equivalence due to the presence of nulls; in the ideal case when the original source

instance is recovered exactly, we call the chase-inverse an exact chase-inverse. The

second type of operational inverses, which we call relaxed chase-inverses , 1 are

relaxations of chase-inverses that work in situations where there is information loss

and, hence, chase-inverses do not exist. Intuitively, a relaxed chase-inverse recovers

the original source data as well.

In this chapter, we use various concrete examples of schema evolution to illus-

trate the main developments and challenges behind composition and inversion and

their applications to schema evolution. We note that we are focused here on com-

position and inversion; a companion topic chapter [ Hartung et al. 2011 ] will give a

separate overview of the schema evolution area in general. In our survey, we illus-

trate the concept of composition, and then discuss the two flavors of operational

inverses mentioned above. At the same time, we discuss the languages in which

such composition and inversion can be expressed. In the context of the schema evo-

lution scenarios that we consider, these languages vary in complexity from GAV

schema mappings to LAV and GLAV schema mappings (the latter are also known as

source-to-target tuple-generating dependencies, or s-t tgds [ Fagin et al. 2005a ]) and

then to mappings specified by second-order (SO) tgds [ Fagin et al. 2005b ]. During

the exposition, we will proceed from simpler, easier scenarios of schema evolu-

tion to more challenging scenarios, and illustrate how composition and inversion

techniques can be put together into a framework that deals with schema evolution

problems.

In a separate section, we examine in detail two systems that implement one or

both of the above schema mapping operators to deal with aspects of schema evo-

lution. The first one is an implementation of mapping composition [ Yu and Popa

2005 ] that is part of the Clio system is based on the SO tgds introduced in Fagin et al.

[ 2005b ] and is specifically targeted at the problem of mapping adaptation in the con-

text of schema evolution. The second system is the PRISM workbench [ Curino et al.

2008 ] for query migration in the presence of schema evolution. This system is based

on query rewriting under constraints and in particular on the chase and backchase

framework [ Deutsch et al. 1999 ]. However, before it can apply such query rewriting,

the PRISM system needs to implement both mapping composition and inversion.

The notion chosen here for inversion is based on quasi-inverses [ Fagin et al. 2008b ].

We end the paper with a discussion of the main open research questions that

still remain to be solved. Perhaps the most important open issue here is to find a

unifying schema-mapping language that is closed under both composition and the

various flavors of inverses, and, additionally, has good algorithmic properties.

1 These were introduced in Fagin et al. [ 2009b ] under a different name: universal-faithful inverses .

However, the term relaxed chase-inverses , which we use in this paper, is a more suggestive term

that also reflects the relationship with the chase-inverses.

Search WWH ::

Custom Search

Home