Pattern-Based Schema Mapping and Query Answering in Peer-to-Peer XML Data Integration System - Advanced Database Query Systems

Database Reference

In-Depth Information

Figure 2. Two XML trees as equivalent instances of S 1 ; J 2 is a canonical instance, whereas J 1 is not

•

( I , r ) ≤ (/ top [ E ], ω), iff ∧ n ∧ N e ( child ( r, n ) ∧ ( I , n ) ≤ ( top [ E ], ω));

•

( I , n ) ≤ ( l [ E 1 , ..., E k ], ω), iff λ( n ) = l and ∧ n 1 , ..., n k ∧ N e ( child ( n, n 1 ) ∧ ( I , n 1 ) ≤ ( E 1 , ω) ∧ ... ∧

child ( n, n k ) ∧ ( I , n k ) ≤ ( E k , ω));

•

( I , n ) ≤ ( l / E , ω), iff λ( n ) = l ∧ ∧ n' ∧ N e ( child ( n, n' ) ∧ ( I , n' ) ≤ ( E , ω));

•

( I , n ) ≤ ( l = x , ω), iff λ( n ) = l and ∧ n' ∧ N t ( child ( n, n' ) ∧ ν( n' ) = ω( x )).

In fact, a description ( S , Ω) represents a class of instances of S with the same set of valuations Ω,

since elements in instance trees can be grouped and nested in different ways.

For example, both XML trees J 1 and J 2 in Figure 2 conform to the schema S 1 from Example 1, and

satisfy the description ( S 1 , {( XML , 2005 , Ann , LA ), ( XML , 2005 , John , NY )}, although they are organized

in different ways.

By a canonical instance we will understand the instance with the maximal width, i.e. the instance

where subtrees corresponding to valuations are pair-wise disjoint. For example, the instance J 2 in Figure

2 is a canonical instance, whereas J 1 is not since two authors are nested under one publication.

SCHEMA MAPPINGS

Further on in this chapter, we will refer to the running example depicted in Figure 3. There are three XML

schema trees S 1 , S 2 , S 3 , along with their instances I 1 , I 2 , and I 3 , respectively. S 1 is the same as S 1 in Figure

1, and its instance I 1 is empty. The schemas and their instances are on peers P 1 , P 2 , and P 3 , respectively.

The key issue in data integration is this of schema mapping . Schema mapping is a specification

defining how data structured under one schema (the source schema ) is to be transformed into data struc-

tured under another schema (the target schema ). In the theory of relational data exchange, source-to-

Search WWH ::

Custom Search

Home