Enhancing the Capabilities of Attribute Correspondences - Schema Matching and Mapping

Databases Reference

In-Depth Information

The monotonicity principle refers to the performance of complete schema match-

ings from which the performance of individual attribute correspondences can be

derived. Assume that out of the n n 0 attribute matchings, there are c n n 0 cor-

rect attribute matchings, with respect to the exact matching. Also, let t c be the

number of matchings, out of the correct matchings, that were chosen by the match-

ing algorithm and f n n 0 c be the number of incorrect attribute matchings.

Then, precision is computed to be

t

t C f

and recall is computed as c

. Clearly, higher

values of both precision and recall are desired. From now on, we shall focus on the

precision measure, where p./ denotes the precision of a schema matching .

We first create equivalence schema matching classes on 2 S . Two matchings 0

and 00 belong to a class p if p. 0 / D p. 00 / D p,wherep 2 Œ0; 1. For each

two matchings 0 and 00 , such that p. 0 /<p. 00 /, we can compute their schema

matching level of certainty, ˝. 0 / and ˝. 00 /. We say that a matching algorithm is

monotonic if for any two such matchings p. 0 /<p. 00 / ! ˝. 0 /<˝. 00 /. Intu-

itively, a matching algorithm is monotonic if it ranks all possible schema matchings

according to their precision level.

A monotonic matching algorithm easily identifies the exact matching. Let be

the exact matching, then p. / D 1. For any other matching 0 , p. 0 /<p. /.

Therefore, if p. 0 /<p. /, then from monotonicity ˝. 0 /<˝. /. All one has

to do then is to devise a method for finding a matching that maximizes ˝. 1

Figure 3.2 provides an illustration of the monotonicity principle using a matching

of a simplified version of two Web forms. Both schemata have nine attributes, all of

which are matched under the exact matching. Given a set of matchings, each value

Fig. 3.2

Illustration of the monotonicity principle

1 In Gal et al. [ 2005a ], where the monotonicity principle was originally introduced, it was shown

that while such a method works well for fuzzy aggregators (e.g., weighted average) it does not

work for t-norms such as min.

Search WWH ::

Custom Search

Home