Querying Multimedia Data by Similarity in Relational DBMS - Advanced Database Query Systems

Database Reference

In-Depth Information

Although this approach is suitable to specific applications, the amount of operators needed to repre-

sent queries can be excessively high. The operators employed to identify objects should be specialized

to each application domain to acquire higher semantics, yielding a variety of functions to represent each

(class of) object(s). Moreover, although there are works aimed at retrieving images by spatial similarity

between segmented regions, such as (Huang et al., 2008, Yeh and Chang, 2008), queries are usually

represented using a sketch or an example of the desired spatial relationships, because representing the

association among various objects can be very difficult.

The representation of more complex classes of queries is even harder using content-based operators.

Queries Q3, Q4 and Q5 would require defining respectively: what determines a video to be copyrighted,

what is the visual pattern that represents the act of playing, and what a drum solo is. It is easy to notice

that this approach would lead to an explosion on the amount of operators to meet every situation and

the variety of relationships among complex elements and among their components.

The Similarity Operators Approach

We define the Similarity Operators approach as the one that provides a few similarity-based operators

to be employed in a wide range of situations. These operators have the goal of computing the similarity

regarding a predefined pattern between the stored data and the (set of) element(s) provided in the query,

verifying when this similarity satisfies a given criteria.

The support for the Similarity Operators approach is twofold: providing an example of the desired

pattern is generally more feasible than trying to describe it; and providing a restricted set of operators

is sufficient to represent queries over complex data, because the peculiarity of each domain is carried

by its definition of similarity.

The first supporting idea is derived from the query-by-example strategy that has been employed in

databases for long time. Most of the existing content-based multimedia retrieval tools use this method.

The second idea aims at simplifying the overall query answering process to allow developing a consistent

retrieval environment. This simplification can be done without losing the generality of the approach

because the definition of similarity is problem-dependent . For instance, a good similarity measure to

answer Q2 would focus on the color distribution pattern of the images. This measure would probably

produce poor results if applied to Q1, because what identifies a mass in a radiograph is usually the

difference between the mass and its surrounding texture tissue, which is not adequately captured by

a standard color-based evaluation, thus requiring a texture extractor. The domain knowledge of each

problem class is embedded into the similarity evaluation process, which must be specialized according

to the requirements of each case. This allows stating specialized query conditions to a broad range of

domains employing few query operators.

The reader may ask whether we are not just transferring the aforementioned problem of the explosion

on the amount of operators for multimedia data querying to the explosion of similarity measures. In part

this observation is correct. Nevertheless, maintaining a reduced set of similarity operators allows devel-

oping a common framework with well-defined query optimization alternatives and with basic primitives

and structures that can be applied to a variety of situations. Providing such common framework is not

feasible using the Content-based Operators approach.

Therefore, this chapter adopts the Similarity Operators approach for representing queries. The next

section discusses similarity evaluation regarding multimedia data.

Search WWH ::

Custom Search

Home