Querying Multimedia Data by Similarity in Relational DBMS - Advanced Database Query Systems

Database Reference

In-Depth Information

Figure 1. The basis of a similarity space instance - a feature vector and a distance function - and the

fields and techniques employed to drive its semantics

A Look on the Feature Vector Component

Features describing multimedia data can be classified in three abstraction levels, according to the kinds

of queries that could be answered (Eakins and Graham, 1999).

•

Level 1. Comprises retrieval by primitive features directly obtained from the multimedia object,

without referring to external knowledge. Examples of such features for still images are general

patterns regarding color, texture, shape or location of elements in the scene. Features describing

videos include these elements plus motion patterns.

•

Level 2. Comprises retrieval by derived attributes, identified through some degree of logical infer-

ence about the data content. Features capable of identify elements (e.g. a person or an object) and/

or simple movements (e.g. an airplane taking off) are included in this level.

•

Level 3. Comprises retrieval by abstract features, involving a significant amount of high-level

reasoning. Features in this level can represent more elaborated actions, such as dancing pictures

or videos of children playing with pets.

The abstraction levels present increasing complexity and a tendency to narrow the domain. The lack

of coincidence between the low-level features automatically extracted from the multimedia data (Level

1) and the high-level human interpretation (based on features of levels 2 and 3) is known in the litera-

ture as the semantic gap (Smeulders et al., 2000). Many research efforts have been directed to extract

higher-semantic features in order to bridge the gap.

Feature extraction is the first step on which the multimedia data is processed generating a feature

vector. The algorithms that identify the features that aim at inferring the data content are called feature

extractors. There are many feature extractors for multimedia data proposed in the literature. Images are

usually represented through their color, texture and shape patterns extracted either globally in the im-

Search WWH ::

Custom Search

Home