View Management Techniques and Their Application to Data Stream Management - Evolving Application Domains of Data Warehousing and Mining

Database Reference

In-Depth Information

Therefore, many approaches (e.g., (Gupta and

Mumick, 2005; Theodoratos and Sellis, 1997;

Mistry et al., 2001)) aim at optimizing mainte-

nance costs instead of query response. It is also

easy to incorporate both query evaluation costs

and materialized view maintenance costs in the

goal function (cf. Section 4.1).

such as part-of. Modern tuning tools shipped with

commercial database management systems rely on

query optimizers to explore implicit interrelation-

ships between candidate views (Zilio et al., 2004;

Dageville et al., 2004; Agrawal et al., 2004).

Static view Selection

Candidate Views

Most of the view selection techniques follow the

paradigm of static view selection (or data ware-

house configuration ) (Theodoratos and Sellis,

1997), which selects views from a given input

candidate view set under storage and/or mainte-

nance constraints. The materialized views, once

determined offline, will not change over time.

Therefore, this line of work is good for cases where

the queries are relatively fixed or similar. When

the query patterns of the users change dramati-

cally, view selection has to be redone.

In the seminal work of Harinarayan et al.

(1996), both the query workload and the candidate

views are taken from the nodes in the data cube,

which are various group-bys for aggregation.

The interrelationship between candidates is then

represented by a lattice. For example, the database

in Example 1 gives rise to the lattice depicted in

Figure 2.

The edges in the lattice depict dependency

relationships. One view is dependent on another,

In general, there are prohibitively many views that

can be materialized. Even restricted to possible

combinations of selections and group-bys, the

number of candidate views is already exponen-

tial in terms of the size of the queries. Therefore,

the scope of candidate views has to be confined

to allow feasible tools. Most of the static view

selection techniques (cf. 4.3) restrict themselves

to a given set of candidate views. For example,

a natural scope is defined by all the aggregation

possibilities in the data cube (Gray et al., 1997),

which can be modeled as nodes in a lattice. Dy-

namic view selection approaches (cf. section 4.4)

make use of a windowed query history of users as

a candidate domain. Another type of candidates

are the views that are common subexpressions or

ancestors of queries. They are not optimal for a

single query, but they can benefit more than one

query and hence be a globally optimal choice.

Moreover, indexes can be also deemed as a special

type of views.

Figure 2. The lattice for the TPC-D database

Interrelationship Modeling

Candidate views have interactions. Selection of

one view may render materialization of another

view useless, while several views may have over-

lapping information and therefore duplications.

Dependencies between aggregations in a data

cube are modeled using a lattice. Analytic queries

involving selection and aggregation can be mod-

eled using hyperplanes covering. Chunks in view

selection approaches for MOLAP (Zhao et al.,

1997) usually have more intricate relationships

Evolving Application Domains of Data Warehousing and Mining

Search WWH ::

Custom Search

Home