View Management Techniques and Their Application to Data Stream Management - Evolving Application Domains of Data Warehousing and Mining

Database Reference

In-Depth Information

concluSIon And outlook

family of stateful operators are aggregation func-

tions.Arasu and Widom (2004) present techniques

for efficient resource sharing in computation of

aggregation functions. However, here we concen-

trate on sharing of join processing.

Usually in query planning, selections are

pushed down under joins so as to reduce the

amount of tuples that must be processed by the

join operator. In multi-query processing, it may

instead be useful to do the opposite, that is, to pull

up selection predicates over joins so as to reuse

join results for multiple queries with different

selection predicates. Obviously, it is best to pull

down as many selection predicates as possible.

In a DSMS, however, due to windowing it is

not uncommon that simply pulling up the other

predicates produces highly memory- and CPU-

inefficient query plans. Reusing without adapta-

tion view selection or multi-query optimization

algorithms means to assume all window definitions

are identical. However, window definition may

only intersect or even be disjoint. In that case the

supposedly common subexpressions may not be

common at all.

To overcome this problem, Wang et al. (2006)

present a way to push down every selection

predicate as far down as possible by cutting

the windows into pieces and placing selection

operators in between. The windows of a join

operator's input streams are cut into slices. The

slices are handled by distinct join operators that

are pipelined such that each operator contributes

join results to the queries whose windows end with

the corresponding slice. Tuples that do not match

the window slice are pipelined to the subsequent

join operator. Two algorithms are presented, one

to produce a memory-optimal and one to produce

a CPU-optimal query plan for the join.

In fact, both, multi-query optimization or view

selection as well as efficient handling of window

definitions are required to produce efficient

joint query plans for a workload of continuous

queries.

In this chapter we presented a comprehensive

survey on view management and data stream

management systems. View maintenance and

view selection are of particular importance in

data warehouse systems, since decision support

requires complex analytical queries. Whereas

view maintenance addresses efficient methods for

updating views if their base relations are changed,

view selection is the problem of choosing a set of

views to be materialized in order to minimize a

cost function. View selection must carefully trade

off query response time, storage space, and view

maintenance costs.

Data stream management systems deal with

new query planning and processing requirements

in systems generating tremendous amounts of

data that can only be processed once due to high

arrival rates of tuples. Typical applications again

need to pose analytical queries over these streams.

There are special challenges to be solved, such

as dealing with the new data models containing

sliding windows or efficiently creating synopses of

data. Other issues are related to existing problems.

Continuous query planning is highly related to

multi-query optimization and view selection as the

high performance requirements suggest to share

computation and resources in query processing.

On the other hand, continuous query processing

is related to view maintenance.

The relatively new field of stream data man-

agement offers many opportunities for research.

Resource sharing through joint query planning

needs further work. In particular adaptive query

planning for continuous queries is a requirement

since the statistics underlying query plans may

change during execution. As we have seen in the

context of the state-slice operator, query plans

must also deal with the new issues arising from

window models.

View maintenance and processing of continu-

ous queries in data streams can both benefit from

Evolving Application Domains of Data Warehousing and Mining

Search WWH ::

Custom Search

Home