Data Warehousing and Online Analytical Processing - Data Mining: Concepts and Techniques

Databases Reference

In-Depth Information

dimensions in the query, (2) the selection clause in the query can imply the selection in

the cuboid, and (3) the abstraction levels for the item and location dimensions in these

cuboids are at a finer level than brand and province or state , respectively.

“How would the costs of each cuboid compare if used to process the query?” It is likely

that using cuboid 1 would cost the most because both item name and city are at a lower

level than the brand and province or state concepts specified in the query. If there are

not many year values associated with items in the cube, but there are several item names

for each brand , then cuboid 3 will be smaller than cuboid 4, and thus cuboid 3 should

be chosen to process the query. However, if efficient indices are available for cuboid 4,

then cuboid 4 may be a better choice. Therefore, some cost-based estimation is required

to decide which set of cuboids should be selected for query processing.

4.4.4 OLAP Server Architectures: ROLAP versus MOLAP

versus HOLAP

Logically, OLAP servers present business users with multidimensional data from data

warehouses or data marts, without concerns regarding how or where the data are stored.

However, the physical architecture and implementation of OLAP servers must consider

data storage issues. Implementations of a warehouse server for OLAP processing include

the following:

Relational OLAP (ROLAP) servers: These are the intermediate servers that stand in

between a relational back-end server and client front-end tools. They use a rela-

tional or extended-relational DBMS to store and manage warehouse data, and OLAP

middleware to support missing pieces. ROLAP servers include optimization for

each DBMS back end, implementation of aggregation navigation logic, and addi-

tional tools and services. ROLAP technology tends to have greater scalability than

MOLAP technology. The DSS server of Microstrategy, for example, adopts the

ROLAP approach.

Multidimensional OLAP (MOLAP) servers: These servers support multidimensional

data views through array-based multidimensional storage engines . They map multi-

dimensional views directly to data cube array structures. The advantage of using a

data cube is that it allows fast indexing to precomputed summarized data. Notice

that with multidimensional data stores, the storage utilization may be low if the data

set is sparse. In such cases, sparse matrix compression techniques should be explored

(Chapter 5).

Many MOLAP servers adopt a two-level storage representation to handle dense

and sparse data sets: Denser subcubes are identified and stored as array struc-

tures, whereas sparse subcubes employ compression technology for efficient storage

utilization.

Hybrid OLAP (HOLAP) servers: The hybrid OLAP approach combines ROLAP and

MOLAP technology, benefiting from the greater scalability of ROLAP and the faster

computation of MOLAP. For example, a HOLAP server may allow large volumes

Search WWH ::

Custom Search

Home