Databases Reference
In-Depth Information
dimensions in the query, (2) the selection clause in the query can imply the selection in
the cuboid, and (3) the abstraction levels for the item and location dimensions in these
cuboids are at a finer level than brand and province or state , respectively.
“How would the costs of each cuboid compare if used to process the query?” It is likely
that using cuboid 1 would cost the most because both item name and city are at a lower
level than the brand and province or state concepts specified in the query. If there are
not many year values associated with items in the cube, but there are several item names
for each brand , then cuboid 3 will be smaller than cuboid 4, and thus cuboid 3 should
be chosen to process the query. However, if efficient indices are available for cuboid 4,
then cuboid 4 may be a better choice. Therefore, some cost-based estimation is required
to decide which set of cuboids should be selected for query processing.
4.4.4 OLAP Server Architectures: ROLAP versus MOLAP
versus HOLAP
Logically, OLAP servers present business users with multidimensional data from data
warehouses or data marts, without concerns regarding how or where the data are stored.
However, the physical architecture and implementation of OLAP servers must consider
data storage issues. Implementations of a warehouse server for OLAP processing include
the following:
Relational OLAP (ROLAP) servers: These are the intermediate servers that stand in
between a relational back-end server and client front-end tools. They use a rela-
tional or extended-relational DBMS to store and manage warehouse data, and OLAP
middleware to support missing pieces. ROLAP servers include optimization for
each DBMS back end, implementation of aggregation navigation logic, and addi-
tional tools and services. ROLAP technology tends to have greater scalability than
MOLAP technology. The DSS server of Microstrategy, for example, adopts the
ROLAP approach.
Multidimensional OLAP (MOLAP) servers: These servers support multidimensional
data views through array-based multidimensional storage engines . They map multi-
dimensional views directly to data cube array structures. The advantage of using a
data cube is that it allows fast indexing to precomputed summarized data. Notice
that with multidimensional data stores, the storage utilization may be low if the data
set is sparse. In such cases, sparse matrix compression techniques should be explored
(Chapter 5).
Many MOLAP servers adopt a two-level storage representation to handle dense
and sparse data sets: Denser subcubes are identified and stored as array struc-
tures, whereas sparse subcubes employ compression technology for efficient storage
utilization.
Hybrid OLAP (HOLAP) servers: The hybrid OLAP approach combines ROLAP and
MOLAP technology, benefiting from the greater scalability of ROLAP and the faster
computation of MOLAP. For example, a HOLAP server may allow large volumes
 
Search WWH ::




Custom Search