Chapter 8
ChunkSim: A Tool and Analysis of Performance and Availability Balancing
Pedro Furtado
University of Coimbra, Portugal
ABSTRACT
Self-tuning physical database organization involves tools that automatically determine the best solution for partitioning, placement, and the creation and tuning of auxiliary structures (e.g., indexes), based on the workload. To the best of our knowledge, no tool has focused on an issue that is relevant in parallel databases, and in particular in data warehouses running on common off-the-shelf hardware in a shared-nothing configuration: determining an adequate tradeoff between load and availability balancing on one side and costs (storage and loading costs) on the other. In previous work, we argued that effective load and availability balancing over partitioned datasets can be obtained through chunk-wise placement and replication, together with on-demand processing. In this work, we propose ChunkSim, a simulator for system size planning, performance analysis against replication degree, and availability analysis. We apply the tool to illustrate the kind of results it can produce. The discussion in this chapter provides insight into data allocation and query processing over shared-nothing data warehouses, and into how a simulation analysis tool can be built to predict and analyze actual systems and intended deployments.
INTRODUCTION
Data warehouses may range from a few megabytes to huge giga- or terabyte repositories, so they must have efficient physical database design and processing solutions to allow for efficient operation. Physical database design concerns the layout of data and auxiliary structures on the database server and, together with the execution engine and optimizer, has a large influence on the efficiency of the system. This is especially relevant in parallel architectures that are set up to handle huge data sets efficiently. The data sets are partitioned into nodes and processed in a parallel fashion in order to decrease the processing burden and, above all, to allow the system to return fast results for near-to-interactive data exploration.
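To make the ideas of chunk-wise placement, replication, and on-demand processing concrete, the following is a minimal sketch in Python. It is not ChunkSim and does not reproduce the placement or scheduling policies of the chapter, which are not specified in this section. The round-robin placement rule, the unit processing cost per chunk, and all function names (place_chunks, available_chunks, process_on_demand) are assumptions made for illustration only.

```python
import heapq
from collections import defaultdict


def place_chunks(num_chunks, num_nodes, replication):
    """Hypothetical round-robin placement: replica r of chunk c
    is stored on node (c + r) % num_nodes."""
    if not 1 <= replication <= num_nodes:
        raise ValueError("replication must be between 1 and num_nodes")
    placement = defaultdict(set)  # node id -> set of chunk ids it stores
    for chunk in range(num_chunks):
        for r in range(replication):
            placement[(chunk + r) % num_nodes].add(chunk)
    return placement


def available_chunks(placement, failed_nodes=frozenset()):
    """Chunks that still have at least one copy on a surviving node."""
    alive = set()
    for node, chunks in placement.items():
        if node not in failed_nodes:
            alive |= chunks
    return alive


def process_on_demand(placement, speeds, failed_nodes=frozenset()):
    """Assumed on-demand policy: whenever a node becomes idle, it takes
    the lowest-numbered unprocessed chunk that it stores locally.
    Each chunk costs 1 / speeds[node] time units on that node.
    Returns (makespan, chunks processed per node)."""
    pending = available_chunks(placement, failed_nodes)
    idle = [(0.0, n) for n in placement if n not in failed_nodes]
    heapq.heapify(idle)  # ordered by the time each node becomes free
    done = defaultdict(int)
    makespan = 0.0
    while idle and pending:
        t, node = heapq.heappop(idle)
        local = placement[node] & pending
        if not local:
            continue  # this node holds no remaining work; it stays idle
        chunk = min(local)
        pending.remove(chunk)
        finish = t + 1.0 / speeds[node]
        done[node] += 1
        makespan = max(makespan, finish)
        heapq.heappush(idle, (finish, node))
    return makespan, dict(done)


if __name__ == "__main__":
    speeds = {n: 1.0 for n in range(4)}  # toy setup: four equally fast nodes
    for degree in (1, 2):
        layout = place_chunks(8, 4, degree)
        print(degree, len(available_chunks(layout, {0})),
              process_on_demand(layout, speeds, {0}))
```

In this toy setup with eight chunks on four nodes, a replication degree of 1 leaves only six chunks readable when node 0 fails, while a degree of 2 keeps all eight available and lets the surviving nodes absorb the failed node's share of the work. The price is twice the storage and loading cost, which is the kind of tradeoff the abstract describes.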