Database Reference
In-Depth Information
REPLICATED_SIZE
specifies space allocation for replicated tables
on each Compute
node
, whereas
DISTRIBUTED_SIZE
specifies space allocation for distributed tables
across the appliance
. That means
StageDB_Example
actually has 16GB of space
allocated: 10GB for replicated tables (10 Compute nodes with 1GB each), 5GB for dis-
tributed tables, and 1GB for the log.
All data is automatically compressed using page-level compression during the
PDW load process. This is not optional, and the amount of compression will vary
greatly from customer to customer and table to table. If you have SQL Server Enter-
prise or Developer Editions, you can execute the statement in
Listing 8-6
to estimate
compression results.
Listing 8-6
.
Code to Estimate Compression Savings
/* Estimate compression ratio */
EXECUTE
sp_estimate_data_compression_savings
'dbo', 'FactSales', NULL, NULL, 'PAGE';
Example results of
sp_estimate_data_compression_savings
are
shown in
Figure 8-5
.
Figure 8-5
.
Example compression savings
You can generally use 2:1 as a rough estimate. With a 2:1 compression ratio, the
5GB of distributed data specified in
Listing 8-5
actually stores 10GB of uncompressed
SQL Server data.
You still need a place in PDW to store the data you're importing. Execute the code
Listing 8-7
.
PDW Code to Create the Destination Database and Table
Search WWH ::
Custom Search