Memory-Hierarchy Tradeoffs - Models of Computation: Exploring the Power of Computing

Information Technology Reference

In-Depth Information

The upper bound follows from Theorem 11.5.5 . We advance level- L pebbles to the

outputs of each of the two bottom FFT graphs F ( 2 d ) in Fig. 11.11 and then pebble the top

FFT graph. The number of I/O and computation steps used is triple that used to pebble

one such FFT graph. In addition, we perform O ( n ) I/O and computation steps to combine

inputs to the top FFT graph.

The bounds for the I/O-limited version of the MHG for the convolution problem are

considerably larger than those for the standard MHG. They have a much stronger dependence

on S and n than do those for the FFT graph.

THEOREM 11.5.8 Let H ( n )

convolve be the graph of any DAG for the convolution of two n -tuples

using the convolution theorem, n = 2 d .L t H ( n )

convolve be pebbled in the I/O-limited MHG

with the resource vector p and let s l = j = 1 p j .If S = s L− 1 ≤

n , then the time to pebble

H ( n )

convolve at the l th level, T ( L )

( p , H ( n )

convolve ) , satisfies the following lower bounds simultaneously

≤

L :

for 1

convolve )=Ω n 3

S 2

T ( L )

( p , H ( n )

when S

n/ log n .

Proof A lower bound is derived for this problem by considering a generalization of the

graph shown in Fig. 11.11 in which the three copies of the FFT graph F ( 2 d ) are replaced by

an arbitrary DAG for the DFT. This could in principle yield in a smaller lower bound on the

time to pebble the graph. We then invoke Lemma 11.3.2 to show that a lower bound can

be derived from a reduction of this new graph, namely, that consisting of two back-to-back

DFT graphs obtained by deleting one of the bottom FFT graphs. We then derive a lower

bound on the time to pebble this graph with the red pebble game and use it together with

Theorem 11.3.1 to derive the lower bounds mentioned above.

Consider pebbling two back-to-back DAGs for the DFT on n inputs, n even, in the red

pebble game. From Lemma 10.5.4 ,the n -point DFT function is ( 2, n , n , n/ 2 ) -indepen-

dent. From the definition of the independence property (see Definition 10.4.2 ), we know

that during a time interval in which 2 ( S + 1 ) of the n outputs of the second DFT DAG

on n -inputs are pebbled, at least n/ 2

≤

2 ( S + 1 ) of its inputs are pebbled. In a back-to-

back DFT graph these inputs are also outputs of the first DFT graph. It follows that for

each group of 2 ( S + 1 ) of these n/ 2

−

2 ( S + 1 ) outputs of the first DFT DAG, at least

n/ 2

2 ( S + 1 ) of its inputs are pebbled. Thus, to pebble a group of 2 ( S + 1 ) outputs

of the second FFT DAG (of which there are at least

−

n/ ( 2 ( S + 1 ))

groups), at least

( n/ 2

−

2 ( S + 1 )) / 2 ( S + 1 )

( n/ 2

−

2 ( S + 1 )) inputs of the first DFT must be pe bb led.

n/ 4 √ 2and

Thus, T ( L )

( p , H ( n )

n 3 / ( 64 ( S + 1 ) 2 ) , since it holds both when S

convolve )

≥

≤

when S>n/ 4 √ 2.

Let's now consider a pebbling strategy that achieves this lower bound up to a multiplica-

tive constant. The pebbling strategy of Theorem 11.5.5 can be used for this problem. It

represents the FFT graph F ( d ) as a set of FFT graphs F ( e ) on top and a set of FFT graphs

F ( d−e ) on the bottom. Outputs of one copy of F ( e ) are pebbled from left to right. This

requires pebbling inputs of F ( d ) from left to right once. To pebble all outputs of F ( d ) ,2 d−e

copies of F ( e ) are pebbled and the 2 d

inputs to F ( d ) are pebbled 2 d−e times.

Models of Computation: Exploring the Power of Computing

Search WWH ::

Custom Search

Home