Database Reference
In-Depth Information
Symbol Meaning
t
I
Tuple of stream I
t
I
.ts
Timestamp of tuple t
I
λ
i
Arrival rate of stream I
|
S
I
|
Window size of state I
σ
i
Selectivity of join on state S
I
λ
i
Probe allowance of stream I, (
≤λ
i
).
Fig. 3.
List of notation
r = 750
/sec
2
σ
σ
2 = 0.005
|S
C
|
= 1000
2 = 0.005
|S
C
|
= 1000
|S
AB
|
= 200
|S
AB
|
= 200
2
2
λ
c
'
= 0
λ
ab
'
= 150
λ
ab =
r
λ
c=600
σ
σ
1
λ
c =600
A= 0.001
B= 0.005
|S
B
|
= 500
r = 750
/sec
1
σ
1 = 0.001
|S
B
|
= 500
|S
A
|
= 5000
σ
1 = 0.001
|S
B
|
= 500
|S
A
|
= 5000
|S
A
|
= 5000
1
1
λ
λ
a = 500
b =700
λ
a
'
= 0
λ
b
'
= 150
λ
λ
a = 500
b =700
(a) Single Join
(b) 2-Join Plan
λ
λ
a= 500
b =700
Fig. 4.
Join plans with parameter settings
Fig. 5.
JDA over a 2-join plan
cost model
proposed by Kang et al. [10] that computes the join cost in terms of the
three sub-tasks, namely,
probe
,
insert
and
purge
operations. For simplicity, the model
assumes
count-based
windows. The key idea is that
the cost of probe dominates the
total join cost while insert and purge operations are relatively inexpensive
.For
details on the model and its extension to
time-based
windows refer to [10]. Fig. 3 lists
the notation.
Throughput.
The run-time throughput (Eq. 1) of a join operator A
B (Fig. 4.a) con-
sists of two contributing half-way join components, namely, r
=(a
S
B
)andr
=(b
S
A
). Throughput is also called the
output rate
and is defined as
the number of joined
tuples produced per time unit
. For tuple t
A
,S
A
is the
own
state whereas S
B
is the
partner
state.
r
=
r
+
r
=
λ
×σ
×|
S
B
|+λ
×σ
×|
S
A
|
(1)
a
B
b
A
CPU Limitation in a Join.
When CPU is limited, the throughput of A
B can be re-
written as in Eq. 2. The total
available computing resources
, denoted as
,maybe
determined from the system. Terms
available resources
and
service rate
are used inter-
changeably.
μ
λ
a
×σ
B
×|
S
B
|+λ
b
×σ
A
×|
r
=
r
+
r
=
S
A
|,
λ
a
+λ
b
≤μ.
(2)
In Eq. 2, the
resources allocated to a join is divided between the two half-way joins.
Stream A is assigned a
probe allowance
, denoted by
μ
λ
a
, which is a portion
μ
a
of
μ
λ
a
= min(
λ
b
not exceeding the input rate
a
),
λ
b
). As the
probe
cost dominates the total join cost, the resource restriction only affects
λ
a
, i.e.,
μ
a
,
λ
a
). Similarly,
= min((
μ
-
μ