Information Technology Reference
In-Depth Information
•
The overall failure probability:
FP
i
.
the task finish can reduce the overhead due to
redundant task dispatch.
When a worker
j
is found to be offline by the
periodical available status check,
RR
i
of this task
is decreased by one. The worker ID is removed
from
workerID
i
[
RR
i
]. Finally, the overall failure
probability of the task is updated.
The overall failure probability is calculated as:
RR
i
FP
=
EFPs k
[
].
(2)
i
i
k
=
1
Similar to the original workflow management
mechanism, the enhanced workflow management
mechanism uses the status information to analyze
whether an ``undispatched'' task can be dispatched
or not. An ``undispatched'' task can only be dis-
patched when all the tasks that it depends on are
``finished.'' A workflow has two kinds of status:
``blocked'' and ``unblocked.'' While there is no
such ``undispatched'' task, the workflow manage-
ment mechanism uses the redundant task dispatch
to reduce the performance degradation. Such status
of a workflow is defined as ``blocked.''
The initial workflow information of each task
i
is as follows:
The Task Selection and
Dispatch Policies
While the workflow management mechanism
controls the process of a job workflow, it requires
policies to select the tasks for dispatch, and find
the task-to-worker assignment when the task
enquiries come.
Task Selection Policy
In our previous work (Wang, 2007), the
least-
RR-selected
policy has been proposed to equally
reduce the failure rate of all the ``dispatched''
tasks. It selects a task with the least redundancy
rate and dispatches the task to an idle worker. As
the least-RR-selected policy assumes a constant
task failure rate, it cannot be applied directly to the
LFPD. In this paper, therefore, a
highest-failure-
probability-selected
policy is proposed to provide
the similar function for LFPD. It selects the task
with the highest overall failure probability.
Furthermore, the failure probabilities of a task
on different workers are different in a heteroge-
neous environment. By considering the task as-
signment of multiple tasks to multiple workers, a
lower overall failure probability can be achieved.
Thus, the idea of
dispatch window
is introduced.
The dispatch window is the number of tasks that
will be dispatched together. Given a window
size
w
, the dispatcher waits for task enquiries
from workers until the dispatch window is full,
then it selects
w
tasks with the highest-failure-
probability-selected policy.
•
Status: ``undispatched.''
•
Redundancy rate:
RR
i
= 0
.
•
The overall failure probability:
FP
i
= 1
which means that a task will never finish
before it is dispatched
When the dispatcher dispatches a task
i
to a
worker
j
, it provides the required input values from
the preceding tasks, and then changes the task
i
's
status to ``dispatched'' if it was ``undispatched,''
and increases
RR
i
by one. The worker
j
is stored
in the
workerID
i
[
RR
i
]. The failure probability of
this assignment {task
i
→ worker
j
} is estimated
and stored in
EFPs
i
[
RR
i
]. The overall failure prob-
ability is calculated again.
When the dispatcher receives the result of a
task
i
from a worker
j
, it changes the status of task
i
to ``finished'', and sends a ``cancel'' message
to the workers in
workerID
i
[
RR
i
], except worker
j
. The function to cancel duplicate copies after
Search WWH ::
Custom Search