Pipelining: Basic and Intermediate Concepts - Computer Architecture: A Quantitative Approach

Hardware Reference

In-Depth Information

Performance of Branch Schemes

What is the effective performance of each of these schemes? The effective pipeline speedup

with branch penalties, assuming an ideal CPI of 1, is

Because of the following:

we obtain:

The branch frequency and branch penalty can have a component from both unconditional

and conditional branches. However, the later dominate since they are more frequent.

Example

For a deeper pipeline, such as that in a MIPS R4000, it takes at least three

pipeline stages before the branch-target address is known and an additional

cycle before the branch condition is evaluated, assuming no stalls on the re-

gisters in the conditional comparison. A three-stage delay leads to the branch

penalties for the three simplest prediction schemes listed in Figure C.15 .

FIGURE C.15 Branch penalties for the three simplest prediction schemes

for a deeper pipeline .

Find the effective addition to the CPI arising from branches for this pipeline,

assuming the following frequencies:

Search WWH ::

Custom Search

Home