Digital Signal Processing Reference
In-Depth Information
FIGURE 8.12. Dependency graph for two sums of products per iteration.
A similar dependency graph for a floating-point implementation can be obtained
using LDDW , MPYSP , and ADDSP in lieu of LDW , MPY/MPYH , and ADD , respectively,
in Figure 8.12. Note that the single-precision instructions ADDSP and MPYSP both
take four cycles to complete (three delay slots each).
8.5.3 Scheduling Table
Table 8.1 shows a scheduling table drawn from the dependency graph.
1. LDW starts in cycle 1.
2. MPY and MPYH must start five cycles after the LDW s due to the four delay slots.
Therefore, MPY and MPYH start in cycle 6.
3. ADD must start two cycles after MPY/MPYH due to the one delay slot of
MPY/MPYH. Therefore, ADD starts in cycle 8.
4. B has five delay slots and starts in cycle 3, since branching occurs in cycle 9,
after the ADD instruction.
5. SUB instruction must start one cycle before the branch instruction, since the
loop count is decremented before branching occurs. Therefore, SUB starts in
cycle 2.
Search WWH ::




Custom Search