Digital Signal Processing Reference
In-Depth Information
TABLE 8.4 Number of Cycles with Different Optimization Schemes for Both
Fixed- and Floating-Point Implementations (Count = 200)
Number of Cycles
Optimization Scheme
Fixed-Point
Floating-Point
1. No optimization
2
+
(16
¥
200)
=
3202
2
+
(18
¥
200)
=
3602
2. With parallel instructions
1
+
(8
¥
200)
=
1601
1
+
(10
¥
200)
=
2001
3. Two sums per iteration
1
+
(8
¥
100)
=
801
1
+
(10
¥
100)
+
7
=
1008
4. With software pipelining
7
+
(100)
+
1
=
108
9
+
(100)
+
15
=
124
obtained for different array sizes, since the number of cycles in the prolog and epilog
stages remain the same.
Note that for a count of 1000, the fixed- and floating-point implementations with
software pipeling take:
+
(
)
+=
Fixed
Floating
-
point
point
:
7
count 2
1
508
cycles
+
(
)
+=
-
:
9
count 2
15
524
cycles
REFERENCES
1.
TMS320C6000 Programmer's Guide
, SPRU198G, Texas Instruments, Dallas, TX, 2002.
2.
Guidelines for Software Development Efficiency on the TMS320C6000 VelociTI Archi-
tecture
, SPRA434, Texas Instruments, Dallas, TX, 1998.
3.
TMS320C6000 CPU and Instruction Set
, SPRU189F, Texas Instruments, Dallas, TX, 2000.
4.
TMS320C6000 Assembly Language Tools User's Guide
, SPRU186K, Texas Instruments,
Dallas, TX, 2002.
5.
TMS320C6000 Optimizing C Compiler User's Guide
, SPRU187G, Texas Instruments,
Dallas, TX, 2000.
Search WWH ::
Custom Search