SHARC Processor Benchmarks

Real-time signal processing tasks are both I/O-intensive and computationally intensive. In addition to high-speed math units and single-cycle execution of all instructions, including multiply-accumulates (MACs), SHARC Processors are designed for maximum I/O and memory access bandwidth. This balance of core speed, memory integration, and I/O bandwidth delivers the sustained performance critical to real-time applications.

Benchmarks are important because they show how a particular DSP performs in the context of an application. The smaller the benchmark number, the faster the algorithm executes, and a DSP that completes a task faster can perform more tasks in a given amount of time. Cycle time, clock speed, or MIPS alone cannot give an accurate indication of a processor's true performance, so it is important to analyze algorithm benchmarks as well.
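
To make the link between benchmark time and throughput concrete, the following Python sketch counts how many back-to-back runs of a task fit in a fixed time budget. The function name `executions_per_budget` and the 1 ms budget are illustrative assumptions, not part of any SHARC tooling; the two FFT times are the library-code figures from the benchmark table for the 100 MHz and 450 MHz parts. The estimate ignores I/O, data movement, and other overhead.

```python
def executions_per_budget(benchmark_us: float, budget_us: float) -> int:
    """Return how many back-to-back runs of a task fit in the time budget."""
    if benchmark_us <= 0:
        raise ValueError("benchmark time must be positive")
    return int(budget_us // benchmark_us)


# 1024-point complex FFT, library code (values taken from the benchmark table)
FFT_US_100MHZ = 109.72
FFT_US_450MHZ = 24.36

# Assumed 1 ms time budget, chosen only for illustration
BUDGET_US = 1000.0

slow = executions_per_budget(FFT_US_100MHZ, BUDGET_US)
fast = executions_per_budget(FFT_US_450MHZ, BUDGET_US)
print(f"100 MHz part: {slow} FFTs per ms, 450 MHz part: {fast} FFTs per ms")
```

Under these assumptions the smaller benchmark number translates directly into more completed tasks in the same interval.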

	ADSP-21160N ADSP-21161N SIMD	ADSP-21261 SIMD	ADSP-21262 ADSP-21266 SIMD	ADSP-21371 ADSP-21375 SIMD	ADSP-21364 ADSP-21365 SIMD	ADSP-21368 ADSP-21369 SIMD	ADSP-2146x SIMD	ADSP-2158x ADSP-SC58x SIMD
Clock Speed	100 MHz	150 MHz	200 MHz	266 MHz	333 MHz	400 MHz	450 MHz	450 MHz per core (2 DSP cores)
Instruction Cycle Time	10 ns	6.67 ns	5 ns	3.75 ns	3 ns	2.5 ns	2.22 ns	2.22 ns per core
MFLOPS Sustained	400 MFLOPS	600 MFLOPS	800 MFLOPS	1064 MFLOPS	1332 MFLOPS	1600 MFLOPS	1800 MFLOPS	3600 MFLOPS
MFLOPS Peak	600 MFLOPS	900 MFLOPS	1200 MFLOPS	1596 MFLOPS	1998 MFLOPS	2400 MFLOPS	2700 MFLOPS	5400 MFLOPS
1024-Pt Complex FFT (Radix 2), Using Library Code	109.72 µs	73.18 µs	54.86 µs	41.15 µs	32.91 µs	27.43 µs	24.36 µs	24.58 µs
1024-Pt Complex FFT (Radix 2), Using FFT H/W Accelerator	N/A	N/A	N/A	N/A	N/A	N/A	176.11 µs	5.53 µs
FIR Filter (per tap)	5 ns	3.3 ns	2.5 ns	1.88 ns	1.5 ns	1.25 ns	1.11 ns	1.11 ns
IIR Filter (per biquad)	20 ns	13.3 ns	10 ns	7.5 ns	6 ns	5 ns	4.43 ns	4.43 ns
Matrix Multiply (pipelined) [3x3] * [3x1]	45 ns	30 ns	22.5 ns	16.91 ns	13.5 ns	11.25 ns	10.00 ns	10.00 ns
Matrix Multiply (pipelined) [4x4] * [4x1]	80 ns	53.3 ns	40 ns	30.07 ns	24 ns	20 ns	17.78 ns	17.78 ns
Divide (y/x)	30 ns	20 ns	15 ns	11.27 ns	9 ns	7.5 ns	6.67 ns	6.67 ns
Inverse Square Root	45 ns	30 ns	22.5 ns	16.91 ns	13.5 ns	11.25 ns	10.00 ns	10.00 ns
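
The kernel timings in the table scale with the instruction cycle time, which can be checked with a short Python sketch. The helper names `cycle_time_ns` and `implied_cycles` are hypothetical, and the cycle counts are inferred by dividing the 100 MHz column by its 10 ns cycle time; they are not figures quoted from a datasheet. The same approach shows that the sustained and peak MFLOPS rows correspond to 4 and 6 floating-point operations per cycle per core.

```python
def cycle_time_ns(clock_mhz: float) -> float:
    """Instruction cycle time in nanoseconds for a given clock in MHz."""
    return 1000.0 / clock_mhz


def implied_cycles(benchmark_ns: float, clock_mhz: float) -> float:
    """Cycle count implied by a benchmark time at a given clock."""
    return benchmark_ns / cycle_time_ns(clock_mhz)


# Benchmark times in ns from the 100 MHz column of the table
BENCH_NS_100MHZ = {
    "FIR filter (per tap)": 5.0,
    "IIR filter (per biquad)": 20.0,
    "Matrix multiply [3x3] * [3x1]": 45.0,
    "Matrix multiply [4x4] * [4x1]": 80.0,
    "Divide (y/x)": 30.0,
    "Inverse square root": 45.0,
}

# Inferred cycle counts (an assumption derived from the table, not a spec)
cycles = {name: implied_cycles(ns, 100.0) for name, ns in BENCH_NS_100MHZ.items()}

# Predict the 200 MHz column from the inferred cycle counts
predicted_200mhz = {name: c * cycle_time_ns(200.0) for name, c in cycles.items()}

# FLOPs per cycle implied by the MFLOPS rows (100 MHz column: 400 and 600)
sustained_flops_per_cycle = 400.0 / 100.0
peak_flops_per_cycle = 600.0 / 100.0

for name in cycles:
    print(f"{name}: {cycles[name]} cycles, {predicted_200mhz[name]} ns at 200 MHz")
```

The predicted 200 MHz values can be compared against the 200 MHz column of the table; the small differences in some other columns (for example 16.91 ns at 266 MHz) come from rounding of the published cycle times.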
