SHARC Benchmarks
SHARC Processor Benchmarks
Real-time signal processing tasks are I/O and computationally intensive. In addition to high-speed math units and all instructions executing in a single-cycle, including single-cycle multiply accumulates (MACs), SHARC Processors are designed for maximum I/O and memory access bandwidth. This balance of core speed, memory integration and I/O bandwidth achieves the sustained performance critical to real-time applications.
Benchmarks are important in that they show how a particular DSP performs in the context of an application. The smaller the benchmark number, the quicker the algorithm execution. If a DSP can perform the task quicker, the processor can perform more tasks in a given amount of time. Just looking at the cycle time, clock speed or MIPS of a DSP can not give an accurate indication of the true performance of the processor. Therefore it's important to analyze algorithm benchmarks, not only clock speed and cycle time.
ADSP-21160N ADSP-21161N SIMD |
ADSP-21261 SIMD |
ADSP-21262 ADSP-21266 SIMD |
ADSP-21371 ADSP-21375 SIMD |
ADSP-21364 ADSP-21365 SIMD |
ADSP-21368 ADSP-21369 SIMD |
ADSP-2146x SIMD |
ADSP-2158x ADSP-SC58x SIMD |
|
Clock Cycle | 100 MHz |
150 MHz | 200 MHz | 266 MHz | 333 MHz | 400 MHz | 450 MHz | 450 MHz per core (2 DSP cores) |
Instruction Cycle Time | 10 ns | 6.67 ns | 5 ns | 3.75 ns |
3 ns | 2.5 ns | 2.22 ns | 2.22 ns per core |
MFLOPS Sustained | 400 MFLOPS | 600 MFLOPS | 800 MFLOPS | 1064 MFLOPS | 1332 MFLOPS | 1600 MFLOPS | 1800 MFLOPS | 3600 MFLOPS |
MFLOPS Peak | 600 MFLOPS | 900 MFLOPS | 1200 MFLOPS | 1596 MFLOPS | 1998 MFLOPS | 2400 MFLOPS | 2700 MFLOPS | 5400 MFLOPS |
1024-Pt Complex FFT (Radix 2) Using Library Code Using FFT H/W Accelerator |
109.72 µs |
73.18 µs |
54.86 µs |
41.15 µs |
32.91 µs |
27.43 us |
24.36 µs |
24.58 µs |
N/A | N/A | N/A | N/A | N/A | N/A | 176.11 µs | 5.53 µs | |
FIR Filter (per tap) |
5 ns | 3.3 ns | 2.5 ns | 1.88 ns | 1.5 ns | 1.25 ns | 1.11 ns | 1.11 ns |
IIR Filter (per biquad) | 20 ns | 13.3 ns | 10 ns | 7.5 ns | 6 ns | 5 ns | 4.43 ns | 4.43 ns |
Matrix Multiply (pipelined) [3x3] * [3x1] [4x4] * [4x1] |
45 ns |
30 ns |
22.5 ns |
16.91 ns |
13.5 ns |
11.25 ns |
10.00 ns |
10.00 ns |
80 ns | 53.3 ns | 40 ns | 30.07 ns |
24 ns | 20 ns | 17.78 ns | 17.78 ns |
|
Divide (y/x) | 30 ns | 20 ns | 15 ns | 11.27 ns |
9 ns | 7.5 ns | 6.67 ns | 6.67 ns |
Inverse Square Root | 45 ns | 30 ns | 22.5 ns | 16.91 ns |
13.5 ns | 11.25 ns | 10.00 ns | 10.00 ns |
4.43 ns |