Hit performance goals with configurable processors
By Steven Leibson, Tensilica
October 17, 2006 -- dspdesignline.com
With mainstream DSPs, code must be hand-tuned using assembly code in order to meet performance goals. A more productive approach is to tailor the processor to the algorithm. This article explains why, using FFT, Viterbi, and MPEG4 examples.
For more than 30 years, the fixed-ISA (instruction set architecture) microprocessor has largely defined electronic system design. Similarly, single-chip DSPs have dominated DSP system design since they were introduced 20 years ago. Fixed-ISA processors have addressed application performance problems in two ways. The first approach has been to increase clock rate, mirroring the same trend in PC processors. If power dissipation is not important, rapidly escalating clock rates can go a long way towards force-fitting a particular processor to an application. The alternative approach for fixed-ISA processors is to provide more computational resources that operate in parallel so that the processor can perform more work per clock.
Designers of fixed-ISA microprocessors and DSPs attempt to develop architectures that are good at executing a wide range of algorithms, but not tailored to any specific application. This design approach can reduce clock-rate requirements, but it doesn't match the clock-rate or gate-usage efficiencies of a tailored solution. The design of complex portable and battery-powered processor-based systems increasingly calls for taking a different approach, one that more closely matches the processing resources to the application tasks so that very high clock rates (and correspondingly high power and heat dissipations) are not required. FPGA, ASIC and SOC technologies provide an ideal medium for tailoring processors to specific applications, using newly available configurable and extensible processor technology.
October 17, 2006 -- dspdesignline.com
With mainstream DSPs, code must be hand-tuned using assembly code in order to meet performance goals. A more productive approach is to tailor the processor to the algorithm. This article explains why, using FFT, Viterbi, and MPEG4 examples.
For more than 30 years, the fixed-ISA (instruction set architecture) microprocessor has largely defined electronic system design. Similarly, single-chip DSPs have dominated DSP system design since they were introduced 20 years ago. Fixed-ISA processors have addressed application performance problems in two ways. The first approach has been to increase clock rate, mirroring the same trend in PC processors. If power dissipation is not important, rapidly escalating clock rates can go a long way towards force-fitting a particular processor to an application. The alternative approach for fixed-ISA processors is to provide more computational resources that operate in parallel so that the processor can perform more work per clock.
Designers of fixed-ISA microprocessors and DSPs attempt to develop architectures that are good at executing a wide range of algorithms, but not tailored to any specific application. This design approach can reduce clock-rate requirements, but it doesn't match the clock-rate or gate-usage efficiencies of a tailored solution. The design of complex portable and battery-powered processor-based systems increasingly calls for taking a different approach, one that more closely matches the processing resources to the application tasks so that very high clock rates (and correspondingly high power and heat dissipations) are not required. FPGA, ASIC and SOC technologies provide an ideal medium for tailoring processors to specific applications, using newly available configurable and extensible processor technology.
To read the full article, click here
Related Semiconductor IP
- UCIe D2D Adapter & PHY Integrated IP
- Low Dropout (LDO) Regulator
- 16-Bit xSPI PSRAM PHY
- MIPI CSI-2 CSE2 Security Module
- ASIL B Compliant MIPI CSI-2 CSE2 Security Module
Related Articles
- Tuning Fork - A Tool For Optimizing Parallel Configurable Processors
- Configurable processors or RTL -- evaluating the tradeoffs
- Configurable Processors: Ready for Prime Time
- Configurable Processors for Video Processing SOCs
Latest Articles
- RISC-V Functional Safety for Autonomous Automotive Systems: An Analytical Framework and Research Roadmap for ML-Assisted Certification
- Emulation-based System-on-Chip Security Verification: Challenges and Opportunities
- A 129FPS Full HD Real-Time Accelerator for 3D Gaussian Splatting
- SkipOPU: An FPGA-based Overlay Processor for Large Language Models with Dynamically Allocated Computation
- TensorPool: A 3D-Stacked 8.4TFLOPS/4.3W Many-Core Domain-Specific Processor for AI-Native Radio Access Networks