Balancing Power and Performance With Task Dependencies in Multi-Core Systems
By Gokhan Akgun and Diana Göhringer
Technische Universität Dresden, Germany
Abstract:
The increasing use of FPGAs necessitates energy-efficient solutions, particularly for battery-powered applications. Although power dissipation is often perceived as a hardware issue, it can be mitigated through power-saving techniques such as dynamic voltage and frequency scaling and clock gating. In real-time systems, these strategies must reduce the power consumption and meet strict timing requirements to avoid deadline violations. However, hardware constraints and variability in execution times complicate their implementation, particularly in multi-core systems where task dependencies and inter-processor communication introduce delays and unpredictability. Real-time Operating Systems (RTOSs) manage task execution using scheduling algorithms, periodically checking task queues during context switches. Incoming messages trigger sporadic tasks that the RTOS must prioritize immediately, while regular tasks are executed, or power-saving strategies are applied during idle phases. Handling these diverse tasks in multi-core systems adds complexity, making it challenging to balance between predictability, energy efficiency, and system performance. This work introduces a heterogeneous multi-core architecture that integrates power-aware task scheduling algorithms, such as the Look-Ahead algorithm or Race-to-Idle strategy, to optimize power consumption while addressing task dependencies and inter-core communication. A hardware-based task scheduler improves scheduling performance and predictability, while tasks leverage the reconfigurable capabilities of FPGAs and are executed as hardware accelerators to further enhance energy efficiency. The experimental results demonstrate an improvement in scheduling performance of 64.91% and energy efficiency of 92% compared to a baseline without power optimization, highlighting the effectiveness of the proposed approach.
To read the full article, click here
Related Semiconductor IP
- Process/Voltage/Temperature Sensor with Self-calibration (Supply voltage 1.2V) - TSMC 3nm N3P
- USB 20Gbps Device Controller
- SM4 Cipher Engine
- Ultra-High-Speed Time-Interleaved 7-bit 64GSPS ADC on 3nm
- Fault Tolerant DDR2/DDR3/DDR4 Memory controller
Related White Papers
- A RISC-V Multicore and GPU SoC Platform with a Qualifiable Software Stack for Safety Critical Systems
- Achieving Lower Power, Better Performance, And Optimized Wire Length In Advanced SoC Designs
- Optimizing LPDDR4 Performance and Power with Multi-Channel Architectures
- eFPGAs Bring a 10X Advantage in Power and Cost
Latest White Papers
- Fault Injection in On-Chip Interconnects: A Comparative Study of Wishbone, AXI-Lite, and AXI
- eFPGA – Hidden Engine of Tomorrow’s High-Frequency Trading Systems
- aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on RawAudio
- Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference
- Hardware Acceleration of Kolmogorov-Arnold Network (KAN) in Large-Scale Systems