Balancing Power and Performance With Task Dependencies in Multi-Core Systems
By Gokhan Akgun and Diana Göhringer
Technische Universität Dresden, Germany
Abstract:
The increasing use of FPGAs necessitates energy-efficient solutions, particularly for battery-powered applications. Although power dissipation is often perceived as a hardware issue, it can be mitigated through power-saving techniques such as dynamic voltage and frequency scaling (DVFS) and clock gating. In real-time systems, these strategies must reduce power consumption while still meeting strict timing requirements to avoid deadline violations. However, hardware constraints and variability in execution times complicate their implementation, particularly in multi-core systems, where task dependencies and inter-processor communication introduce delays and unpredictability. Real-time Operating Systems (RTOSs) manage task execution using scheduling algorithms, periodically checking task queues during context switches. Incoming messages trigger sporadic tasks that the RTOS must prioritize immediately; otherwise, it executes regular tasks or applies power-saving strategies during idle phases. Handling these diverse tasks in multi-core systems adds complexity, making it challenging to balance predictability, energy efficiency, and system performance. This work introduces a heterogeneous multi-core architecture that integrates power-aware task scheduling algorithms, such as the Look-Ahead algorithm and the Race-to-Idle strategy, to optimize power consumption while addressing task dependencies and inter-core communication. A hardware-based task scheduler improves scheduling performance and predictability, while tasks leverage the reconfigurable capabilities of FPGAs and are executed as hardware accelerators to further enhance energy efficiency. The experimental results demonstrate improvements of 64.91% in scheduling performance and 92% in energy efficiency compared to a baseline without power optimization, highlighting the effectiveness of the proposed approach.
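To make the trade-off between Race-to-Idle and frequency scaling concrete, the following is a minimal sketch, not the authors' scheduler. It assumes a simplified power model (dynamic power growing with the cube of the clock frequency, plus a constant static term while active and a separate idle power) and compares two policies for a single job: running at the highest frequency and idling afterwards, or running at the slowest frequency that still meets the deadline. All names (`Job`, `job_energy`, `race_to_idle`, `slowest_feasible`) and all numeric constants are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Job:
    cycles: float    # worst-case execution demand in clock cycles
    deadline: float  # relative deadline in seconds


def job_energy(job, freq_hz, k_dyn, p_static, p_idle):
    """Energy (J) to run the job at freq_hz, then idle until its deadline."""
    busy = job.cycles / freq_hz
    if busy > job.deadline:
        raise ValueError("deadline miss at this frequency")
    # Assumed model: cubic dynamic power plus constant static power.
    p_active = k_dyn * freq_hz ** 3 + p_static
    return p_active * busy + p_idle * (job.deadline - busy)


def race_to_idle(job, freqs_hz, **power):
    """Run at the highest frequency, then spend the slack idle."""
    f = max(freqs_hz)
    return f, job_energy(job, f, **power)


def slowest_feasible(job, freqs_hz, **power):
    """Pick the lowest frequency that still meets the deadline."""
    feasible = [f for f in freqs_hz if job.cycles / f <= job.deadline]
    if not feasible:
        raise ValueError("no frequency meets the deadline")
    f = min(feasible)
    return f, job_energy(job, f, **power)


job = Job(cycles=1e8, deadline=1.0)
freqs = [100e6, 200e6, 400e6]

# Hypothetical parameter sets: negligible vs. dominant static power,
# with a clock-gated idle state that is assumed to cost nothing.
low_static = dict(k_dyn=1e-27, p_static=0.0, p_idle=0.0)
high_static = dict(k_dyn=1e-27, p_static=0.1, p_idle=0.0)

for name, power in [("low static", low_static), ("high static", high_static)]:
    f_r, e_r = race_to_idle(job, freqs, **power)
    f_s, e_s = slowest_feasible(job, freqs, **power)
    print(f"{name}: race {f_r / 1e6:.0f} MHz {e_r:.4f} J, "
          f"slow {f_s / 1e6:.0f} MHz {e_s:.4f} J")
```

Under these assumed parameters, stretching the job to its deadline uses less energy when static power is negligible, whereas racing to idle uses less when static power dominates and the idle state is cheap. Which policy is preferable on real hardware depends on measured power figures, which this sketch does not model.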