2D Dual-Rate Texturing in D-Series GPUs
For every GPU generation the performance teams within Imagination run through a wide range of content, analysing and understanding the different workload types and their bottlenecks. As part of this analysis, the data revealed that many modern games spend an increasing amount of time executing post-processing algorithms to enable depth of field, bloom, blur and other effects.
Most of these post-processing passes are texture-sampling heavy filter effects which are modest in ALU requirements but bottlenecked by the throughput rate of the Texture Processing Unit (TPU). One approach to resolve this would be to simply brute force change the ratio of the number of TPU units versus the USC/ALU rate. However, our analysis indicated this was not a good strategy, for several reasons.
First, in regular render passes the ratio of ALU versus TPU in D-Series GPUs was already optimal and adding another TPU would simply not result in any benefits as the workload would become ALU limited. Meanwhile, other processing passes were TPU-heavy but also bandwidth-heavy, and hence boosting the TPU would not help, as there would be insufficient bandwidth to feed the extra TPU throughput so performance would not be enhanced.
To read the full article, click here
Related Semiconductor IP
- E-Series GPU IP
- Arm's most performance and efficient GPU till date, offering unparalled mobile gaming and ML performance
- 3D OpenGL ES GPU (Graphics Processing Unit)
- Highest performance automotive GPU IP, with revolutionary functional safety technology
- High-performance 2D (sprite graphics) GPU IP combining high pixel processing capacity and minimum gate count.
Related Blogs
- Pipelined Data Masters in D-Series GPUs
- GPUs in cars are for more than just graphics
- What's driving 3D IC design? Do 2D EDA tools need a total overhaul to support 3D design?
- GPUs Taking Bigger Share Of SOC
Latest Blogs
- Cadence Extends Support for Automotive Solutions on Arm Zena Compute Subsystems
- The Role of GPU in AI: Tech Impact & Imagination Technologies
- Time-of-Flight Decoding with Tensilica Vision DSPs - AI's Role in ToF Decoding
- Synopsys Expands Collaboration with Arm to Accelerate the Automotive Industry’s Transformation to Software-Defined Vehicles
- Deep Robotics and Arm Power the Future of Autonomous Mobility