Advantages and Challenges of Designing with Multiple Inferencing Chips

By Geoff Tate, Flex Logix
EE Times (November 12, 2019)

Using multiple inferencing chips can deliver significant improvements in performance, but only when the neural network is designed correctly

The last two years have been extremely busy in the inferencing chip business. For a while, it seemed like every other week another company introduced a new and better solution. While all this innovation was great, most companies didn’t know what to make of the various solutions because they could not tell which one outperformed the others. With no established benchmarks in this new market, they either had to get up to speed quickly on inference chips or trust the performance figures provided by the vendors.

Most vendors provided some type of performance figure, usually from whatever benchmark made them look good. Some vendors quoted TOPS and TOPS/Watt without specifying models, batch sizes or process/voltage/temperature conditions. Others used the ResNet-50 benchmark, which is a much simpler model than most people need, so its value in evaluating inference options is questionable.

We’ve come a long way from those early days. Companies have slowly figured out that what really matters when measuring the performance of inference chips is 1) high MAC utilization, 2) low power and 3) keeping everything small.
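To make the first criterion concrete, here is a minimal Python sketch of how MAC utilization relates to a quoted peak TOPS figure. It is an illustration under stated assumptions, not a vendor's method: the function names and all numbers are hypothetical, each MAC unit is assumed to complete one multiply-accumulate per clock cycle, and one MAC is counted as two operations.

```python
def peak_tops(num_macs: int, clock_hz: float) -> float:
    """Peak tera-operations per second, counting one MAC as two operations.

    Assumes every MAC unit completes one multiply-accumulate per cycle.
    """
    return 2 * num_macs * clock_hz / 1e12


def mac_utilization(macs_per_inference: float,
                    inferences_per_second: float,
                    num_macs: int,
                    clock_hz: float) -> float:
    """Fraction of the chip's peak MAC throughput that a workload actually uses."""
    achieved_macs_per_second = macs_per_inference * inferences_per_second
    peak_macs_per_second = num_macs * clock_hz
    return achieved_macs_per_second / peak_macs_per_second


# Hypothetical chip and workload, chosen only for illustration:
# 4,096 MAC units at 1 GHz, a model needing 4 billion MACs per inference,
# measured at 100 inferences per second.
chip_macs = 4096
chip_clock_hz = 1e9

print(f"Peak: {peak_tops(chip_macs, chip_clock_hz):.3f} TOPS")
print(f"Utilization: {mac_utilization(4e9, 100, chip_macs, chip_clock_hz):.1%}")
```

In this made-up case the chip would be using less than a tenth of its headline peak on the workload. That gap is why a peak TOPS number quoted without a model and batch size says little about delivered performance.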
