AI accelerator (NPU) IP - 1 to 20 TOPS

Overview

The Expedera Origin E2 is designed for Artificial Intelligence (AI) applications in power sensitive devices such as mobile phones and edge nodes. By using on-chip memory only, the E2 eliminates the requirement for external DRAM access, saving system power while increasing performance, reducing latency and shrinking system BOM costs. It is tunable for specific workloads to provide an optimal performance profile for unique application requirements.

Expedera's scalable tile-based design includes a single controller (SSP), and multiple matrix-math units (MMP), accumulators (PSM), vector engines (VSP) and memory to store the network. Specific configurations depend on unique application requirements. The unified compute pipeline architecture enables highly efficient hardware scheduling and advanced memory management to achieve unsurpassed end-to-end low-latency performance. The patented architecture is mathematically proven to utilize the least amount of memory for neural network (NN) execution. This minimizes die area, improves bandwidth, saves power, and maximizes performance.

Key Features

  • 1 to 20 TOPS performance
  • Performance efficient 18 TOPS/Watt
  • Scalable performance from 2-9K MACS
  • Capable of processing HD images on chip
  • Advanced activation memory management
  • Low latency
  • Tunable for specific workloads
  • Hardware scheduler for NN
  • Processes model as trained, no need for software optimizations
  • Use familiar open-source platforms like TFlite

Benefits

  • Speedup AI inference performance dramatically
  • Avoid system over-design and bloated system costs
  • Lowers BOM costs
  • Optimal performance for power sensitive applications
  • Suitable for latency sensitive applications
  • Increases performance for your most critical and common workloads
  • No heavy software support burden
  • Speeds deployment
  • Best in class platform support

Block Diagram

AI accelerator (NPU) IP - 1 to 20 TOPS Block Diagram

Applications

  • Smartphone
  • Consumer Electronics
  • AR/VR
  • Handheld devices

Deliverables

  • RTL -or- GDS
  • Documentation
  • SDK (TVM based)

Technical Specifications

Maturity
In production
Availability
Production
×
Semiconductor IP