NEOX® | GA100 is the leading GPGPU solution for the MCU market and world’s first GPGPU architecture based on RISC-V, offering the unique combination of performing both graphics and ?? on the same IP. Ultra-low power, tiny silicon area & memory requirements are key benefits that position NEOX ® | GA100 as the ultimate choice for battery powered and cost-constrained wearable and edge AIoT devices.
RISC-V GPGPU for 3D graphics and AI at the edge
Overview
Key Features
- GPGPU: 3D, Vector & 2.5D Graphics, AI
- ISA: RV64IMFC + custom GFX & AI extensions
- Vertex / Shader Processing: Unified Fully Programmable LLVM C/C++ RISCV
- Cores: 1
- Max Core Clock Frequency (MHz): 400
- Characterization Configuration: 32 threads; AXI4:64 or AXI4:128
- Colors: 16.7Million (24-bit)
- Texture Read Performance @100MHz: 100 Mpixel/sec
- Frame Buffer Write @100MHz: 100 Mpixel/sec
- Power Consumption: <10mW (workload dependent)
- Typical watch face performance @200 MHz (454x454, 32bpp): 2D >60fps (Vsync bound), VG >60fps (Vsync bound), 3D >30fps
- AI Performance: Performance reported by NEOX® | AI SDK available upon request
- Typical CPU utilization: <5%
- Resolution: Up to 4K
Benefits
- UNPARALLELED RISC-V FLEXIBILITY & PROGRAMMABILITY
- NEOX® | GA100 is the world’s first RISC-V GPGPU, having full entitlement of the RISC-V RV64 architecture and enabling greater flexibility and programmability for Graphics, AI & compute. Offering the unique opportunity to leverage the open ecosystem of tools (profilers/debuggers etc.) and mature compiler infrastructure (LLVM), it is a highly configurable, turnkey accelerator equipped with custom graphics and AI instructions.
- COMPELLING ULTRA-LOW POWER 3D FOR WEARABLES & AIOT
- NEOX® | GA100 multithreaded nature enables 3D graphics rendering on a variety of target systems, from low-power bare-metal systems to high-performance, Linux based systems. It introduces new hardware and a new set of software extensions to accelerate 3D graphics. 3D support is introduced through a royalty free file format called glTFTM developed by the Khronos® that enables the efficient transmission and loading of 3D models by engines and applications. Once the 3D object has been imported, effects like rotation, lighting, animation can be applied before rendering it to the screen. The entire 3D, VG & 2.5D API requires less than 150KB of memory and 10mW of power.
- ENABLING YOUR AI APPLICATION WITHOUT ADDITIONAL SILICON COST
- NEOX® | GA100 is delivered with an AI SDK that can perform offline graph optimization and AI inference based on TensorFlow Lite for MCU. It includes various open source, proprietary tools, and algorithms for analyzing, visualizing, converting, compressing, and deploying Deep Neural Networks (DNNs) on NEOX® architecture. It allows to perform various iterative steps in model compression and model analysis, until the desired balance between “accuracy-performance-memory” is achieved. NEOX® | AI SDK is also integrated into Edge Impulse platform, enabling ML developers to easily target NEOX® | GA100 for edge applications on wearable and AIoT devices.
- MARKETS & APPLICATIONS: LEADING AT THE EDGE
- NEOX® | GA100 is addressing diverse vertical markets including next generation ultra-low power devices, such as wearables (smartwatches, AI smart glasses, hearables), smart home (appliances, thermostats, speakers, cameras), mobility (e-bikes, e-scooters) and industrial HMI/IoT.
- Example graphics applications for NEOX® | GA100 include rendering 3D / VG / 2.5D watch faces, VG or raster fonts, VG maps, notifications and alerts, 3D avatars for video calls, and more. Example AI applications include human activity recognition, health monitoring and diagnostics, keyword spotting, audio and speech enhancement, among others.
Block Diagram
Applications
- Small to mid-sized displays (1,5" - 10")
- Wearables: smartwatches/smart bands/ AI smart glasses/hearables
- Mobility: E-bikes & E-scooters
- Smart Home: smart appliances/thermostats/speakers
- Industrial: PoS/Pol terminals
Technical Specifications
Related IPs
- IP platform for intelligence gathering chips at the Edge
- Scalar and SIMD floating point option for the ARC HS5x, HS5xD and HS6x processors
- Scalable 3D Graphics Accelerator
- 2D (vector graphics) & 3D GPU IP A GPU IP combining 3D and 2D rendering features with high performance, low power consumption, and minimum CPU load
- Shader architecture type 3D GPU Integrating the OpenVG 1.1 hardware processing pipeline
- OpenGL® ES2.0 compatible 3D graphics IP core