Arm Brings Transformers to IoT Devices
By Sally Ward-Foxton, EETimes (May 4, 2024)
NUREMBERG, Germany - The next generation of Arm’s Ethos micro-NPU, the Ethos-U85, is designed to support transformer operations, bringing generative AI models to IoT devices. The IP giant is seeing demand for transformer workloads at the edge, though in much smaller forms than their bigger siblings, large language models (LLMs), according to Paul Williamson, senior VP and general manager for Arm’s IoT line of business. For example, Arm has so far ported the vision transformer ViT-Tiny and the generative language model TinyLlama-1.1B to the Ethos-U85.
“Most machine learning inferencing is already being done on Arm-powered devices today,” Williamson said. “It may seem like the AI explosion came overnight, but the truth is Arm’s been preparing for this moment for a long time. The benefits of edge AI cut across a whole host of segments within IoT…AI needs tight integration between the hardware and the software, and Arm has invested heavily in the last decade.”