Arm Brings Transformers to IoT Devices
By Sally Ward-Foxton, EETimes (May 4, 2024)
NUREMBERG, Germany - The next generation of Arm’s Ethos micro-NPU, Ethos-U85, is designed to support transformer operations, bringing generative AI models to IoT devices. The IP giant is seeing demand for transformer workloads at the edge, though for models much smaller than large language models (LLMs), according to Paul Williamson, senior VP and general manager for Arm’s IoT line of business. For example, Arm has so far ported the vision transformer ViT-Tiny and the generative language model TinyLlama-1.1B to the Ethos-U85.
“Most machine learning inferencing is already being done on Arm-powered devices today,” Williamson said. “It may seem like the AI explosion came overnight, but the truth is Arm’s been preparing for this moment for a long time. The benefits of edge AI cut across a whole host of segments within IoT…AI needs tight integration between the hardware and the software, and Arm has invested heavily in the last decade.”