Why ASIC Design Makes Sense for LLM-On-Device
A look at architectural and design considerations when designing ASICs for LLM-on-device.
By Steve Xu, Co-Founder and Chief Architect, XgenSilicon
EETimes | July 14, 2025

Multimodality LLMs can enable powerful real-time vision and audio applications if chip power and cost meet the constraints of edge devices. By adopting an ASIC approach, it’s possible to achieve a hardware-efficient implementation through custom design, resulting in lower power and cost compared to using off-the-shelf components, such as GPUs, NPUs, and application processors.
An ASIC design is a systematic approach to address power efficiency bottlenecks, which may be different from model to model and per deployment constraint.
For example, the power of Snapdragon AR1+ Gen 1 running a 1B vision model is 1 watt. An ASIC implementation of the same model can reduce it to 0.1 watt with design tradeoffs between silicon die area and power consumption by shifting the design from NPU + DDR architecture to ASIC + on-chip memory architecture. For smart glasses with a 500 mAh battery, this translates the active time of vision from 0.5 hours to 5 hours.
In this article, we’ll illustrate architectural and design considerations to be taken into account when planning and designing ASICs for LLM-on-device.
To read the full article, click here
Related Semiconductor IP
- Chiplet Die-to-Die Interconnect IP Solution
- High speed MACsec Engine 100G/200G/400G/800G/1.6T
- Temperature/Voltage sensors
- AMBA Bus Host to eSPI Controller/Target
- AMBA Bus Host to eSPI Controller
Related News
- Why the Microsemi-Actel deal makes complete sense
- Analysis: Why ARM-AMD makes sense
- Cadence-Mentor deal makes sense, says analyst
- Analyst: AMD-ARM deal makes no sense
Latest News
- Alliance for Open Media Releases AV2 Codec, Advancing Next-Generation Open Video Coding
- VeriSilicon Drives Commercial Adoption of AV2 Across Next-Generation Video and Streaming Applications
- Cadence Announces Collaboration with Intel Foundry to Accelerate Intel 14A Process Optimization for HPC and Mobile Designs
- Menta and Presto Engineering Announce Strategic Collaboration to Accelerate Adaptive ASIC Architectures with Embedded FPGA Technology
- MIPI A-PHY To Power Industry’s First Four-Company Automotive SerDes Interoperability Demonstration at AutoSens USA