LLMs On The Edge
Nearly all the data input for AI so far has been text, but that's about to change. In the future, that input will likely include video, voice, and other types of data, causing a massive increase in both the amount of data that needs to be modeled and the compute resources necessary to make it all work. This is hard enough in hyperscale data centers, which are sprouting up everywhere to handle training and some inferencing, but it's even more of a challenge in bandwidth- and power-limited edge devices. Sharad Chole, chief scientist and co-founder of Expedera, talks with Semiconductor Engineering about the tradeoffs involved in making this work, how to reduce the size of LLMs, and what impact this will have on engineers working in this space.