LLM Inference on RISC-V Embedded CPUs

Uploaded: 2024-11-04T13:54:00+00:00

By Yueh-Feng Lee, Andes Technology

Large language models (LLMs) have significantly advanced natural language processing, enabling complex text understanding and generation tasks. This presentation focuses on optimizing the open-source llama.cpp project for the RISC-V P (packed SIMD) extension. With the TinyLlama 1.1B model running on the Andes Voyager development board, on a quad-core CPU that supports the RISC-V P extension, performance results show that the model can achieve near real-time response. This work highlights the potential of RISC-V as an efficient platform for deploying advanced AI models in resource-constrained environments, contributing to the growing field of edge computing and embedded AI applications.
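
The abstract does not say which kernels were optimized. As a rough illustration only, the sketch below assumes the hot loop is a block-quantized int8 dot product, the kind of operation a packed SIMD extension is meant to speed up by processing four 8-bit lanes per 32-bit register. Everything here is hypothetical: `dot4_i8`, `block_q8`, `dot_q8_blocks`, and the 32-element block length are names and values chosen for the example, not taken from llama.cpp or from the presentation, and the packed multiply-accumulate is emulated in portable C rather than written with any real intrinsic.

```c
#include <stdint.h>
#include <string.h>

/* Assumption: 32 int8 values per block, loosely modelled on
 * block-quantized int8 layouts. Not the presenter's actual format. */
#define BLOCK_LEN 32

/* Hypothetical block type: one scale plus BLOCK_LEN quantized values. */
typedef struct {
    float  scale;
    int8_t q[BLOCK_LEN];
} block_q8;

/* Extract one byte lane from a 32-bit word and sign-extend it. */
static int32_t lane_i8(uint32_t word, int lane)
{
    int32_t v = (int32_t)((word >> (8 * lane)) & 0xFFu);
    return (v >= 128) ? v - 256 : v;
}

/* Portable emulation of a packed 4 x int8 multiply-accumulate:
 * acc += a0*b0 + a1*b1 + a2*b2 + a3*b3.
 * On hardware with packed SIMD support, this loop is the part a
 * single instruction could replace. */
static int32_t dot4_i8(int32_t acc, uint32_t a, uint32_t b)
{
    for (int lane = 0; lane < 4; ++lane) {
        acc += lane_i8(a, lane) * lane_i8(b, lane);
    }
    return acc;
}

/* Dot product of two quantized vectors, each nblocks blocks long.
 * The integer sum of each block is rescaled by both block scales. */
static float dot_q8_blocks(const block_q8 *x, const block_q8 *y, int nblocks)
{
    float sum = 0.0f;
    for (int b = 0; b < nblocks; ++b) {
        int32_t acc = 0;
        for (int i = 0; i < BLOCK_LEN; i += 4) {
            uint32_t xa, ya;
            /* memcpy avoids unaligned or aliasing-unsafe loads. */
            memcpy(&xa, &x[b].q[i], sizeof xa);
            memcpy(&ya, &y[b].q[i], sizeof ya);
            acc = dot4_i8(acc, xa, ya);
        }
        sum += (float)acc * x[b].scale * y[b].scale;
    }
    return sum;
}
```

The integer accumulation stays in 32 bits and the floating-point scaling happens once per block, so most of the work is in the four-lane multiply-accumulate. That is why this step is a natural target when porting to a CPU with packed 8-bit arithmetic.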
