cMPI: Using CXL Memory Sharing for MPI One-Sided and Two-Sided Inter-Node Communications By Xi Wang, University of California October 8, 2025
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs By Tianhao Zhu, Shanghai Jiao Tong University October 8, 2025
Double Duty: FPGA Architecture to Enable Concurrent LUT and Adder Chain Usage By Junius Pun, Nanyang Technological University October 6, 2025
A Resource-Driven Approach for Implementing CNNs on FPGAs Using Adaptive IPs By Philippe Magalhães, Hubert Curien Laboratory - Université Jean Monnet October 6, 2025
Fault Injection in On-Chip Interconnects: A Comparative Study of Wishbone, AXI-Lite, and AXI By Hongwei Zhao, Université Bretagne Sud September 30, 2025
aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on RawAudio By Yan Ru Pei, BrainChip September 18, 2025
Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference By Haoran Wu, University of Cambridge September 16, 2025
Hardware Acceleration of Kolmogorov-Arnold Network (KAN) in Large-Scale Systems By Wei-Hsing Huang, Georgia Institute of Technology September 16, 2025
CRADLE: Conversational RTL Design Space Exploration with LLM-based Multi-Agent Systems By Lukas Krupp, RPTU University of Kaiserslautern-Landau September 13, 2025
On the Thermal Vulnerability of 3D-Stacked High-Bandwidth Memory Architectures By Mehdi Elahi, North Carolina A&T State University September 10, 2025
OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis Designs By Rishov Sarkar, Georgia Institute of Technology September 5, 2025
Balancing Power and Performance With Task Dependencies in Multi-Core Systems By Gokhan Akgun, Technische Universität Dresden September 5, 2025
LLM Inference with Codebook-based Q4X Quantization using the Llama.cpp Framework on RISC-V Vector CPUs By Saqib Akram, 10xEngineers September 5, 2025
PCIe 5.0: The universal high-speed interconnect for High Bandwidth and Low Latency Applications Design Challenges & Solutions By September 2, 2025
Basilisk: A 34 mm2 End-to-End Open-Source 64-bit Linux-Capable RISC-V SoC in 130nm BiCMOS By Philippe Sauter, ETH Zurich September 1, 2025
Redefining Speed: How FPGAs are shaping the future of high-frequency trading By Jean-François Gagnon, Orthogone August 26, 2025
Bare-Metal RISC-V + NVDLA SoC for Efficient Deep Learning Inference By Vineet Kumar, University College Dublin August 25, 2025