An FPGA Implementation of Displacement Vector Search for Intra Pattern Copy in JPEG XS
By Qiyue Chen, Yao Li, Jie Tao, Song Chen, Li Li, Dong Liu
University of Science and Technology of China, Hefei, China

Abstract
Recent work has advanced the Intra Pattern Copy (IPC) tool for JPEG XS, an image compression standard designed for low-latency, low-complexity coding. IPC performs intra compensation prediction in the wavelet domain to reduce spatial redundancy in screen content. A key module of IPC is the displacement vector (DV) search, which finds the optimal offset to the prediction reference. However, the DV search is computationally intensive, which makes practical hardware deployment challenging. In this paper, we propose an efficient pipelined FPGA architecture for the DV search module to promote the practical deployment of IPC. We further introduce an optimized memory organization that exploits the computational characteristics of IPC and its inherent data reuse patterns to improve performance. Experimental results show that the proposed architecture achieves a throughput of 38.3 Mpixels/s at a power consumption of 277 mW, demonstrating its feasibility for practical hardware implementation of IPC and other predictive coding tools and providing a promising foundation for ASIC deployment.
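
To make the idea of a DV search concrete, the following is a minimal software sketch, not the architecture proposed in the paper. The abstract does not specify the matching cost, block size, or search window, so everything here is an assumption for illustration: the cost is a sum of absolute differences (SAD), the search is exhaustive, and a candidate is treated as valid only if its reference block lies entirely above the current block or entirely to its left in the same rows. The names `sad`, `dv_search`, `plane`, and `search_range` are hypothetical.

```python
def sad(plane, y0, x0, y1, x1, size):
    """Sum of absolute differences between two size x size blocks of `plane`."""
    return sum(
        abs(plane[y0 + i][x0 + j] - plane[y1 + i][x1 + j])
        for i in range(size)
        for j in range(size)
    )


def dv_search(plane, by, bx, size, search_range):
    """Exhaustive DV search for the block whose top-left corner is (by, bx).

    `plane` is a 2D list of coefficients (assumed layout).
    Returns ((dy, dx), cost) for the lowest-cost valid candidate,
    or (None, None) when no valid candidate exists.
    """
    height, width = len(plane), len(plane[0])
    best_dv, best_cost = None, None
    for dy in range(-search_range, 1):
        for dx in range(-search_range, search_range + 1):
            ry, rx = by + dy, bx + dx
            # Skip reference blocks that fall outside the plane.
            if ry < 0 or rx < 0 or ry + size > height or rx + size > width:
                continue
            # Assumed causality rule: the reference must be fully above the
            # current block, or in the same rows and fully to its left.
            fully_above = ry + size <= by
            fully_left = ry == by and rx + size <= bx
            if not (fully_above or fully_left):
                continue
            cost = sad(plane, by, bx, ry, rx, size)
            # Strict comparison keeps the first candidate found on ties.
            if best_cost is None or cost < best_cost:
                best_dv, best_cost = (dy, dx), cost
    return best_dv, best_cost
```

Each candidate costs `size * size` absolute differences, and the number of candidates grows with the square of `search_range`. This is the kind of workload that makes the search computationally intensive in software, and overlapping reference blocks share most of their samples, which is where data reuse becomes relevant.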