Recoil: Parallel rANS Decoding with Decoder-Adaptive Scalability

06/21/2023
by Fangzheng Lin, et al.

Entropy coding is essential to data compression and to image and video coding. The range variant of Asymmetric Numeral Systems (rANS) is a modern entropy coder that offers both high speed and a high compression rate. Because rANS is not designed for parallel execution, the conventional approach to parallel rANS partitions the input symbol sequence and encodes each partition with an independent codec, so every additional partition adds overhead. This approach is found in state-of-the-art implementations such as DietGPU. It is unsuitable for content-delivery applications: if the decoder cannot decode all partitions in parallel, the extra parallelism is wasted, yet all of the overhead is still transferred. To solve this, we propose Recoil, a parallel rANS decoding approach with decoder-adaptive scalability. We observe that a single rANS-encoded bitstream can be decoded from any position if the intermediate states are known. After renormalization, these states also have a smaller upper bound, so they can be stored efficiently. We then split the encoded bitstream using a heuristic that distributes the workload evenly, and store the intermediate states and corresponding symbol indices as metadata. Splits can later be combined simply by eliminating the extra metadata entries. The main contribution of Recoil is reducing unnecessary data transfer by adaptively scaling parallelism overhead to match the decoder's capability. Experiments show that Recoil's decoding throughput is comparable to that of the conventional approach, scales massively on CPUs and GPUs, and greatly outperforms various other ANS-based codecs.
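The core observation, that one rANS bitstream can be entered at any recorded point once the intermediate state and symbol index are known, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a single non-interleaved, byte-renormalized rANS coder, a fixed hypothetical frequency table (`FREQ`), and evenly spaced split points rather than the workload-balancing heuristic. It also stores the full state at each split and does not exploit the smaller post-renormalization bound. All names (`encode`, `decode_segment`, `decode_all`) are illustrative.

```python
PROB_BITS = 12
M = 1 << PROB_BITS          # total of all symbol frequencies
L = 1 << 23                 # lower bound of the normalized state interval

# Hypothetical fixed model: frequencies must sum to M.
FREQ = {"a": 2048, "b": 1024, "c": 512, "d": 512}
CUM, _acc = {}, 0
for _s, _f in FREQ.items():
    CUM[_s] = _acc
    _acc += _f
assert _acc == M


def symbol_at(slot):
    """Return the symbol whose cumulative range contains slot."""
    for s, f in FREQ.items():
        if CUM[s] <= slot < CUM[s] + f:
            return s
    raise ValueError("slot out of range")


def encode(symbols, every):
    """Encode forward; record (symbol index, state, byte offset) as metadata."""
    x, out, splits = L, bytearray(), []
    for i, s in enumerate(symbols):
        f, c = FREQ[s], CUM[s]
        x_max = ((L >> PROB_BITS) << 8) * f
        while x >= x_max:               # renormalize: push low bytes out
            out.append(x & 0xFF)
            x >>= 8
        x = ((x // f) << PROB_BITS) + (x % f) + c
        if (i + 1) % every == 0 or i == len(symbols) - 1:
            splits.append((i, x, len(out)))
    return bytes(out), splits


def decode_segment(stream, start, stop_index):
    """Decode symbols start_index down to stop_index + 1, reading bytes backward."""
    i, x, pos = start
    result = []
    while i > stop_index:
        slot = x & (M - 1)
        s = symbol_at(slot)
        x = FREQ[s] * (x >> PROB_BITS) + slot - CUM[s]
        while x < L:                    # renormalize: pull bytes back in
            pos -= 1
            x = (x << 8) | stream[pos]
        result.append(s)
        i -= 1
    return result[::-1]


def decode_all(stream, splits):
    """Each segment is independent; this loop stands in for parallel workers."""
    out, prev = [], -1
    for start in splits:
        out.extend(decode_segment(stream, start, prev))
        prev = start[0]
    return "".join(out)


message = "abacabadabacabacdcba" * 5
stream, splits = encode(message, every=8)
coarse = splits[-1::-2][::-1]           # drop every other entry, keep the last
assert decode_all(stream, splits) == message
assert decode_all(stream, coarse) == message
```

In this sketch the bitstream itself never changes. A decoder with fewer workers simply receives fewer metadata entries (`coarse`, or only `splits[-1]` for a serial decoder), which mirrors the idea of combining splits by eliminating extra metadata.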


