HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances

11/03/2021
by   Dominik Köppl, et al.
0

We propose a new representation of the offsets of the Lempel-Ziv (LZ) factorization based on the co-lexicographic order of the processed prefixes. The selected offsets tend to approach the k-th order empirical entropy. Our evaluations show that this choice of offsets is superior to the rightmost LZ parsing and the bit-optimal LZ parsing on datasets with small high-order entropy.

READ FULL TEXT

page 12

page 13

research
11/07/2019

Towards Better Compressed Representations

We introduce the problem of computing a parsing where each phrase is of ...
research
06/18/2023

Transferring Neural Potentials For High Order Dependency Parsing

High order dependency parsing leverages high order features such as sibl...
research
11/02/2022

Entropy conservative high-order fluxes in the presence of boundaries

In this paper, we propose a novel development in the context of entropy ...
research
03/19/2022

On the entropy projection and the robustness of high order entropy stable discontinuous Galerkin schemes for under-resolved flows

High order entropy stable schemes provide improved robustness for comput...
research
04/16/2022

Tensor-networks for High-order Polynomial Approximation: A Many-body Physics Perspective

We analyze the problem of high-order polynomial approximation from a man...
research
05/20/2023

Low-Entropy Latent Variables Hurt Out-of-Distribution Performance

We study the relationship between the entropy of intermediate representa...
research
02/08/2021

In-Order Chart-Based Constituent Parsing

We propose a novel in-order chart-based model for constituent parsing. C...

Please sign up or login with your details

Forgot password? Click here to reset