Efficient Compression of Long Arbitrary Sequences with No Reference at the Encoder

02/23/2020
by   Yuval Cassuto, et al.
0

In a distributed information application an encoder compresses an arbitrary vector while a similar reference vector is available to the decoder as side information. For the Hamming-distance similarity measure, and when guaranteed perfect reconstruction is required, we present two contributions to the solution of this problem. One result shows that when a set of potential reference vectors is available to the encoder, lower compression rates can be achieved when the set satisfies a certain clustering property. Another result reduces the best known decoding complexity from exponential in the vector length n to O(n^1.5) by generalized concatenation of inner coset codes and outer error-correcting codes. One potential application of the results is the compression of DNA sequences, where similar (but not identical) reference vectors are shared among senders and receivers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/23/2020

Information set decoding of Lee-metric codes over finite rings

Information set decoding (ISD) algorithms are the best known procedures ...
research
02/12/2023

Efficient Integer Retrieving from Unordered Compressed Sequences

The variable-length Reverse Multi-Delimiter (RMD) codes are known to rep...
research
11/07/2021

Applications of Convolutional Codes to DNA Codes and Error-Correction

Convolutional codes are error-correcting linear codes that utilize shift...
research
05/16/2022

Genomic Compression with Read Alignment at the Decoder

We propose a new compression scheme for genomic data given as sequence f...
research
10/21/2022

Non-binary Codes for Correcting a Burst of at Most t Deletions

The problem of correcting deletions has received significant attention, ...
research
01/04/2019

Efficient and Explicit Balanced Primer Codes

To equip DNA-based data storage with random-access capabilities, Yazdi e...
research
12/30/2019

A New Burrows Wheeler Transform Markov Distance

Prior work inspired by compression algorithms has described how the Burr...

Please sign up or login with your details

Forgot password? Click here to reset