Hierarchical Overlap Graph

02/13/2018
by   Bastien Cazaux, et al.
0

Given a set of finite words, the Overlap Graph (OG) is a complete weighted digraph where each word is a node and where the weight of an arc equals the length of the longest overlap of one word onto the other (Overlap is an asymmetric notion). The OG serves to assemble DNA fragments or to compute shortest superstrings which are a compressed representation of the input. The OG requires a space is quadratic in the number of words, which limits its scalability. The Hierarchical Overlap Graph (HOG) is an alternative graph that also encodes all maximal overlaps, but uses a space that is linear in the sum of the lengths of the input words. We propose the first algorithm to build the HOG in linear space for words of equal length.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2021

A Linear Time Algorithm for Constructing Hierarchical Overlap Graphs

The hierarchical overlap graph (HOG) is a graph that encodes overlaps fr...
research
06/17/2020

Extremal overlap-free and extremal β-free binary words

An overlap-free (or β-free) word w over a fixed alphabet Σ is extremal i...
research
05/25/2021

An explicit algorithm for normal forms in small overlap monoids

If 𝒫 = ⟨ A | R ⟩ is a monoid presentation, then the relation words in...
research
04/20/2023

KOIOS: Top-k Semantic Overlap Set Search

We study the top-k set similarity search problem using semantic overlap....
research
02/04/2021

Optimal Construction of Hierarchical Overlap Graphs

Genome assembly is a fundamental problem in Bioinformatics, where for a ...
research
05/24/2020

An inequality for the number of periods in a word

We prove an inequality for the number of periods in a word x in terms of...
research
06/04/2023

Overlap Splines and Meshless Finite Difference Methods

We consider overlap splines that are defined by connecting the patches o...

Please sign up or login with your details

Forgot password? Click here to reset