Improved Approximation Guarantees for Shortest Superstrings using Cycle Classification by Overlap to Length Ratios

11/06/2021
by   Matthias Englert, et al.
0

In the Shortest Superstring problem, we are given a set of strings and we are asking for a common superstring, which has the minimum number of characters. The Shortest Superstring problem is NP-hard and several constant-factor approximation algorithms are known for it. Of particular interest is the GREEDY algorithm, which repeatedly merges two strings of maximum overlap until a single string remains. The GREEDY algorithm, being simpler than other well-performing approximation algorithms for this problem, has attracted attention since the 1980s and is commonly used in practical applications. Tarhio and Ukkonen (TCS 1988) conjectured that GREEDY gives a 2-approximation. In a seminal work, Blum, Jiang, Li, Tromp, and Yannakakis (STOC 1991) proved that the superstring computed by GREEDY is a 4-approximation, and this upper bound was improved to 3.5 by Kaplan and Shafrir (IPL 2005). We show that the approximation guarantee of GREEDY is at most (13+√(57))/6 ≈ 3.425, making the first progress on this question since 2005. Furthermore, we prove that the Shortest Superstring can be approximated within a factor of (37+√(57))/18≈ 2.475, improving slightly upon the currently best 211/23-approximation algorithm by Mucha (SODA 2013).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2021

All instantiations of the greedy algorithm for the shortest superstring problem are equivalent

In the Shortest Common Superstring problem (SCS), one needs to find the ...
research
09/23/2018

Collapsing Superstring Conjecture

In the Shortest Common Superstring (SCS) problem, one is given a collect...
research
04/30/2018

On improving the approximation ratio of the r-shortest common superstring problem

The Shortest Common Superstring problem (SCS) consists, for a set of str...
research
02/28/2019

Graph and String Parameters: Connections Between Pathwidth, Cutwidth and the Locality Number

We investigate the locality number, a recently introduced structural par...
research
11/11/2018

Coverage Centrality Maximization in Undirected Networks

Centrality metrics are among the main tools in social network analysis. ...
research
03/10/2022

A Tighter Approximation Guarantee for Greedy Minimum Entropy Coupling

We examine the minimum entropy coupling problem, where one must find the...
research
02/02/2021

On Greedy Approaches to Hierarchical Aggregation

We analyze greedy algorithms for the Hierarchical Aggregation (HAG) prob...

Please sign up or login with your details

Forgot password? Click here to reset