Triplet Reconstruction and all other Phylogenetic CSPs are Approximation Resistant

12/24/2022
by   Vaggos Chatziafratis, et al.
0

We study the natural problem of Triplet Reconstruction (also Rooted Triplets Consistency or Triplet Clustering), originally motivated in computational biology and relational databases (Aho, Sagiv, Szymanski, and Ullman, 1981): given n points, we want to embed them onto the n leaves of a rooted binary tree (a hierarchical clustering or ultrametric embedding) such that a given set of m triplet constraints is satisfied. Triplet ij|k indicates that “i, j are more closely related to each other than to k” and a tree satisfies ij|k if d(i,j) is the smallest among the 3 distances. Aho et al. (1981) gave an elegant efficient algorithm to find a tree respecting all constraints (if it exists) and it is easy to see that a random binary tree is a 1/3-approximation. Unfortunately, despite more than four decades of research, no better approximation is known. Our main theorem–which captures Triplet Reconstruction as a special case–is a general hardness of approximation result about Constraint Satisfaction Problems (CSPs) over infinite domains (the variables are mapped to any of the n leaves of a tree). Specifically, we prove, under Unique Games (Khot, 2002), that Triplet Reconstruction and more generally, every CSP over hierarchies is approximation resistant (there is no polynomial-time algorithm that does asymptotically better than a biased random assignment). This settles the approximability for many interesting Subtree or Supertree Aggregation Problems. More broadly, our result significantly extends the list of approximation resistant predicates and is a generalization of Guruswami, Hastad, Manokaran, Raghavendra, and Charikar (2011), who showed that ordering CSPs are approximation resistant. The main challenge in our analyses stems from the fact that trees have topology which is what determines whether a given triplet constraint on the leaves is satisfied or not.

READ FULL TEXT
research
07/09/2019

On the Approximability of Presidential Type Predicates

Given a predicate P: {-1, 1}^k →{-1, 1}, let CSP(P) be the set of constr...
research
10/07/2021

Faster algorithm for Unique (k,2)-CSP

In a (k,2)-Constraint Satisfaction Problem we are given a set of arbitra...
research
05/04/2021

Streaming approximation resistance of every ordering CSP

An ordering constraint satisfaction problem (OCSP) is given by a positiv...
research
05/10/2018

ETH-Hardness of Approximating 2-CSPs and Directed Steiner Network

We study the 2-ary constraint satisfaction problems (2-CSPs), which can ...
research
07/24/2018

A Note on Clustering Aggregation

We consider the clustering aggregation problem in which we are given a s...
research
10/14/2021

The AI Triplet: Computational, Conceptual, and Mathematical Representations in AI Education

Expertise in AI requires integrating computational, conceptual, and math...
research
05/18/2020

Approximate Denial Constraints

The problem of mining integrity constraints from data has been extensive...

Please sign up or login with your details

Forgot password? Click here to reset