Fitting Metrics and Ultrametrics with Minimum Disagreements

08/29/2022
by Vincent Cohen-Addad, et al.

Given x ∈ (ℝ_{≥0})^{\binom{[n]}{2}} recording pairwise distances, the METRIC VIOLATION DISTANCE (MVD) problem asks to compute the ℓ_0 distance between x and the metric cone; i.e., to modify the minimum number of entries of x so that it becomes a metric. Because of its many applications in data analysis and optimization, this problem has been actively studied recently. We present an O(log n)-approximation algorithm for MVD, exponentially improving the previous best approximation ratio of O(OPT^{1/3}) of Fan et al. [SODA, 2018]. Furthermore, a major strength of our algorithm is its simplicity and running time. We also study the related problem of ULTRAMETRIC VIOLATION DISTANCE (UMVD), where the goal is to compute the ℓ_0 distance to the cone of ultrametrics, and obtain a constant-factor approximation algorithm. UMVD can be regarded as an extension of the problem of fitting ultrametrics studied by Ailon and Charikar [SIAM J. Computing, 2011] and by Cohen-Addad et al. [FOCS, 2021] from the ℓ_1 norm to the ℓ_0 norm. We show that this problem can be favorably interpreted as an instance of Correlation Clustering with an additional hierarchical structure, which we solve using a new O(1)-approximation algorithm for correlation clustering with the structural property that it outputs a refinement of the optimum clusters. An algorithm satisfying such a property may be of independent interest. We also provide an O(log n log log n)-approximation algorithm for weighted instances. Finally, we investigate the complementary version of these problems, where one aims to choose a maximum number of entries of x forming an (ultra-)metric. In stark contrast with the minimization versions, we prove that these maximization versions are hard to approximate within any constant factor assuming the Unique Games Conjecture.
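
To make the objects in the abstract concrete, the following is a minimal Python sketch of the two cones and of the ℓ_0 distance between two distance vectors. It is not the approximation algorithm of the paper. It only checks the triangle inequality (for metrics) and the strong triangle inequality (for ultrametrics) on every triple, and counts the pairs on which two inputs differ. The function names `is_metric`, `is_ultrametric`, and `l0_distance` are hypothetical, and the input is assumed to be a symmetric matrix with zero diagonal.

```python
from itertools import combinations


def is_metric(d):
    """Return True if every triple satisfies the triangle inequality.

    Assumption: d is a symmetric n x n matrix with zero diagonal.
    """
    n = len(d)
    for i, j, k in combinations(range(n), 3):
        a, b, c = d[i][j], d[j][k], d[i][k]
        # Each side must be at most the sum of the other two.
        if a > b + c or b > a + c or c > a + b:
            return False
    return True


def is_ultrametric(d):
    """Return True if every triple satisfies d(i,k) <= max(d(i,j), d(j,k)).

    Equivalently, the two largest distances in every triple are equal.
    """
    n = len(d)
    for i, j, k in combinations(range(n), 3):
        _, mid, largest = sorted((d[i][j], d[j][k], d[i][k]))
        if mid != largest:
            return False
    return True


def l0_distance(x, y):
    """Count the unordered pairs {i, j} on which x and y disagree."""
    n = len(x)
    return sum(1 for i, j in combinations(range(n), 2) if x[i][j] != y[i][j])


# Illustrative input: the pair (0, 2) violates the triangle inequality.
x = [[0, 1, 5],
     [1, 0, 1],
     [5, 1, 0]]
# Changing that single entry to 2 gives a metric that is not an ultrametric.
y = [[0, 1, 2],
     [1, 0, 1],
     [2, 1, 0]]
# Changing it to 1 instead gives an ultrametric.
z = [[0, 1, 1],
     [1, 0, 1],
     [1, 1, 0]]
```

On this toy input, `x` is not a metric, while `y` and `z` each differ from `x` in exactly one pair, so both violation distances of `x` equal 1. Checking all triples takes O(n^3) time, which is fine for illustration but says nothing about the running time claimed in the abstract.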

