Composition of nested embeddings with an application to outlier removal

06/20/2023
by   Shuchi Chawla, et al.
0

We study the design of embeddings into Euclidean space with outliers. Given a metric space (X,d) and an integer k, the goal is to embed all but k points in X (called the "outliers") into ℓ_2 with the smallest possible distortion c. Finding the optimal distortion c for a given outlier set size k, or alternately the smallest k for a given target distortion c are both NP-hard problems. In fact, it is UGC-hard to approximate k to within a factor smaller than 2 even when the metric sans outliers is isometrically embeddable into ℓ_2. We consider bi-criteria approximations. Our main result is a polynomial time algorithm that approximates the outlier set size to within an O(log^4 k) factor and the distortion to within a constant factor. The main technical component in our result is an approach for constructing a composition of two given embeddings from subsets of X into ℓ_2 which inherits the distortions of each to within small multiplicative factors. Specifically, given a low c_S distortion embedding from S⊂ X into ℓ_2 and a high(er) c_X distortion embedding from the entire set X into ℓ_2, we construct a single embedding that achieves the same distortion c_S over pairs of points in S and an expansion of at most O(log k)· c_X over the remaining pairs of points, where k=|X∖ S|. Our composition theorem extends to embeddings into arbitrary ℓ_p metrics for p≥ 1, and may be of independent interest. While unions of embeddings over disjoint sets have been studied previously, to our knowledge, this is the first work to consider compositions of nested embeddings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2020

Computing Bi-Lipschitz Outlier Embeddings into the Line

The problem of computing a bi-Lipschitz embedding of a graphical metric ...
research
03/07/2023

Diversity Embeddings and the Hypergraph Sparsest Cut

Good approximations have been attained for the sparsest cut problem by r...
research
03/03/2021

Minimum-Distortion Embedding

We consider the vector embedding problem. We are given a finite set of i...
research
02/22/2018

Near Isometric Terminal Embeddings for Doubling Metrics

Given a metric space (X,d), a set of terminals K⊆ X, and a parameter t> ...
research
07/16/2019

Lossless Prioritized Embeddings

Given metric spaces (X,d) and (Y,ρ) and an ordering x_1,x_2,...,x_n of (...
research
09/14/2022

Small Transformers Compute Universal Metric Embeddings

We study representations of data from an arbitrary metric space 𝒳 in the...
research
08/08/2018

Steiner Point Removal with distortion O( k), using the Noisy-Voronoi algorithm

In the Steiner Point Removal (SPR) problem, we are given a weighted grap...

Please sign up or login with your details

Forgot password? Click here to reset