New Streaming Algorithms for High Dimensional EMD and MST

11/05/2021
by   Xi Chen, et al.
0

We study streaming algorithms for two fundamental geometric problems: computing the cost of a Minimum Spanning Tree (MST) of an n-point set X ⊂{1,2,…,Δ}^d, and computing the Earth Mover Distance (EMD) between two multi-sets A,B ⊂{1,2,…,Δ}^d of size n. We consider the turnstile model, where points can be added and removed. We give a one-pass streaming algorithm for MST and a two-pass streaming algorithm for EMD, both achieving an approximation factor of Õ(log n) and using polylog(n,d,Δ)-space only. Furthermore, our algorithm for EMD can be compressed to a single pass with a small additive error. Previously, the best known sublinear-space streaming algorithms for either problem achieved an approximation of O(min{log n , log (Δ d)}log n) [Andoni-Indyk-Krauthgamer '08, Backurs-Dong-Indyk-Razenshteyn-Wagner '20]. For MST, we also prove that any constant space streaming algorithm can only achieve an approximation of Ω(log n), analogous to the Ω(log n) lower bound for EMD of [Andoni-Indyk-Krauthgamer '08]. Our algorithms are based on an improved analysis of a recursive space partitioning method known generically as the Quadtree. Specifically, we show that the Quadtree achieves an Õ(log n) approximation for both EMD and MST, improving on the O(min{log n , log (Δ d)}log n) approximation of [Andoni-Indyk-Krauthgamer '08, Backurs-Dong-Indyk-Razenshteyn-Wagner '20].

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2022

Streaming Euclidean MST to a Constant Factor

We study streaming algorithms for the fundamental geometric problem of c...
research
09/30/2018

Streaming Algorithms for Planar Convex Hulls

Many classical algorithms are known for computing the convex hull of a s...
research
02/13/2023

Maximum Coverage in Sublinear Space, Faster

Given a collection of m sets from a universe 𝒰, the Maximum Set Coverage...
research
12/12/2017

Approximate Convex Hull of Data Streams

Given a finite set of points P ⊆R^d, we would like to find a small subse...
research
04/08/2022

High-Dimensional Geometric Streaming in Polynomial Space

Many existing algorithms for streaming geometric data analysis have been...
research
09/10/2023

Streaming Semidefinite Programs: O(√(n)) Passes, Small Space and Fast Runtime

We study the problem of solving semidefinite programs (SDP) in the strea...
research
03/28/2022

Sublinear Algorithms and Lower Bounds for Estimating MST and TSP Cost in General Metrics

We consider the design of sublinear space and query complexity algorithm...

Please sign up or login with your details

Forgot password? Click here to reset