Approximate Convex Hull of Data Streams

12/12/2017
by   Avrim Blum, et al.

Given a finite set of points P ⊆ R^d, we would like to find a small subset S ⊆ P such that the convex hull of S approximately contains P. More formally, every point in P is within distance ϵ of the convex hull of S. Such a subset S is called an ϵ-hull. Computing an ϵ-hull is an important problem in computational geometry, machine learning, and approximation algorithms. In many real-world applications, the set P is too large to fit in memory. We consider the streaming model, where the algorithm receives the points of P sequentially and strives to use a minimal amount of memory. Existing streaming algorithms for computing an ϵ-hull require O(ϵ^{-(d-1)/2}) space, which is optimal for a worst-case input. However, this ignores the structure of the data. The minimal size of an ϵ-hull of P, which we denote by OPT, can be much smaller. A natural question is whether a streaming algorithm can compute an ϵ-hull using only O(OPT) space. We begin with lower bounds showing that no single-pass streaming algorithm can compute an ϵ-hull with O(OPT) space. We instead propose three relaxations of the problem for which we can compute ϵ-hulls using space near-linear in the optimal size. Our first algorithm, for points in R^2 that arrive in random order, uses O(log n · OPT) space. Our second algorithm, for points in R^2, makes O(log(1/ϵ)) passes before outputting the ϵ-hull and requires O(OPT) space. Our third algorithm, for points in R^d for any fixed dimension d, outputs an ϵ-hull for all but a δ-fraction of directions and requires O(OPT · log OPT) space.
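To make the ϵ-hull definition concrete, here is a minimal offline sketch in Python that checks whether a candidate subset S is an ϵ-hull of P for points in R^2. It is not any of the paper's streaming algorithms: it keeps all points in memory and only verifies the definition. The function names (convex_hull, dist_to_hull, is_eps_hull) are hypothetical and chosen for illustration.

```python
import math

def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive for a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    """Andrew's monotone chain; hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def dist_to_segment(p, a, b):
    """Euclidean distance from point p to the segment ab."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    denom = dx * dx + dy * dy
    t = 0.0 if denom == 0 else ((p[0] - a[0]) * dx + (p[1] - a[1]) * dy) / denom
    t = max(0.0, min(1.0, t))  # clamp the projection onto the segment
    return math.hypot(p[0] - (a[0] + t * dx), p[1] - (a[1] + t * dy))

def dist_to_hull(p, hull):
    """Distance from p to the convex polygon `hull` (0 if p lies inside)."""
    n = len(hull)
    if n == 0:
        raise ValueError("hull must contain at least one point")
    if n == 1:
        return math.hypot(p[0] - hull[0][0], p[1] - hull[0][1])
    edges = [(hull[i], hull[(i + 1) % n]) for i in range(n)]
    # Inside a counter-clockwise polygon: never strictly right of any edge.
    if n >= 3 and all(cross(a, b, p) >= 0 for a, b in edges):
        return 0.0
    return min(dist_to_segment(p, a, b) for a, b in edges)

def is_eps_hull(S, P, eps):
    """True if every point of P is within distance eps of conv(S)."""
    hull = convex_hull(S)
    return all(dist_to_hull(p, hull) <= eps for p in P)

# Usage: four corners of the unit square as a candidate eps-hull.
S = [(0, 0), (1, 0), (1, 1), (0, 1)]
P = S + [(0.5, 0.5), (0.5, 1.05)]
print(is_eps_hull(S, P, 0.1))   # the outlier is about 0.05 from conv(S)
print(is_eps_hull(S, P, 0.01))
```

In this example the point (0.5, 1.05) lies roughly 0.05 above the top edge of the square, so the four corners form an ϵ-hull for ϵ = 0.1 but not for ϵ = 0.01. The check costs O(|P| · |S|) time, which is fine for illustration but unrelated to the space bounds discussed in the abstract.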

research
09/30/2018

Streaming Algorithms for Planar Convex Hulls

Many classical algorithms are known for computing the convex hull of a s...
research
11/05/2021

New Streaming Algorithms for High Dimensional EMD and MST

We study streaming algorithms for two fundamental geometric problems: co...
research
01/27/2018

Variance-Optimal Offline and Streaming Stratified Random Sampling

Stratified random sampling (SRS) is a fundamental sampling technique tha...
research
03/15/2019

Distribution-Sensitive Bounds on Relative Approximations of Geometric Ranges

A family R of ranges and a set X of points together define a range space...
research
01/11/2019

Depth First Search in the Semi-streaming Model

Depth first search (DFS) tree is a fundamental data structure for solvin...
research
07/03/2023

A numerical algorithm for attaining the Chebyshev bound in optimal learning

Given a compact subset of a Banach space, the Chebyshev center problem c...
research
02/15/2020

Sparse Coresets for SVD on Infinite Streams

In streaming Singular Value Decomposition (SVD), d-dimensional rows of a...
