Practical Low-Dimensional Halfspace Range Space Sampling

04/30/2018
by   Michael Matheny, et al.
0

We develop, analyze, implement, and compare new algorithms for creating ε-samples of range spaces defined by halfspaces which have size sub-quadratic in 1/ε, and have runtime linear in the input size and near-quadratic in 1/ε. The key to our solution is an efficient construction of partition trees. Despite not requiring any techniques developed after the early 1990s, apparently such a result was not ever explicitly described. We demonstrate that our implementations, including new implementations of several variants of partition trees, do indeed run in time linear in the input, appear to run linear in output size, and observe smaller error for the same size sample compared to the ubiquitous random sample (which requires size quadratic in 1/ε). This result has direct applications in speeding up discrepancy evaluation, approximate range counting, and spatial anomaly detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2021

Distribution Compression in Near-linear Time

In distribution compression, one aims to accurately summarize a probabil...
research
06/25/2021

Approximate Maximum Halfspace Discrepancy

Consider the geometric range space (X, ℋ_d) where X ⊂ℝ^d and ℋ_d is the ...
research
03/21/2022

Improved Sampling-to-Counting Reductions in High-Dimensional Expanders and Faster Parallel Determinantal Sampling

We study parallel sampling algorithms for classes of distributions defin...
research
04/30/2018

Computing Approximate Statistical Discrepancy

Consider a geometric range space (X,A̧) where each data point x ∈ X has ...
research
09/11/2021

The Labeled Direct Product Optimally Solves String Problems on Graphs

Suffix trees are an important data structure at the core of optimal solu...
research
01/25/2019

Faster Boosting with Smaller Memory

The two state-of-the-art implementations of boosted trees: XGBoost and L...
research
05/23/2016

A Sub-Quadratic Exact Medoid Algorithm

We present a new algorithm, trimed, for obtaining the medoid of a set, t...

Please sign up or login with your details

Forgot password? Click here to reset