Approximate Maximum Halfspace Discrepancy

06/25/2021
by   Michael Matheny, et al.
0

Consider the geometric range space (X, ℋ_d) where X ⊂ℝ^d and ℋ_d is the set of ranges defined by d-dimensional halfspaces. In this setting we consider that X is the disjoint union of a red and blue set. For each halfspace h ∈ℋ_d define a function Φ(h) that measures the "difference" between the fraction of red and fraction of blue points which fall in the range h. In this context the maximum discrepancy problem is to find the h^* = max_h ∈ (X, ℋ_d)Φ(h). We aim to instead find an ĥ such that Φ(h^*) - Φ(ĥ) ≤ε. This is the central problem in linear classification for machine learning, in spatial scan statistics for spatial anomaly detection, and shows up in many other areas. We provide a solution for this problem in O(|X| + (1/ε^d) log^4 (1/ε)) time, which improves polynomially over the previous best solutions. For d=2 we show that this is nearly tight through conditional lower bounds. For different classes of Φ we can either provide a Ω(|X|^3/2 - o(1)) time lower bound for the exact solution with a reduction to APSP, or an Ω(|X| + 1/ε^2-o(1)) lower bound for the approximate solution with a reduction to 3SUM. A key technical result is a ε-approximate halfspace range counting data structure of size O(1/ε^d) with O(log (1/ε)) query time, which we can build in O(|X| + (1/ε^d) log^4 (1/ε)) time.

READ FULL TEXT
research
08/18/2021

A Tighter Relation Between Hereditary Discrepancy and Determinant Lower Bound

In seminal work, Lovász, Spencer, and Vesztergombi [European J. Combin.,...
research
04/30/2018

Computing Approximate Statistical Discrepancy

Consider a geometric range space (X,A̧) where each data point x ∈ X has ...
research
04/30/2018

Practical Low-Dimensional Halfspace Range Space Sampling

We develop, analyze, implement, and compare new algorithms for creating ...
research
03/14/2023

On the Gap between Hereditary Discrepancy and the Determinant Lower Bound

The determinant lower bound of Lovasz, Spencer, and Vesztergombi [Europe...
research
12/01/2020

Lower Bounds for Semialgebraic Range Searching and Stabbing Problems

In the semialgebraic range searching problem, we are to preprocess n poi...
research
03/19/2018

On the streaming complexity of fundamental geometric problems

In this paper, we focus on lower bounds and algorithms for some basic ge...
research
10/18/2022

Computing MEMs on Repetitive Text Collections

We consider the problem of computing the Maximal Exact Matches (MEMs) of...

Please sign up or login with your details

Forgot password? Click here to reset