Computing Approximate Statistical Discrepancy

04/30/2018
by   Michael Matheny, et al.
0

Consider a geometric range space (X,A̧) where each data point x ∈ X has two or more values (say r(x) and b(x)). Also consider a function Φ(A) defined on any subset A ∈ (X,A̧) on the sum of values in that range e.g., r_A = ∑_x ∈ A r(x) and b_A = ∑_x ∈ A b(x). The Φ-maximum range is A^* = _A ∈ (X,A̧)Φ(A). Our goal is to find some  such that |Φ(Â) - Φ(A^*)| ≤ε. We develop algorithms for this problem for range spaces with bounded VC-dimension, as well as significant improvements for those defined by balls, halfspaces, and axis-aligned rectangles. This problem has many applications in many areas including discrepancy evaluation, classification, and spatial scan statistics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2021

Approximate Maximum Halfspace Discrepancy

Consider the geometric range space (X, ℋ_d) where X ⊂ℝ^d and ℋ_d is the ...
research
12/06/2021

Polychromatic Colorings of Unions of Geometric Hypergraphs

We consider the polychromatic coloring problems for unions of two or mor...
research
01/19/2021

Star Discrepancy Subset Selection: Problem Formulation and Efficient Approaches for Low Dimensions

Motivated by applications in instance selection, we introduce the star d...
research
07/02/2021

Linear Discrepancy is Π_2-Hard to Approximate

In this note, we prove that the problem of computing the linear discrepa...
research
04/30/2018

Practical Low-Dimensional Halfspace Range Space Sampling

We develop, analyze, implement, and compare new algorithms for creating ...
research
10/16/2019

Some Geometric Applications of Anti-Chains

We present an algorithmic framework for computing anti-chains of maximum...
research
10/16/2017

Geometric Learning and Filtering in Finance

We develop a method for incorporating relevant non-Euclidean geometric i...

Please sign up or login with your details

Forgot password? Click here to reset