For Kernel Range Spaces a Constant Number of Queries Are Sufficient

06/28/2023
by   Jeff M. Phillips, et al.
0

We introduce the notion of an ε-cover for a kernel range space. A kernel range space concerns a set of points X ⊂ℝ^d and the space of all queries by a fixed kernel (e.g., a Gaussian kernel K(p,·) = exp(-p-·^2)). For a point set X of size n, a query returns a vector of values R_p ∈ℝ^n, where the ith coordinate (R_p)_i = K(p,x_i) for x_i ∈ X. An ε-cover is a subset of points Q ⊂ℝ^d so for any p ∈ℝ^d that 1/nR_p - R_q_1≤ε for some q ∈ Q. This is a smooth analog of Haussler's notion of ε-covers for combinatorial range spaces (e.g., defined by subsets of points within a ball query) where the resulting vectors R_p are in {0,1}^n instead of [0,1]^n. The kernel versions of these range spaces show up in data analysis tasks where the coordinates may be uncertain or imprecise, and hence one wishes to add some flexibility in the notion of inside and outside of a query range. Our main result is that, unlike combinatorial range spaces, the size of kernel ε-covers is independent of the input size n and dimension d. We obtain a bound of (1/ε)^Õ(1/ε^2), where Õ(f(1/ε)) hides log factors in (1/ε) that can depend on the kernel. This implies that by relaxing the notion of boundaries in range queries, eventually the curse of dimensionality disappears, and may help explain the success of machine learning in very high-dimensions. We also complement this result with a lower bound of almost (1/ε)^Ω(1/ε), showing the exponential dependence on 1/ε is necessary.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2022

In-Range Farthest Point Queries and Related Problem in High Dimensions

Range-aggregate query is an important type of queries with numerous appl...
research
02/10/2019

Set Cover in Sub-linear Time

We study the classic set cover problem from the perspective of sub-linea...
research
03/19/2019

Independent Range Sampling, Revisited Again

We revisit the range sampling problem: the input is a set of points wher...
research
02/06/2023

Stochastic Minimum Vertex Cover in General Graphs: a 3/2-Approximation

Our main result is designing an algorithm that returns a vertex cover of...
research
05/28/2021

A lower bound for essential covers of the cube

Essential covers were introduced by Linial and Radhakrishnan as a model ...
research
03/23/2022

Modules in Robinson Spaces

A Robinson space is a dissimilarity space (X,d) (i.e., a set X of size n...
research
02/28/2019

Probabilistic smallest enclosing ball in high dimensions via subgradient sampling

We study a variant of the median problem for a collection of point sets ...

Please sign up or login with your details

Forgot password? Click here to reset