Condition number-free query and active learning of linear families

11/27/2017
by   Xue Chen, et al.
0

We consider the problem of learning a function from samples with ℓ_2-bounded noise. In the simplest agnostic learning setting, the number of samples required for robust estimation depends on a condition number that can be arbitrarily large. We show how to improve this dependence in two natural extensions of the setting: a query access setting, where we can estimate the function at arbitrary points, and an active learning setting, where we get a large number of unlabeled points and choose a small subset to label. For linear spaces of functions, such as the family of n-variate degree-d polynomials, this eliminates the dependence on the condition number. The technique can also yield improvements for nonlinear spaces, as we demonstrate for the family of k-Fourier-sparse signals with continuous frequencies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2019

Disagreement-based Active Learning in Online Settings

We study online active learning for classifying streaming instances with...
research
04/19/2019

Online Active Learning: Label Complexity vs. Classification Errors

We study online active learning for classifying streaming instances. At ...
research
06/06/2023

Query Complexity of Active Learning for Function Family With Nearly Orthogonal Basis

Many machine learning algorithms require large numbers of labeled data t...
research
01/15/2020

Noise-tolerant, Reliable Active Classification with Comparison Queries

With the explosion of massive, widely available unlabeled data in the pa...
research
10/24/2022

Active Learning for Single Neuron Models with Lipschitz Non-Linearities

We consider the problem of active learning for single neuron models, als...
research
07/09/2019

Reconstruction under outliers for Fourier-sparse functions

We consider the problem of learning an unknown f with a sparse Fourier s...
research
02/09/2021

Bounded Memory Active Learning through Enriched Queries

The explosive growth of easily-accessible unlabeled data has lead to gro...

Please sign up or login with your details

Forgot password? Click here to reset