Improved Algorithms for Efficient Active Learning Halfspaces with Massart and Tsybakov noise

02/10/2021
by   Chicheng Zhang, et al.
0

We develop a computationally-efficient PAC active learning algorithm for d-dimensional homogeneous halfspaces that can tolerate Massart noise <cit.> and Tsybakov noise <cit.>. Specialized to the η-Massart noise setting, our algorithm achieves an information-theoretic optimal label complexity of Õ( d/(1-2η)^2polylog(1/ϵ) ) under a wide range of unlabeled data distributions (specifically, the family of "structured distributions" defined in <cit.>). Under the more challenging Tsybakov noise condition, we identify two subfamilies of noise conditions, under which our algorithm achieves computational efficiency and provide label complexity guarantees strictly lower than passive learning algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2018

Efficient active learning of sparse halfspaces

We study the problem of efficient PAC active learning of homogeneous lin...
research
08/08/2011

Activized Learning: Transforming Passive to Active with Improved Label Complexity

We study the theoretical advantages of active learning over passive lear...
research
10/11/2019

Not All are Made Equal: Consistency of Weighted Averaging Estimators Under Active Learning

Active learning seeks to build the best possible model with a budget of ...
research
02/12/2020

Efficient active learning of sparse halfspaces with arbitrary bounded noise

In this work we study active learning of homogeneous s-sparse halfspaces...
research
02/18/2017

Revisiting Perceptron: Efficient and Label-Optimal Learning of Halfspaces

It has been a long-standing problem to efficiently learn a halfspace usi...
research
06/20/2014

Noise-adaptive Margin-based Active Learning and Lower Bounds under Tsybakov Noise Condition

We present a simple noise-robust margin-based active learning algorithm ...
research
01/15/2020

Noise-tolerant, Reliable Active Classification with Comparison Queries

With the explosion of massive, widely available unlabeled data in the pa...

Please sign up or login with your details

Forgot password? Click here to reset