Convergence Rates of Active Learning for Maximum Likelihood Estimation

06/08/2015
by   Kamalika Chaudhuri, et al.
0

An active learner is given a class of models, a large set of unlabeled examples, and the ability to interactively query labels of a subset of these examples; the goal of the learner is to learn a model in the class that fits the data well. Previous theoretical work has rigorously characterized label complexity of active learning, but most of this work has focused on the PAC or the agnostic PAC model. In this paper, we shift our attention to a more general setting -- maximum likelihood estimation. Provided certain conditions hold on the model class, we provide a two-stage active learning algorithm for this problem. The conditions we require are fairly general, and cover the widely popular class of Generalized Linear Models, which in turn, include models for binary and multi-class classification, regression, and conditional random fields. We provide an upper bound on the label requirement of our algorithm, and a lower bound that matches it up to lower order terms. Our analysis shows that unlike binary classification in the realizable case, just a single extra round of interaction is sufficient to achieve near-optimal performance in maximum likelihood estimation. On the empirical side, the recent work in Zhang12 and Zhang14 (on active linear and logistic regression) shows the promise of this approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2015

Active Learning from Weak and Strong Labelers

An active learner is given a hypothesis class, a large set of unlabeled ...
research
06/09/2022

Trimmed Maximum Likelihood Estimation for Robust Learning in Generalized Linear Models

We study the problem of learning generalized linear models under adversa...
research
10/29/2021

Convergence of Uncertainty Sampling for Active Learning

Uncertainty sampling in active learning is heavily used in practice to r...
research
05/29/2018

Active and Adaptive Sequential learning

A framework is introduced for actively and adaptively solving a sequence...
research
09/10/2018

Learning Time Dependent Choice

We explore questions dealing with the learnability of models of choice o...
research
06/23/2022

Regression with Label Permutation in Generalized Linear Model

The assumption that response and predictor belong to the same statistica...
research
06/12/2021

Semi-supervised Active Regression

Labelled data often comes at a high cost as it may require recruiting hu...

Please sign up or login with your details

Forgot password? Click here to reset