Query Complexity of Active Learning for Function Family With Nearly Orthogonal Basis

06/06/2023
by   Xiang Chen, et al.
0

Many machine learning algorithms require large numbers of labeled data to deliver state-of-the-art results. In applications such as medical diagnosis and fraud detection, though there is an abundance of unlabeled data, it is costly to label the data by experts, experiments, or simulations. Active learning algorithms aim to reduce the number of required labeled data points while preserving performance. For many convex optimization problems such as linear regression and p-norm regression, there are theoretical bounds on the number of required labels to achieve a certain accuracy. We call this the query complexity of active learning. However, today's active learning algorithms require the underlying learned function to have an orthogonal basis. For example, when applying active learning to linear regression, the requirement is the target function is a linear composition of a set of orthogonal linear functions, and active learning can find the coefficients of these linear functions. We present a theoretical result to show that active learning does not need an orthogonal basis but rather only requires a nearly orthogonal basis. We provide the corresponding theoretical proofs for the function family of nearly orthogonal basis, and its applications associated with the algorithmically efficient active learning framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/12/2019

ALiPy: Active Learning in Python

Supervised machine learning methods usually require a large set of label...
research
04/23/2021

One-Round Active Learning

Active learning has been a main solution for reducing data labeling cost...
research
12/23/2016

Active Learning and Proofreading for Delineation of Curvilinear Structures

Many state-of-the-art delineation methods rely on supervised machine lea...
research
11/27/2017

Condition number-free query and active learning of linear families

We consider the problem of learning a function from samples with ℓ_2-bou...
research
01/25/2016

A Robust UCB Scheme for Active Learning in Regression from Strategic Crowds

We study the problem of training an accurate linear regression model by ...
research
03/09/2018

Highly Automated Learning for Improved Active Safety of Vulnerable Road Users

Highly automated driving requires precise models of traffic participants...
research
11/08/2018

Active Learning using Deep Bayesian Networks for Surgical Workflow Analysis

For many applications in the field of computer assisted surgery, such as...

Please sign up or login with your details

Forgot password? Click here to reset