Experimental Design for Overparameterized Learning with Application to Single Shot Deep Active Learning

09/27/2020
by   Neta Shoham, et al.
0

The impressive performance exhibited by modern machine learning models hinges on the ability to train such models on a very large amounts of labeled data. However, since access to large volumes of labeled data is often limited or expensive, it is desirable to alleviate this bottleneck by carefully curating the training set. Optimal experimental design is a well-established paradigm for selecting data point to be labeled so to maximally inform the learning process. Unfortunately, classical theory on optimal experimental design focuses on selecting examples in order to learn underparameterized (and thus, non-interpolative) models, while modern machine learning models such as deep neural networks are overparameterized, and oftentimes are trained to be interpolative. As such, classical experimental design methods are not applicable in many modern learning setups. Indeed, the predictive performance of underparameterized models tends to be variance dominated, so classical experimental design focuses on variance reduction, while the predictive performance of overparameterized models can also be, as is shown in this paper, bias dominated or of mixed nature. In this paper we propose a design strategy that is well suited for overparameterized regression and interpolation, and we demonstrate the applicability of our method in the context of deep learning by proposing a new algorithm for single-shot deep active learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/08/2023

Active Learning for Classifying 2D Grid-Based Level Completability

Determining the completability of levels generated by procedural generat...
research
03/28/2023

Automated wildlife image classification: An active learning tool for ecological applications

Wildlife camera trap images are being used extensively to investigate an...
research
05/23/2022

Bayesian Active Learning with Fully Bayesian Gaussian Processes

The bias-variance trade-off is a well-known problem in machine learning ...
research
06/07/2017

Active Learning for Structured Prediction from Partially Labeled Data

We propose a general purpose active learning algorithm for structured pr...
research
11/21/2018

Robust Active Learning for Electrocardiographic Signal Classification

The classification of electrocardiographic (ECG) signals is a challengin...
research
02/28/2022

Single-shot self-supervised particle tracking

Particle tracking is a fundamental task in digital microscopy. Recently,...
research
03/26/2020

Active Learning Approach to Optimization of Experimental Control

In this work we present a general machine learning based scheme to optim...

Please sign up or login with your details

Forgot password? Click here to reset