BatchGFN: Generative Flow Networks for Batch Active Learning

06/26/2023
by Shreshth A. Malik, et al.

We introduce BatchGFN, a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points with probability proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active learning in a principled way. In toy regression problems, we show that our approach enables sampling near-optimal utility batches at inference time with a single forward pass per point in the batch. This alleviates the computational cost of batch-aware algorithms and removes the need for greedy approximations to find maximizers of the batch reward. We also present early results for amortizing training across acquisition steps, which will enable scaling to real-world tasks.
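
To make the sampling target concrete, the following is a minimal sketch of what "sampling batches in proportion to a batch reward" means on a tiny pool. It is not the BatchGFN method itself: it enumerates every candidate batch by brute force, which is exactly the cost a trained generative flow network is meant to avoid. The model (Bayesian linear regression with a standard normal prior), the noise variance, the reward temperature `beta`, and all function names are assumptions chosen for illustration. For this model the joint mutual information between the batch labels and the parameters has the closed form 0.5 * logdet(I + X_B X_B^T / noise_var).

```python
import itertools
import numpy as np


def joint_mutual_information(X_batch, noise_var=0.1):
    """I(y_B; theta) for Bayesian linear regression, assuming theta ~ N(0, I)
    and Gaussian observation noise with variance noise_var."""
    k = X_batch.shape[0]
    gram = X_batch @ X_batch.T / noise_var
    _, logdet = np.linalg.slogdet(np.eye(k) + gram)
    return 0.5 * logdet


def batch_target_distribution(X_pool, batch_size, beta=5.0, noise_var=0.1):
    """Enumerate all batches and return p(B) proportional to exp(beta * I(y_B; theta)).

    beta is a hypothetical temperature that sharpens the distribution
    around high-reward batches.
    """
    batches = list(itertools.combinations(range(len(X_pool)), batch_size))
    log_rewards = np.array(
        [beta * joint_mutual_information(X_pool[list(b)], noise_var) for b in batches]
    )
    log_rewards -= log_rewards.max()  # subtract the max for numerical stability
    probs = np.exp(log_rewards)
    probs /= probs.sum()
    return batches, probs


rng = np.random.default_rng(0)
X_pool = rng.normal(size=(8, 2))  # toy pool: 8 candidate points in 2 dimensions
batches, probs = batch_target_distribution(X_pool, batch_size=3)

# Draw one batch from the target distribution, and find the single best batch.
sampled_batch = batches[rng.choice(len(batches), p=probs)]
best_batch = batches[int(np.argmax(probs))]
print("sampled batch:", sampled_batch)
print("highest-reward batch:", best_batch)
```

Because the reward is a joint quantity, a batch of near-duplicate points scores lower than a batch of diverse points, which is the batch-aware behavior the abstract refers to. With a pool of 8 points and batches of 3 there are only 56 candidates; the enumeration grows combinatorially with pool and batch size, which motivates learning a sampler instead.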

