Bayesian Active Learning for Discrete Latent Variable Models

02/27/2022
by   Aditi Jha, et al.
11

Active learning seeks to reduce the number of samples required to estimate the parameters of a model, thus forming an important class of techniques in modern machine learning. However, past work on active learning has largely overlooked latent variable models, which play a vital role in neuroscience, psychology, and a variety of other engineering and scientific disciplines. Here we address this gap in the literature and propose a novel framework for maximum-mutual-information input selection for learning discrete latent variable regression models. We first examine a class of models known as "mixtures of linear regressions" (MLR). This example is striking because it is well known that active learning confers no advantage for standard least-squares regression. However, we show – both in simulations and analytically using Fisher information – that optimal input selection can nevertheless provide dramatic gains for mixtures of regression models; we also validate this on a real-world application of MLRs. We then consider a powerful class of temporally structured latent variable models known as Input-Output Hidden Markov Models (IO-HMMs), which have recently gained prominence in neuroscience. We show that our method substantially speeds up learning, and outperforms a variety of approximate methods based on variational and amortized inference.

READ FULL TEXT

page 3

page 6

page 9

page 12

page 26

research
12/17/2018

A Tutorial on Deep Latent Variable Models of Natural Language

There has been much recent, exciting work on combining the complementary...
research
05/27/2016

Asymptotic Analysis of Objectives based on Fisher Information in Active Learning

Obtaining labels can be costly and time-consuming. Active learning allow...
research
08/20/2020

A Value of Information Framework for Latent Variable Models

In this paper, a general value of information (VoI) framework is formali...
research
05/04/2023

On factor copula-based mixed regression models

In this article, a copula-based method for mixed regression models is pr...
research
06/26/2018

Dropout-based Active Learning for Regression

Active learning is relevant and challenging for high-dimensional regress...
research
07/17/2019

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

For many important problems the quantity of interest (or output) is an u...
research
12/01/2021

Structural Sieves

This paper explores the use of deep neural networks for semiparametric e...

Please sign up or login with your details

Forgot password? Click here to reset