Bayesian experimental design using regularized determinantal point processes

06/10/2019
by   Michał Dereziński, et al.
0

In experimental design, we are given n vectors in d dimensions, and our goal is to select k≪ n of them to perform expensive measurements, e.g., to obtain labels/responses, for a linear regression task. Many statistical criteria have been proposed for choosing the optimal design, with popular choices including A- and D-optimality. If prior knowledge is given, typically in the form of a d× d precision matrix A, then all of the criteria can be extended to incorporate that information via a Bayesian framework. In this paper, we demonstrate a new fundamental connection between Bayesian experimental design and determinantal point processes, the latter being widely used for sampling diverse subsets of data. We use this connection to develop new efficient algorithms for finding (1+ϵ)-approximations of optimal designs under four optimality criteria: A, C, D and V. Our algorithms can achieve this when the desired subset size k is Ω(d_ A/ϵ + 1/ϵ/ϵ^2), where d_ A≤ d is the A-effective dimension, which can often be much smaller than d. Our results offer direct improvements over a number of prior works, for both Bayesian and classical experimental design, in terms of algorithm efficiency, approximation quality, and range of applicable criteria.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2018

Removal of the points that do not support an E-optimal experimental design

We propose a method of removal of design points that cannot support any ...
research
02/25/2021

Efficient computational algorithms for approximate optimal designs

In this paper, we propose two simple yet efficient computational algorit...
research
11/14/2017

Near-Optimal Discrete Optimization for Experimental Design: A Regret Minimization Approach

The experimental design problem concerns the selection of k points from ...
research
04/06/2023

Optimal subsampling designs

Subsampling is commonly used to overcome computational and economical bo...
research
10/23/2014

Attribute Efficient Linear Regression with Data-Dependent Sampling

In this paper we analyze a budgeted learning setting, in which the learn...
research
01/14/2019

Optimality Criteria for Probabilistic Numerical Methods

It is well understood that Bayesian decision theory and average case ana...
research
11/09/2020

On proportional volume sampling for experimental design in general spaces

Optimal design for linear regression is a fundamental task in statistics...

Please sign up or login with your details

Forgot password? Click here to reset