Proportional Volume Sampling and Approximation Algorithms for A-Optimal Design

02/22/2018
by   Aleksandar Nikolov, et al.
0

We study the A-optimal design problem where we are given vectors v_1,...,v_n∈R^d, an integer k≥ d, and the goal is to select a set S of k vectors that minimizes the trace of (∑_i∈ Sv_iv_i^)^-1. Traditionally, the problem is an instance of optimal design of experiments in statistics where each vector corresponds to a linear measurement of an unknown vector and the goal is to pick k of them that minimize the average variance of the error in the maximum likelihood estimate of the vector being measured. The problem also finds applications in sensor placement in wireless networks, sparse least squares regression, feature selection for k-means clustering, and matrix approximation. In this paper, we introduce proportional volume sampling to obtain improved approximation algorithms for A-optimal design. Given a matrix, proportional volume sampling picks a set of columns S of size k with probability proportional to μ(S) times (∑_i∈ Sv_iv_i^) for some measure μ. Our main result is to show the approximability of the A-optimal design problem can be reduced to approximate independence properties of the measure μ. We appeal to hard-core distributions as candidate distributions μ that allow us to obtain improved approximation algorithms for the A-optimal design. Our results include a d-approximation when k=d, an (1+ϵ)-approximation when k=Ω(d/ϵ+1/ϵ^21/ϵ) and k/k-d+1-approximation when repetitions of vectors are allowed in the solution. We consider generalization of the problem for k≤ d and obtain a k-approximation. The last result implies a restricted invertibility principle for the harmonic mean of singular values. We also show that the problem is NP-hard to approximate within a fixed constant when k=d.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2018

Approximate Positively Correlated Distributions and Approximation Algorithms for D-optimal Design

Experimental design is a classical problem in statistics and has also fo...
research
06/19/2020

λ-Regularized A-Optimal Design and its Approximation by λ-Regularized Proportional Volume Sampling

In this work, we study the λ-regularized A-optimal design problem and in...
research
07/13/2018

Approximation Algorithms for Clustering via Weighted Impurity Measures

An impurity measures I:R^k →R^+ maps a k-dimensional vector v to a non-...
research
11/09/2020

On proportional volume sampling for experimental design in general spaces

Optimal design for linear regression is a fundamental task in statistics...
research
07/10/2017

Subdeterminant Maximization via Nonconvex Relaxations and Anti-concentration

Several fundamental problems that arise in optimization and computer sci...
research
02/23/2019

Fast Distributed Backup Placement in Sparse and Dense Graphs

We consider the Backup Placement problem in networks in the CONGEST dist...
research
04/27/2020

Biomechanical surrogate modelling using stabilized vectorial greedy kernel methods

Greedy kernel approximation algorithms are successful techniques for spa...

Please sign up or login with your details

Forgot password? Click here to reset