λ-Regularized A-Optimal Design and its Approximation by λ-Regularized Proportional Volume Sampling

06/19/2020
by   Uthaipon Tantipongpipat, et al.
0

In this work, we study the λ-regularized A-optimal design problem and introduce the λ-regularized proportional volume sampling algorithm, generalized from [Nikolov, Singh, and Tantipongpipat, 2019], for this problem with the approximation guarantee that extends upon the previous work. In this problem, we are given vectors v_1,…,v_n∈ℝ^d in d dimensions, a budget k≤ n, and the regularizer parameter λ≥0, and the goal is to find a subset S⊆ [n] of size k that minimizes the trace of (∑_i∈ Sv_iv_i^⊤ + λ I_d)^-1 where I_d is the d× d identity matrix. The problem is motivated from optimal design in ridge regression, where one tries to minimize the expected squared error of the ridge regression predictor from the true coefficient in the underlying linear model. We introduce λ-regularized proportional volume sampling and give its polynomial-time implementation to solve this problem. We show its (1+ϵ/√(1+λ'))-approximation for k=Ω(d/ϵ+log 1/ϵ/ϵ^2) where λ' is proportional to λ, extending the previous bound in [Nikolov, Singh, and Tantipongpipat, 2019] to the case λ>0 and obtaining asymptotic optimality as λ→∞.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/22/2018

Proportional Volume Sampling and Approximation Algorithms for A-Optimal Design

We study the A-optimal design problem where we are given vectors v_1,......
research
11/09/2020

On proportional volume sampling for experimental design in general spaces

Optimal design for linear regression is a fundamental task in statistics...
research
06/10/2020

On the Optimal Weighted ℓ_2 Regularization in Overparameterized Linear Regression

We consider the linear model 𝐲 = 𝐗β_⋆ + ϵ with 𝐗∈ℝ^n× p in the overparam...
research
04/10/2022

Optimal Subsampling for Large Sample Ridge Regression

Subsampling is a popular approach to alleviating the computational burde...
research
03/08/2017

Polynomial Time Algorithms for Dual Volume Sampling

We study dual volume sampling, a method for selecting k columns from an ...
research
10/08/2011

Regularized Laplacian Estimation and Fast Eigenvector Approximation

Recently, Mahoney and Orecchia demonstrated that popular diffusion-based...
research
02/23/2018

Approximate Positively Correlated Distributions and Approximation Algorithms for D-optimal Design

Experimental design is a classical problem in statistics and has also fo...

Please sign up or login with your details

Forgot password? Click here to reset