Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees

01/23/2020
by   Yongchun Li, et al.
0

This paper studies a classic maximum entropy sampling problem (MESP), which aims to select the most informative principal submatrix of a prespecified size from a covariance matrix. MESP has been widely applied to many areas, including healthcare, power system, manufacturing and data science. By investigating its Lagrangian dual and primal characterization, we derive a novel convex integer program for MESP and show that its continuous relaxation yields a near-optimal solution. The results motivate us to study an efficient sampling algorithm and develop its approximation bound for MESP, which improves the best-known bound in literature. We then provide an efficient deterministic implementation of the sampling algorithm with the same approximation bound. By developing new mathematical tools for the singular matrices and analyzing the Lagrangian dual of the proposed convex integer program, we investigate the widely-used local search algorithm and prove its first-known approximation bound for MESP. The proof techniques further inspire us with an efficient implementation of the local search algorithm. Our numerical experiments demonstrate that these approximation algorithms can efficiently solve medium-sized and large-scale instances to near-optimality. Our proposed algorithms are coded and released as open-source software. Finally, we extend the analyses to the A-Optimal MESP (A-MESP), where the objective is to minimize the trace of the inverse of the selected principal submatrix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2020

Exact and Approximation Algorithms for Sparse PCA

Sparse PCA (SPCA) is a fundamental model in machine learning and data an...
research
12/20/2020

Outliers Detection Is Not So Hard: Approximation Algorithms for Robust Clustering Problems Using Local Search Techniques

In this paper, we consider two types of robust models of the k-median/k-...
research
06/04/2013

Iterated Tabu Search Algorithm for Packing Unequal Circles in a Circle

This paper presents an Iterated Tabu Search algorithm (denoted by ITS-PU...
research
02/20/2023

A novel dual-decomposition method based on p-Lagrangian relaxation

In this paper, we propose the novel p-branch-and-bound method for solvin...
research
07/11/2022

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

The Uncapacitated Facility Location (UFL) problem is one of the most fun...
research
01/19/2019

On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms

We provide theoretical analyses for two algorithms that solve the regula...
research
11/06/2017

Computing Maximum Entropy Distributions Everywhere

We study the problem of computing the maximum entropy distribution with ...

Please sign up or login with your details

Forgot password? Click here to reset