Maximin Optimization for Binary Regression

10/10/2020
by   Nisan Chiprut, et al.
6

We consider regression problems with binary weights. Such optimization problems are ubiquitous in quantized learning models and digital communication systems. A natural approach is to optimize the corresponding Lagrangian using variants of the gradient ascent-descent method. Such maximin techniques are still poorly understood even in the concave-convex case. The non-convex binary constraints may lead to spurious local minima. Interestingly, we prove that this approach is optimal in linear regression with low noise conditions as well as robust regression with a small number of outliers. Practically, the method also performs well in regression with cross entropy loss, as well as non-convex multi-layer neural networks. Taken together our approach highlights the potential of saddle-point optimization for learning constrained models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2021

Why Do Local Methods Solve Nonconvex Problems?

Non-convex optimization is ubiquitous in modern machine learning. Resear...
research
08/31/2023

An Efficient Framework for Global Non-Convex Polynomial Optimization over the Hypercube

We present a novel efficient theoretical and numerical framework for sol...
research
11/20/2017

Optimistic Robust Optimization With Applications To Machine Learning

Robust Optimization has traditionally taken a pessimistic, or worst-case...
research
04/17/2018

Two-Player Games for Efficient Non-Convex Constrained Optimization

In recent years, constrained optimization has become increasingly releva...
research
10/15/2021

Gradient Descent on Infinitely Wide Neural Networks: Global Convergence and Generalization

Many supervised machine learning methods are naturally cast as optimizat...
research
06/14/2021

A scalable multi-step least squares method for network identification with unknown disturbance topology

Identification methods for dynamic networks typically require prior know...
research
09/26/2021

Data Summarization via Bilevel Optimization

The increasing availability of massive data sets poses a series of chall...

Please sign up or login with your details

Forgot password? Click here to reset