Stochastic Majorization-Minimization Algorithms for Large-Scale Optimization

06/19/2013
by   Julien Mairal, et al.
0

Majorization-minimization algorithms consist of iteratively minimizing a majorizing surrogate of an objective function. Because of its simplicity and its wide applicability, this principle has been very popular in statistics and in signal processing. In this paper, we intend to make this principle scalable. We introduce a stochastic majorization-minimization scheme which is able to deal with large-scale or possibly infinite data sets. When applied to convex optimization problems under suitable assumptions, we show that it achieves an expected convergence rate of O(1/√(n)) after n iterations, and of O(1/n) for strongly convex functions. Equally important, our scheme almost surely converges to stationary points for a large class of non-convex problems. We develop several efficient algorithms based on our framework. First, we propose a new stochastic proximal gradient method, which experimentally matches state-of-the-art solvers for large-scale ℓ_1-logistic regression. Second, we develop an online DC programming algorithm for non-convex sparse estimation. Finally, we demonstrate the effectiveness of our approach for solving large-scale structured matrix factorization problems.

READ FULL TEXT

page 8

page 24

page 25

research
02/18/2014

Incremental Majorization-Minimization Optimization with Application to Large-Scale Machine Learning

Majorization-minimization algorithms consist of successively minimizing ...
research
05/14/2013

Optimization with First-Order Surrogate Functions

In this paper, we study optimization methods consisting of iteratively m...
research
01/05/2022

Convergence and Complexity of Stochastic Block Majorization-Minimization

Stochastic majorization-minimization (SMM) is an online extension of the...
research
05/11/2023

Stochastic Variance-Reduced Majorization-Minimization Algorithms

We study a class of nonconvex nonsmooth optimization problems in which t...
research
10/28/2019

A First-Order Algorithmic Framework for Wasserstein Distributionally Robust Logistic Regression

Wasserstein distance-based distributionally robust optimization (DRO) ha...
research
04/26/2016

Efficient Splitting-based Method for Global Image Smoothing

Edge-preserving smoothing (EPS) can be formulated as minimizing an objec...
research
11/16/2012

Distance Majorization and Its Applications

The problem of minimizing a continuously differentiable convex function ...

Please sign up or login with your details

Forgot password? Click here to reset