Dual coordinate solvers for large-scale structural SVMs

12/06/2013
by   Deva Ramanan, et al.

This manuscript describes a method for training linear SVMs (including binary SVMs, SVM regression, and structural SVMs) from large, out-of-core training datasets. Current strategies for large-scale learning fall into one of two camps: batch algorithms, which solve the learning problem given a finite dataset, and online algorithms, which can process out-of-core datasets. The former typically require datasets small enough to fit in memory. The latter are often phrased as stochastic optimization problems; such algorithms enjoy strong theoretical properties but often require manually tuned annealing schedules, and may converge slowly for problems with large output spaces (e.g., structural SVMs). We discuss an algorithm for an "intermediate" regime in which the data is too large to fit in memory, but the active constraints (support vectors) are small enough to remain in memory. In this case, one can design rather efficient learning algorithms that are as stable as batch algorithms, but capable of processing out-of-core datasets. We have developed such a MATLAB-based solver and used it to train a collection of recognition systems for articulated pose estimation, facial analysis, 3D object recognition, and action classification, all with publicly available code. This writeup describes the solver in detail.
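To make the "intermediate" regime concrete, the following is a minimal Python sketch, not the authors' MATLAB solver. It assumes the simplest case (a binary linear SVM with hinge loss) and uses textbook dual coordinate descent; the function name `train_streaming_dual_cd`, its parameters, and the cache layout are hypothetical choices made for illustration. Examples are read once from a stream, only margin violators are kept in an in-memory cache, and the dual variables of the cached examples are re-optimized whenever a new one is added.

```python
import random


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def train_streaming_dual_cd(stream, dim, C=1.0, inner_passes=10, seed=0):
    """Single pass over `stream` of (x, y) pairs, y in {-1, +1}.

    Illustrative sketch only: keeps margin violators in memory and runs
    dual coordinate descent over that cache.
    """
    rng = random.Random(seed)
    w = [0.0] * dim
    cache = {}  # example index -> [x, y, alpha, ||x||^2]

    def coordinate_step(entry):
        x, y, alpha, sq_norm = entry
        if sq_norm == 0.0:
            return
        # Gradient of the dual objective with respect to this alpha.
        grad = y * dot(w, x) - 1.0
        # Exact one-dimensional minimizer, clipped to the box [0, C].
        new_alpha = min(max(alpha - grad / sq_norm, 0.0), C)
        delta = new_alpha - alpha
        if delta != 0.0:
            for j in range(dim):
                w[j] += delta * y * x[j]
            entry[2] = new_alpha

    for i, (x, y) in enumerate(stream):
        if y * dot(w, x) >= 1.0:
            continue  # margin satisfied: the example never enters the cache
        cache[i] = [list(x), y, 0.0, dot(x, x)]
        # Re-optimize over everything currently held in memory.
        for _ in range(inner_passes):
            keys = list(cache)
            rng.shuffle(keys)
            for k in keys:
                coordinate_step(cache[k])
        # Drop examples whose dual variable returned to zero.
        for k in [k for k, e in cache.items() if e[2] == 0.0]:
            del cache[k]
    return w, cache
```

In this sketch, memory use scales with the number of cached violators rather than with the size of the stream, and no step size or annealing schedule is needed because each coordinate update is solved in closed form. A structural SVM would need a loss-augmented inference step in place of the simple margin check, which this example does not attempt.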

research
07/03/2022

Recipe for Fast Large-scale SVM Training: Polishing, Parallelism, and more RAM!

Support vector machines (SVMs) are a standard method in the machine lear...
research
02/10/2021

An exact solver for the Weston-Watkins SVM subproblem

Recent empirical evidence suggests that the Weston-Watkins support vecto...
research
07/26/2020

Approaches of large-scale images recognition with more than 50,000 categoris

Though current CV models have been able to achieve high levels of accura...
research
04/16/2020

MPLP++: Fast, Parallel Dual Block-Coordinate Ascent for Dense Graphical Models

Dense, discrete Graphical Models with pairwise potentials are a powerful...
research
10/13/2010

Online Multiple Kernel Learning for Structured Prediction

Despite the recent progress towards efficient multiple kernel learning (...
research
06/26/2015

Finding Linear Structure in Large Datasets with Scalable Canonical Correlation Analysis

Canonical Correlation Analysis (CCA) is a widely used spectral technique...
research
02/20/2020

Nyström Subspace Learning for Large-scale SVMs

As an implementation of the Nyström method, Nyström computational regula...
