Submodular Optimization for Efficient Semi-supervised Support Vector Machines

07/26/2011
by Wael Emara, et al.

In this work we present a quadratic programming approximation of the Semi-Supervised Support Vector Machine (S3VM) problem, called approximate QP-S3VM, that can be solved efficiently with off-the-shelf optimization packages. We prove that this approximate formulation establishes a relation between the low-density separation and graph-based models of semi-supervised learning (SSL), which is an important step toward a unifying framework for semi-supervised learning methods. Furthermore, we propose the novel idea of representing SSL problems as submodular set functions and solving them with efficient submodular optimization algorithms. Using this idea, we represent the approximate QP-S3VM as the maximization of a submodular set function, which makes it possible to optimize it with efficient greedy algorithms. We demonstrate that the proposed methods are accurate and provide a significant improvement in time complexity over the state of the art in the literature.
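To make the idea of greedy submodular maximization concrete, here is a minimal Python sketch. It is not the paper's QP-S3VM objective, which the abstract does not specify. It assumes a generic monotone submodular set function under a cardinality constraint and uses a toy coverage function as a stand-in. The names `greedy_maximize`, `coverage`, and `COVERS` are hypothetical and chosen only for illustration.

```python
from typing import Callable, FrozenSet, Hashable, List, Sequence

# Toy data (assumption): each candidate item "covers" a set of points.
# Coverage functions are a standard example of monotone submodular functions.
COVERS = {
    "a": {1, 2, 3},
    "b": {3, 4},
    "c": {4, 5, 6, 7},
    "d": {1, 7},
}


def coverage(subset: FrozenSet[Hashable]) -> float:
    """Number of distinct points covered by the chosen items."""
    covered = set()
    for name in subset:
        covered |= COVERS[name]
    return float(len(covered))


def greedy_maximize(
    ground_set: Sequence[Hashable],
    f: Callable[[FrozenSet[Hashable]], float],
    k: int,
) -> List[Hashable]:
    """Pick up to k items, each time adding the one with the largest marginal gain."""
    selected: List[Hashable] = []
    chosen: FrozenSet[Hashable] = frozenset()
    remaining = list(ground_set)  # keep input order so ties break deterministically
    for _ in range(k):
        base = f(chosen)
        best_item, best_gain = None, 0.0
        for item in remaining:
            gain = f(chosen | {item}) - base  # marginal gain of adding this item
            if gain > best_gain:
                best_item, best_gain = item, gain
        if best_item is None:  # no item improves the objective; stop early
            break
        selected.append(best_item)
        chosen = chosen | {best_item}
        remaining.remove(best_item)
    return selected


picked = greedy_maximize(["a", "b", "c", "d"], coverage, k=2)
print(picked, coverage(frozenset(picked)))
```

For monotone submodular functions under a cardinality constraint, this simple greedy rule is known to achieve at least a (1 - 1/e) fraction of the optimal value, which is one reason a submodular representation can be attractive. Whether the paper uses this exact variant is not stated in the abstract.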

research
06/07/2020

Optimally Combining Classifiers for Semi-Supervised Learning

This paper considers semi-supervised learning for tabular data. It is wi...
research
02/14/2012

Active Semi-Supervised Learning using Submodular Functions

We consider active, semi-supervised learning in an offline transductive ...
research
07/08/2016

Graph Construction with Label Information for Semi-Supervised Learning

In the literature, most existing graph-based semi-supervised learning (S...
research
06/19/2022

Semi-supervised Change Detection of Small Water Bodies Using RGB and Multispectral Images in Peruvian Rainforests

Artisanal and Small-scale Gold Mining (ASGM) is an important source of i...
research
02/26/2019

Quadratic Decomposable Submodular Function Minimization: Theory and Practice

We introduce a new convex optimization problem, termed quadratic decompo...
research
03/24/2019

Exploiting Synthetically Generated Data with Semi-Supervised Learning for Small and Imbalanced Datasets

Data augmentation is rapidly gaining attention in machine learning. Synt...
research
04/20/2020

Flow-based Algorithms for Improving Clusters: A Unifying Framework, Software, and Performance

Clustering points in a vector space or nodes in a graph is a ubiquitous ...
