Optimally Combining Classifiers for Semi-Supervised Learning

06/07/2020
by   Zhiguo Wang, et al.
15

This paper considers semi-supervised learning for tabular data. It is widely known that Xgboost based on tree model works well on the heterogeneous features while transductive support vector machine can exploit the low density separation assumption. However, little work has been done to combine them together for the end-to-end semi-supervised learning. In this paper, we find these two methods have complementary properties and larger diversity, which motivates us to propose a new semi-supervised learning method that is able to adaptively combine the strengths of Xgboost and transductive support vector machine. Instead of the majority vote rule, an optimization problem in terms of ensemble weight is established, which helps to obtain more accurate pseudo labels for unlabeled data. The experimental results on the UCI data sets and real commercial data set demonstrate the superior classification performance of our method over the five state-of-the-art algorithms improving test accuracy by about 3%-4%. The partial code can be found at https://github.com/hav-cam-mit/CTO.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 6

page 7

page 9

page 10

research
12/23/2016

RSSL: Semi-supervised Learning in R

In this paper, we introduce a package for semi-supervised learning resea...
research
07/26/2011

Submodular Optimization for Efficient Semi-supervised Support Vector Machines

In this work we present a quadratic programming approximation of the Sem...
research
06/09/2016

Mutual Exclusivity Loss for Semi-Supervised Deep Learning

In this paper we consider the problem of semi-supervised learning with d...
research
09/14/2020

Fairness Constraints in Semi-supervised Learning

Fairness in machine learning has received considerable attention. Howeve...
research
01/15/2020

Two Cycle Learning: Clustering Based Regularisation for Deep Semi-Supervised Classification

This works addresses the challenge of classification with minimal annota...
research
12/06/2021

Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data

Despite great strides made on fine-grained visual classification (FGVC),...
research
07/26/2019

Scalable Semi-Supervised SVM via Triply Stochastic Gradients

Semi-supervised learning (SSL) plays an increasingly important role in t...

Please sign up or login with your details

Forgot password? Click here to reset