TAGLETS: A System for Automatic Semi-Supervised Learning with Auxiliary Data

11/08/2021
by   Wasu Piriyakulkij, et al.
0

Machine learning practitioners often have access to a spectrum of data: labeled data for the target task (which is often limited), unlabeled data, and auxiliary data, the many available labeled datasets for other tasks. We describe TAGLETS, a system built to study techniques for automatically exploiting all three types of data and creating high-quality, servable classifiers. The key components of TAGLETS are: (1) auxiliary data organized according to a knowledge graph, (2) modules encapsulating different methods for exploiting auxiliary and unlabeled data, and (3) a distillation stage in which the ensembled modules are combined into a servable model. We compare TAGLETS with state-of-the-art transfer learning and semi-supervised learning methods on four image classification tasks. Our study covers a range of settings, varying the amount of labeled data and the semantic relatedness of the auxiliary data to the target task. We find that the intelligent incorporation of auxiliary and unlabeled data into multiple learning techniques enables TAGLETS to match-and most often significantly surpass-these alternatives. TAGLETS is available as an open-source system at github.com/BatsResearch/taglets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2022

AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data

Semi-supervised learning (SSL) has seen great strides when labeled data ...
research
09/22/2018

Semi-Supervised Sequence Modeling with Cross-View Training

Unsupervised representation learning algorithms such as word2vec and ELM...
research
10/16/2020

Auxiliary Task Reweighting for Minimum-data Learning

Supervised learning requires a large amount of training data, limiting i...
research
01/31/2023

NP-Match: Towards a New Probabilistic Model for Semi-Supervised Learning

Semi-supervised learning (SSL) has been widely explored in recent years,...
research
04/28/2022

On tuning a mean-field model for semi-supervised classification

Semi-supervised learning (SSL) has become an interesting research area d...
research
02/01/2022

Deep Reference Priors: What is the best way to pretrain a model?

What is the best way to exploit extra data – be it unlabeled data from t...
research
08/18/2021

STAR: Noisy Semi-Supervised Transfer Learning for Visual Classification

Semi-supervised learning (SSL) has proven to be effective at leveraging ...

Please sign up or login with your details

Forgot password? Click here to reset