Towards Principled Unsupervised Learning

11/19/2015
by   Ilya Sutskever, et al.
0

General unsupervised learning is a long-standing conceptual problem in machine learning. Supervised learning is successful because it can be solved by the minimization of the training error cost function. Unsupervised learning is not as successful, because the unsupervised objective may be unrelated to the supervised task of interest. For an example, density modelling and reconstruction have often been used for unsupervised learning, but they did not produced the sought-after performance gains, because they have no knowledge of the supervised tasks. In this paper, we present an unsupervised cost function which we name the Output Distribution Matching (ODM) cost, which measures a divergence between the distribution of predictions and distributions of labels. The ODM cost is appealing because it is consistent with the supervised cost in the following sense: a perfect supervised classifier is also perfect according to the ODM cost. Therefore, by aggressively optimizing the ODM cost, we are almost guaranteed to improve our supervised performance whenever the space of possible predictions is exponentially large. We demonstrate that the ODM cost works well on number of small and semi-artificial datasets using no (or almost no) labelled training cases. Finally, we show that the ODM cost can be used for one-shot domain adaptation, which allows the model to classify inputs that differ from the input distribution in significant ways without the need for prior exposure to the new domain.

READ FULL TEXT

page 5

page 7

research
12/29/2016

Meta-Unsupervised-Learning: A supervised approach to unsupervised learning

We introduce a new paradigm to investigate unsupervised learning, reduci...
research
06/06/2019

From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation

Many important classification problems, such as object classification, s...
research
03/11/2020

Rethinking Image Mixture for Unsupervised Visual Representation Learning

In supervised learning, smoothing label/prediction distribution in neura...
research
09/14/2017

Supervising Unsupervised Learning

We introduce a framework to leverage knowledge acquired from a repositor...
research
12/23/2018

Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching

We consider the problem of training speech recognition systems without u...
research
12/20/2017

Unsupervised learning of dynamical and molecular similarity using variance minimization

In this report, we present an unsupervised machine learning method for d...
research
12/14/2020

A Perturbation Resilient Framework for Unsupervised Learning

Designing learning algorithms that are resistant to perturbations of the...

Please sign up or login with your details

Forgot password? Click here to reset