Meta Discovery: Learning to Discover Novel Classes given Very Limited Data

02/08/2021
by   Haoang Chi, et al.
0

In learning to discover novel classes(L2DNC), we are given labeled data from seen classes and unlabeled data from unseen classes, and we need to train clustering models for the unseen classes. Since L2DNC is a new problem, its application scenario and implicit assumption are unclear. In this paper, we analyze and improve it by linking it to meta-learning: although there are no meta-training and meta-test phases, the underlying assumption is exactly the same, namely high-level semantic features are shared among the seen and unseen classes. Under this assumption, L2DNC is not only theoretically solvable, but also can be empirically solved by meta-learning algorithms slightly modified to fit our proposed framework. This L2DNC methodology significantly reduces the amount of unlabeled data needed for training and makes it more practical, as demonstrated in experiments. The use of very limited data is also justified by the application scenario of L2DNC: since it is unnatural to label only seen-class data, L2DNC is causally sampling instead of labeling. The unseen-class data should be collected on the way of collecting seen-class data, which is why they are novel and first need to be clustered.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2018

Learning to Accept New Classes without Training

Classic supervised learning makes the closed-world assumption, meaning t...
research
06/24/2022

Mutual Information-guided Knowledge Transfer for Novel Class Discovery

We tackle the novel class discovery problem, aiming to discover novel cl...
research
08/21/2023

MetaGCD: Learning to Continually Learn in Generalized Category Discovery

In this paper, we consider a real-world scenario where a model that is t...
research
06/22/2023

An Interactive Interface for Novel Class Discovery in Tabular Data

Novel Class Discovery (NCD) is the problem of trying to discover novel c...
research
10/13/2022

Exploiting Mixed Unlabeled Data for Detecting Samples of Seen and Unseen Out-of-Distribution Classes

Out-of-Distribution (OOD) detection is essential in real-world applicati...
research
06/06/2022

GenSDF: Two-Stage Learning of Generalizable Signed Distance Functions

We investigate the generalization capabilities of neural signed distance...
research
06/29/2021

Open-Set Representation Learning through Combinatorial Embedding

Visual recognition tasks are often limited to dealing with a small subse...

Please sign up or login with your details

Forgot password? Click here to reset