Open-World Semi-Supervised Learning

02/06/2021
by   Kaidi Cao, et al.

Supervised and semi-supervised learning methods have traditionally been designed for the closed-world setting, based on the assumption that unlabeled test data contains only classes previously encountered in the labeled training data. However, the real world is inherently open and dynamic, and thus novel, previously unseen classes may appear in the test data or during model deployment. Here, we introduce a new open-world semi-supervised learning setting in which the model is required to recognize previously seen classes, as well as to discover novel classes never seen in the labeled dataset. To tackle the problem, we propose ORCA, an approach that learns to simultaneously classify and cluster the data. ORCA classifies examples from the unlabeled dataset to previously seen classes, or forms a novel class by grouping similar examples together. The key idea in ORCA is to introduce an uncertainty-based adaptive margin that effectively circumvents the bias caused by the imbalance of variance between seen and novel classes/clusters. We demonstrate that ORCA accurately discovers novel classes and assigns samples to previously seen classes on benchmark image classification datasets, including CIFAR and ImageNet. Remarkably, despite solving the harder task, ORCA outperforms semi-supervised methods on seen classes, as well as novel class discovery methods on novel classes, achieving 7% and 151% improvements on seen and novel classes in the ImageNet dataset.
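The abstract does not spell out how the uncertainty-based adaptive margin is computed, so the following is only a minimal sketch of one way such a margin could work, not ORCA's actual objective. It assumes that uncertainty is measured as the average of one minus the maximum softmax probability over unlabeled examples, and that the margin is added to the true-class logit of labeled examples, so that high uncertainty weakens the supervised signal. The function names, the weight `lam`, and the toy logits are all hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mean_uncertainty(unlabeled_logits):
    """Average of (1 - max class probability) over unlabeled examples.

    Assumption: this is one plausible uncertainty estimate; the abstract
    does not define the one ORCA uses.
    """
    scores = [1.0 - max(softmax(row)) for row in unlabeled_logits]
    return sum(scores) / len(scores)


def margin_cross_entropy(logits, label, margin):
    """Cross-entropy after shifting the true-class logit by `margin`.

    Assumption: a positive margin is added to the true-class logit, which
    lowers the loss and therefore softens the pull toward seen classes.
    """
    shifted = list(logits)
    shifted[label] += margin
    return -math.log(softmax(shifted)[label])


# Hypothetical toy values: three classes, arbitrary margin weight.
lam = 1.0
unlabeled = [[0.1, 0.0, -0.1], [2.0, -1.0, 0.5]]
u = mean_uncertainty(unlabeled)

labeled_logits = [1.5, 0.2, -0.3]
loss_plain = margin_cross_entropy(labeled_logits, label=0, margin=0.0)
loss_margin = margin_cross_entropy(labeled_logits, label=0, margin=lam * u)
```

Under these assumptions, the margin shrinks as the model grows more confident on unlabeled data, so the loss on labeled examples gradually approaches plain cross-entropy.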

research
08/27/2023

Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

Semi-Supervised Learning (SSL) leverages both labeled and unlabeled data...
research
09/21/2023

Bridging the Gap: Learning Pace Synchronization for Open-World Semi-Supervised Learning

In open-world semi-supervised learning, a machine learning model is task...
research
06/28/2021

Rail-5k: a Real-World Dataset for Rail Surface Defects Detection

This paper presents the Rail-5k dataset for benchmarking the performance...
research
08/11/2020

S2OSC: A Holistic Semi-Supervised Approach for Open Set Classification

Open set classification (OSC) tackles the problem of determining whether...
research
02/10/2020

Semi-Supervised Class Discovery

One promising approach to dealing with datapoints that are outside of th...
research
02/04/2020

Introduction to quasi-open set semi-supervised learning for big data analytics

State-of-the-art performance and low system complexity has made deep-lea...
research
01/17/2018

Unseen Class Discovery in Open-world Classification

This paper concerns open-world classification, where the classifier not ...
